Introduction
- "There are three types of lies: lies, damned lies and statistics"
Mark Twain
- 400,000 people die per year from smoking related causes. (Source)
- To put it in perspective there are about 250 million Americans and in 1997 there were
2.3 million deaths. So, about 17% of all deaths are smoking related.
- Who is a smoker?
- How do you know what someone dies of?
- Smokers tend to be poorer, less educated and drink more alcohol than non-smokers. Does
this complicate estimating the the number of smoking related deaths?
- What does smoking related mean?
- Does this mean that if everyone stopped smoking today that annual mortality would
decrease by 400,000?
- What does it mean?
- A black person is nearly 4 times more likely to be executed under death penalty laws
than a white person. (How do you figure?)
- But I thought you just said the majority of people executed were white?
- Why is this so? Does this statistic tell you? (12% of Americans are black but 36% of
executed people are black.)
- If you think that there is a racial bias in death penalty cases which statistic would
you prefer to cite? What if you didn't think there is a bias?
- Motorcycle Helmets (read The Case of No Helmets
(the article discussed on page 13 of the text) and the CDC Head Injury report)
- Dick Teresi states "Some studies indicate that there are fewer motorcycle deaths
per one-million residents in states with helmet laws than in states where helmets are not
mandatory." He says this is a CDC statistic from the years 1979-1986.
- Mr. Teresi thinks this rate should be expressed in number of motorcycle deaths per
10,000 registered motorcycles, not per one-million residents. Using this
criteria he finds that death rates in helmet laws states is lower than in non-helmet law
states (3.05 versus 3.38).
- Which statistic would you use if you were in favor of motorcycle helmet laws? What if
you were against them?
- Simple statistics
- Some statistics are known without error
- Dow Jones index
- IQ
- 611 people have been executed via the death penalty since 1976 (as of 1/11/2000, Source)
- The majority (55%) of those executed under death penalty laws are white
- Some statistics are estimates
- An aspirin a day decreases your risk of heart attack 44% (Source)
- unemployment rates
- polling results
- Descriptive Statistics
- simple statistics, summaries
- graphs
- Inferential Statistics
- uses data collected from samples
to make generalizations about populations
- subjects can be anything: people, manufactured parts, car accidents, etc. (see the
first paragraph of Section 1.3 of the text on page 4)
- A dataset: how data is recorded on paper or in a computer database. The most common type
a dataset is a table with 1 row per subject and one column per variable. The data itself
resides within the table.
- Example: Say we are interested in the average income of 1990 WSU graduates
- Population: all 1990 graduates
- Sample: those graduates we are able to contact
- Variable: yearly income in dollars
- Data: the set of all incomes reported by the sample
- Example: We want to estimate the effect of regular aspirin use on heart attack risk (Physicians' Health Study)
- Population: all Americans
- Sample: 22,071 male physicians
- Variables:
- Data:
 | Whether or not each physician took 325mg aspirin every other day or whether they took a
sugar pill every other day |
 | Whether or not each physician had a heart attack
|
|
- Types of Inferential Statistics
- estimation: estimate of a parameter of a population
- Mean income of a population
- Percent of people that will vote for a certain candidate.
- How much will a person's cholesterol level drop if they take the drug Lovastatin?
- How many people die per year from smoking?
- decisions
- Does regular use of aspirin decrease the risk of heart attacks?
- Is the death penalty racially biased?
- Does prayer increase survival rates?
- Should mammograms be performed annually on all women over 45
- prediction
- Reliability
- Much of statistics focuses on measuring the reliability of inferences
- How reliable are the results of a census?
- Polling data: usually state that poll results are +/- 3%
|
|