Data Files for Class Use
These are data files that we will use in class with JMP. In addition, I've included
datasets that I just find interesting and might use in class in the future. I've documented the
source of the data as well as possible.
To use these data files, you must first download them to your hard drive. Once a file
is on your computer, you can open it from there using JMP. How you download the file
depends on your browser (Netscape, MS Internet Explorer, etc.), but you probably just need
to click on the link once, and your browser should ask you where you want to save the file.
JMP files should be saved with the extension .JMP (<filename>.JMP; nba.jmp, for
example).

NBA player salaries
Player salaries and statistics for the 1997-1998 NBA season. Variables include player
name, team name, games played, minutes played, total rebounds, total points and salary.
Salary data are reportedly from the Dallas Morning News and were found at http://www.nationwide.net/~patricia.
(Patricia seems to have too much time on her hands!)
JMP file (42k)
TXT file (25k)
CEO golf handicaps
An investment compensation expert, Graef Crystal, carried out a study purporting to
show that major companies whose chief executives had low golf scores had high-performing
stocks. Crystal obtained the golf scores from the magazine Golf Digest and used his
own data on the stock market performance of the companies of 51 chief executives. He
created a Stock Rating that scored each company on how well investors who
held its stock did, with 100 being the highest and 0 the lowest.
The last seven observations were excluded as outliers in Mr. Crystal's analysis.
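The effect of such an exclusion can be sketched as follows. This is a minimal illustration in Python, not Mr. Crystal's method: the handicap and rating values are made-up placeholders rather than the real data, and the function names `pearson_r` and `drop_last` are hypothetical. It computes the correlation between golf handicap and Stock Rating with and without the last seven rows.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def drop_last(rows, k=7):
    """Return the rows with the final k observations removed."""
    return list(rows[:len(rows) - k])


# Made-up (handicap, stock_rating) pairs for illustration only, NOT the real data.
rows = [(h, 100 - 3 * h) for h in range(5, 15)] + [(30, 90)] * 7

handicap_all, rating_all = zip(*rows)
handicap_kept, rating_kept = zip(*drop_last(rows))

print("r, all rows:       ", round(pearson_r(handicap_all, rating_all), 3))
print("r, last 7 excluded:", round(pearson_r(handicap_kept, rating_kept), 3))
```

With the real file, the same comparison shows how much the reported relationship depends on the seven excluded observations.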
Data is from the New York Times article "Investing it; duffers need not apply"
available here.
JMP file (3k)
TXT file (2k)
Race and the death penalty
Assignment of the death penalty for 326 defendants in homicide indictments in 20
Florida counties in 1976-1977. The race of the defendant, the race of the victim and
whether or not the defendant received the death penalty are reported.
From "Racial characteristics and the imposition of the death penalty" by M.
Radelet. Amer. Sociol. Rev. 46:918-927. 1981.
Summarized in Agresti's book "Categorical Data Analysis".
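One way to explore a three-way table like this is to compare death penalty rates by defendant's race overall and then within each victim's race. The sketch below is an illustration only. The counts are the ones commonly reproduced from Agresti's summary of Radelet (1981); they sum to 326, but please check them against the JMP file before relying on them. The function name `rate` is hypothetical.

```python
# (defendant_race, victim_race) -> (death penalty: yes, death penalty: no)
# Counts as commonly reproduced from Agresti's summary; verify against the JMP file.
counts = {
    ("white", "white"): (19, 132),
    ("white", "black"): (0, 9),
    ("black", "white"): (11, 52),
    ("black", "black"): (6, 97),
}


def rate(counts, defendant, victim=None):
    """Proportion of defendants of the given race who received the death penalty.

    If victim is given, the rate is computed only for cases with that victim race.
    """
    cells = [
        cell
        for (d, v), cell in counts.items()
        if d == defendant and (victim is None or v == victim)
    ]
    yes = sum(y for y, n in cells)
    total = sum(y + n for y, n in cells)
    return yes / total


for defendant in ("white", "black"):
    print(defendant, "defendants, all victims:", round(rate(counts, defendant), 3))
    for victim in ("white", "black"):
        print("  victim", victim + ":", round(rate(counts, defendant, victim), 3))
```

Comparing the overall rates with the rates within each victim's race shows why the victim's race should not be ignored when reading this table.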
JMP file (1k)
Race and the death penalty II
Race of convicts executed from 1976 through January 11, 2000. From the Death Penalty Information Center.
JMP file (1k)
Mortality data
Data from the CDC on mortality rates due to vehicular deaths, suicide, homicide and
firearm-related deaths. Also included are whether each state has the death penalty and the percent
of the population that lives in urban areas.
Mortality data are from the CDC and are for 1997. Death penalty status is from the Death
Penalty Information Center. Percent urban data are from the US Census and are estimates of
January 1999 populations.
JMP file (4k)
IQ and brain size
Are the size and weight of your brain indicators of your mental capacity? In this study
by Willerman et al. (1991) the researchers use Magnetic Resonance Imaging (MRI) to
determine the brain size of the subjects. The researchers take into account gender and
body size to draw conclusions about the connection between brain size and intelligence.
Willerman et al. (1991) conducted their study at a large southwestern university. They
selected a sample of 40 right-handed Anglo introductory psychology students who had
indicated no history of alcoholism, unconsciousness, brain damage, epilepsy, or heart
disease. These subjects were drawn from a larger pool of introductory psychology students
with total Scholastic Aptitude Test Scores higher than 1350 or lower than 940 who had
agreed to satisfy a course requirement by allowing the administration of four subtests
(Vocabulary, Similarities, Block Design, and Picture Completion) of the Wechsler (1981)
Adult Intelligence Scale-Revised. With prior approval of the University's research review
board, students selected for MRI were required to obtain prorated full-scale IQs of
greater than 130 or less than 103, and were equally divided by sex and IQ classification.
The MRI scans were performed at the same facility for all 40 subjects. The scans
consisted of 18 horizontal MR images. The computer counted all pixels with non-zero gray
scale in each of the 18 images and the total count served as an index for brain size.
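The pixel-count index described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the researchers' software: the function name `brain_size_index` is hypothetical, each image is assumed to be a list of rows of gray-scale values, and the tiny demo images are made up (the study used 18 full MR images per subject).

```python
def brain_size_index(images):
    """Count the pixels with non-zero gray scale across all images."""
    return sum(
        1
        for image in images
        for row in image
        for pixel in row
        if pixel != 0
    )


# Two made-up 2x3 "images" for illustration only.
demo_images = [
    [[0, 12, 7], [0, 0, 255]],  # 3 non-zero pixels
    [[3, 0, 0], [9, 1, 0]],     # 3 non-zero pixels
]

print(brain_size_index(demo_images))
```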
Willerman, L., Schultz, R., Rutledge, J. N., and Bigler, E. (1991), "In Vivo Brain
Size and Intelligence," Intelligence, 15, 223-228.
This dataset is from the DASL archive.
JMP file (3k)
Hodgkin's disease
A total of 538 Hodgkin's disease patients were classified by the type of disease they had and by
their level of response to treatment (none, partial, positive).
There are four forms of Hodgkin's disease:
LP = lymphocyte predominance
NS = nodular sclerosis
MC = mixed cellularity
LD = lymphocyte depletion
The column "Histological Type" refers to the type of Hodgkin's disease and
"Response" refers to the level of response to treatment.
From Brant Deppa's archive
JMP file (1k)
Property taxes
How taxes change in response to changing market value of homes is a question of concern
to citizens as a policy matter as well as a personal financial concern. This dataset
permits an examination of this question for used homes in Albuquerque in 1993. The data
are a random sample of records of resales of homes from Feb 15 to Apr 30, 1993 from the
files maintained by the Albuquerque Board of Realtors. This type of data is collected by
multiple listing agencies in many cities and is used by realtors as an information base.
The dataset contains three variables: PRICE (Selling price in hundreds of dollars),
SQFT (Square feet of living space) and TAX (Annual taxes).
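Because PRICE is recorded in hundreds of dollars, it is easy to misread a fitted slope. The sketch below, in Python, converts the units and fits a least-squares line of TAX on PRICE. It is an illustration under stated assumptions: the three rows are made-up placeholders, not values from the dataset, and the function names are hypothetical.

```python
def price_in_dollars(price_hundreds):
    """Convert PRICE (hundreds of dollars) to dollars."""
    return price_hundreds * 100


def least_squares(xs, ys):
    """Slope and intercept of the least-squares line of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up (PRICE, SQFT, TAX) rows for illustration only, NOT from the dataset.
rows = [(800, 1200, 600), (1000, 1500, 750), (1200, 1800, 900)]

price = [r[0] for r in rows]
tax = [r[2] for r in rows]

slope, intercept = least_squares(price, tax)
print("TAX change per $100 of PRICE:", slope)
print("TAX change per $1 of PRICE:  ", slope / 100)
```

The slope from the raw PRICE column is the change in TAX per hundred dollars of selling price; divide it by 100 to express it per dollar.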
This dataset is from the DASL archive.
JMP file (3k)
Umbilical cord lead levels
These data list the monthly gasoline lead sales in Massachusetts and mean lead
concentrations in umbilical cord blood of babies born at a major Boston hospital over 14
months in 1980-1981.
The variables in this data file are:
Time - month of study (1 - 14)
Mo - month number (1 = Jan, ..., 12 = Dec)
Year - year (1980 or 1981)
Lead - statewide lead sold in gasoline (metric tons)
LIUC - mean lead in umbilical cord blood (micrograms/deciliter)
References:
Rabinowitz et al. 1985. Environmental correlates of infant blood levels in Boston.
Environmental Research, 38:96-107.
Rabinowitz et al. 1984. Lead in umbilical blood, indoor air, tap water, and
gasoline in Boston. Archives of Environmental Health, 39(4):299-301.
From Brant Deppa's archive
JMP file (1k)
Seatbelt usage
The effect of seatbelt usage on severity of injury in car accidents.
From Brant Deppa's archive
JMP file (1k)
Alcohol and tobacco
Data from a British government survey of household spending may be used to examine the
relationship between household spending on tobacco products and alcoholic beverages. A
scatterplot of spending on alcohol vs. spending on tobacco in the 11 regions of Great
Britain shows an overall positive linear relationship with Northern Ireland as an outlier.
This dataset is from the DASL archive.
JMP file (1k)
Aspirin and heart attacks
Data from "Final Report on the Aspirin Component of the Ongoing Physicians' Health Study,"
New England Journal of Medicine 321:129-135, 1989. The numbers of subjects with fatal, non-fatal and no heart attacks
and strokes are reported for the treatment and placebo groups.
JMP file (1k)
Smoking deaths by age class
Number of deaths attributed to smoking by age class. From the Cato Institute's "Lies, Damned Lies and 400,000 Smoking-Related
Deaths" by Robert A. Levy and Rosalind B. Marimont.
JMP file (1k)
TXT file (1k)