Data Files for Class Use

These are the data files that we will use in class with JMP. I've also included datasets that I find interesting and might use in class in the future. I've documented the source of the data as well as possible.

To use these data files, you must first download them to your hard drive. Once a file is on your computer, you can open it from there using JMP. How you download the file depends on your browser (Netscape, MS Internet Explorer, etc.), but you probably just need to click on the link once and your browser should ask you where you want to save the file. JMP files should be saved with the extension JMP (<filename>.JMP; nba.jmp, for example).
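If you would rather look at one of the TXT versions outside JMP, a minimal Python sketch follows. It assumes the file is tab-delimited with a header row, which this page does not state, and the column names and values in the sample are made up for illustration.

```python
import csv
import io


def read_table(text, delimiter="\t"):
    """Parse delimited text with a header row into a list of dicts."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return [dict(row) for row in reader]


# Hypothetical sample standing in for the contents of a downloaded TXT file.
sample = "Player\tTeam\tSalary\nPlayer A\tTeam X\t1000000\nPlayer B\tTeam Y\t250000\n"
rows = read_table(sample)
salaries = [int(row["Salary"]) for row in rows]
print(len(rows), max(salaries))
```

For a real download, read the file's text with `open(path).read()` and pass it to `read_table`, changing `delimiter` if the file turns out to use commas or spaces.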

NBA player salaries

Player salaries and statistics for the 1997-1998 NBA season. Variables include player name, team name, games played, minutes played, total rebounds, total points and salary. Salary data is reportedly from the Dallas Morning News and was found at http://www.nationwide.net/~patricia. (Patricia seems to have too much time on her hands!)

JMP file (42k)
TXT file (25k)


CEO golf handicaps

An investment compensation expert, Graef Crystal, carried out a study purporting to show that major companies whose CEOs had low golf scores had high-performing stocks. Crystal obtained the golf scores from the magazine Golf Digest and used his own data on the stock market performance of the companies of 51 chief executives. He created a Stock Rating for each company based on how well investors who held its stock did, with 100 being the highest and 0 the lowest.

The last seven observations were excluded as outliers in Mr. Crystal's analysis.
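To see how much an exclusion like this can matter, you can compute the correlation with and without the trailing rows. The sketch below uses made-up (handicap, rating) pairs, not Mr. Crystal's data, and drops two rows rather than seven; it assumes the rows to exclude sit at the end of the file, as described above.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def drop_last(rows, k):
    """Return rows without the final k observations."""
    return rows[: len(rows) - k] if k > 0 else list(rows)


# Hypothetical (golf handicap, stock rating) pairs; the last two rows
# play the role of the excluded observations.
rows = [(5, 90), (10, 75), (15, 60), (20, 45), (25, 30), (4, 10), (30, 95)]

r_all = pearson_r([h for h, _ in rows], [s for _, s in rows])
kept = drop_last(rows, 2)
r_kept = pearson_r([h for h, _ in kept], [s for _, s in kept])
print(round(r_all, 3), round(r_kept, 3))
```

Comparing the two values on the real file shows how strongly the reported relationship depends on which observations are treated as outliers.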

Data is from the New York Times article "Investing it; duffers need not apply" available here.

JMP file (3k)
TXT file (2k)

Race and the death penalty

Assignment of the death penalty for 326 defendants in homicide indictments in 20 Florida counties in 1976-1977. The race of the defendant, the race of the victim and whether or not the defendant received the death penalty are reported.

From "Racial characteristics and the imposition of the death penalty" by M. Radelet. Amer. Sociol. Rev. 46:918-927. 1981

Summarized in Agresti's book "Categorical Data Analysis"

JMP file (1k)
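A three-way table like this one is a standard example of how conditioning on a third variable can change a comparison. The sketch below computes death-penalty rates by defendant's race, both overall and within each victim's race. The counts are the ones usually quoted from Agresti's summary of the study (they total 326); they are an assumption here and should be checked against the JMP file.

```python
# (defendant race, victim race) -> (death penalty yes, death penalty no)
# Assumed counts; verify against the JMP file before relying on them.
counts = {
    ("white", "white"): (19, 132),
    ("white", "black"): (0, 9),
    ("black", "white"): (11, 52),
    ("black", "black"): (6, 97),
}


def rate(cells):
    """Proportion sentenced to death across the given (yes, no) cells."""
    yes = sum(y for y, _ in cells)
    total = sum(y + n for y, n in cells)
    return yes / total


total_defendants = sum(y + n for y, n in counts.values())

# Marginal rates, ignoring the victim's race.
marginal = {
    d: rate([c for (dr, _), c in counts.items() if dr == d])
    for d in ("white", "black")
}

# Conditional rates within each victim's race.
conditional = {key: rate([cell]) for key, cell in counts.items()}

print(total_defendants)
for d, r in marginal.items():
    print(d, round(r, 3))
for key, r in conditional.items():
    print(key, round(r, 3))
```

With these counts the marginal rate is higher for white defendants, while within each victim category it is higher for black defendants.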

Race and the death penalty II

Race of convicts executed from 1976 through January 11, 2000. From the Death Penalty Information Center.

JMP file (1k)

Mortality data

Data from the CDC on mortality rates due to vehicular deaths, suicide, homicide and firearm-related deaths. Also included are whether each state has the death penalty and the percent of its population that lives in urban areas.

Mortality data is from the CDC and is for 1997. Death penalty status is from the Death Penalty Information Center. Percent Urban data is from the US Census and is an estimate of January 1999 populations.

JMP file (4k)

IQ and brain size

Are the size and weight of your brain indicators of your mental capacity? In this study by Willerman et al. (1991) the researchers use Magnetic Resonance Imaging (MRI) to determine the brain size of the subjects. The researchers take into account gender and body size to draw conclusions about the connection between brain size and intelligence.

Willerman et al. (1991) conducted their study at a large southwestern university. They selected a sample of 40 right-handed Anglo introductory psychology students who had indicated no history of alcoholism, unconsciousness, brain damage, epilepsy, or heart disease. These subjects were drawn from a larger pool of introductory psychology students with total Scholastic Aptitude Test Scores higher than 1350 or lower than 940 who had agreed to satisfy a course requirement by allowing the administration of four subtests (Vocabulary, Similarities, Block Design, and Picture Completion) of the Wechsler (1981) Adult Intelligence Scale-Revised. With prior approval of the University's research review board, students selected for MRI were required to obtain prorated full-scale IQs of greater than 130 or less than 103, and were equally divided by sex and IQ classification.

The MRI Scans were performed at the same facility for all 40 subjects. The scans consisted of 18 horizontal MR images. The computer counted all pixels with non-zero gray scale in each of the 18 images and the total count served as an index for brain size.

Willerman, L., Schultz, R., Rutledge, J. N., and Bigler, E. (1991), "In Vivo Brain Size and Intelligence," Intelligence, 15, 223-228.

This dataset is from the DASL archive.

JMP file (3k)

Hodgkin's disease

538 Hodgkin's patients were classified by the type of disease they had and by their level of response to treatment (none, partial, positive).

There are four forms of Hodgkin's disease:

LP = lymphocyte predominance
NS = nodular sclerosis
MC = mixed cellularity
LD = lymphocyte depletion

The column "Histological Type" refers to the type of Hodgkin's disease and "Response" refers to the level of response to treatment.

From Brant Deppa's archive

JMP file (1k)

Property taxes

How taxes change in response to changing market value of homes is a question of concern to citizens as a policy matter as well as a personal financial concern. This dataset permits an examination of this question for used homes in Albuquerque in 1993. The data are a random sample of records of resales of homes from Feb 15 to Apr 30, 1993 from the files maintained by the Albuquerque Board of Realtors. This type of data is collected by multiple listing agencies in many cities and is used by realtors as an information base.

The dataset contains three variables: PRICE (Selling price in hundreds of dollars), SQFT (Square feet of living space) and TAX (Annual taxes).
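Because PRICE is recorded in hundreds of dollars, it is easy to get the scale wrong when relating it to TAX. A small sketch of the conversion follows, using made-up rows rather than values from the file, and assuming TAX is in plain dollars (the description above does not state its units).

```python
def price_in_dollars(price_hundreds):
    """Convert PRICE (hundreds of dollars) to dollars."""
    return price_hundreds * 100


def effective_tax_rate(price_hundreds, tax_dollars):
    """Annual tax as a fraction of selling price (TAX assumed in dollars)."""
    return tax_dollars / price_in_dollars(price_hundreds)


# Hypothetical rows: (PRICE, SQFT, TAX)
homes = [(1000, 1500, 800), (2000, 2500, 1500)]
for price, sqft, tax in homes:
    print(price_in_dollars(price), round(effective_tax_rate(price, tax), 4))
```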

This dataset is from the DASL archive.

JMP file (3k)

Umbilical cord lead levels

These data list the monthly gasoline lead sales in Massachusetts and mean lead concentrations in umbilical cord blood of babies born at a major Boston hospital over 14 months in 1980-1981.

The variables in this data file are:

Time - month of study (1 - 14)
Mo - month number (1 = Jan, ..., 12 = Dec)
Year - year 1980 or 1981
Lead - statewide lead sold in gasoline (metric tons)
LIUC - mean lead in umbilical cord blood (micrograms/deciliter)

References:

Rabinowitz et al. 1985. Environmental correlates of infant blood levels in Boston. Environmental Research, 38:96-107.

Rabinowitz et al. 1984. Lead in umbilical blood, indoor air, tap water, and gasoline in Boston. Archives of Environmental Health, 39(4):299-301.

From Brant Deppa's archive

JMP file (1k)

Seatbelt usage

The effect of seatbelt usage on severity of injury in car accidents.

From Brant Deppa's archive

JMP file (1k)

Alcohol and tobacco

Data from a British government survey of household spending may be used to examine the relationship between household spending on tobacco products and alcoholic beverages. A scatterplot of spending on alcohol vs. spending on tobacco in the 11 regions of Great Britain shows an overall positive linear relationship with Northern Ireland as an outlier.

This dataset is from the DASL archive.

JMP file (1k)

Aspirin and heart attacks

Data from the "Final Report on the Aspirin Component of the Ongoing Physicians' Health Study" in the New England Journal of Medicine 321:129-135, 1989. The numbers of subjects with fatal, non-fatal and no heart attacks and strokes are reported for the treatment and placebo groups.

JMP file (1k)

Smoking deaths by age class

Number of deaths attributed to smoking by age class. From the Cato Institute's Lies, Damned Lies and 400,000 Smoking-Related Deaths by Robert A. Levy and Rosalind B. Marimont.

JMP file (1k)
TXT file (1k)

E-mail Mr. Callahan at stat110@edcallahan.com with questions or comments about this web site or about the class itself.

This page was last modified on January 20, 2000.