Data Archive
Welcome to OPR's Data Archive. This page provides a quick
overview of our holdings, and links to other data resources including joint
projects hosted elsewhere.
Jump to what's new, recent additions,
or read about our historic collections.
You can search the archive, and browse links
to other data resources.
What's New
The Success and Failure in Cultural Markets project (overview) was
motivated by puzzling aspects of contemporary cultural markets,
released data from a series of four web-based experiments involving a
total of 27,267 participants. Included in this release are 167 data
files, 48 music files (mp3 format), and detailed documentation. The
experiments were conducted by Prof. Matthew J. Salganik between 2004 and
2007.
The National Longitudinal Survey of Freshmen (NLSF)
has released the wave 4 (Junior in Spring 2002) and wave 5 (Senior in Spring 2003)
public use datasets. Information on participants’ graduation from college is
available in a separate graduation dataset. The two final waves contain similar
information as wave 2 (Freshman in Spring 2000) and wave 3 (Sophomore in Spring 2001),
as well as detailed information on extracurricular group involvement, health and
emotional problems, college debts, future plans for employment, career and higher
education, respondents’ perception of their own/other racial and ethnic groups in
terms of identity, incidences of discrimination and prejudice to name a few. The NLSF
follows a cohort of first-time freshman at selective colleges and universities through
their college careers. Equal numbers of whites, blacks, Hispanics, and Asians were
sampled at each of the 28 participating schools, with nearly 4,000 respondents.
The Fragile Families and
Child Wellbeing Study, being conducted by the Bendheim-Thoman
Center for Research on Child Wellbeing (CRCW), has re-released
Baseline, One-Year, Three-Year Core and Three-Year In-Home files on May 15, 2008.
These files now contain the final versions of the weights.
The Fragile Families and Child Wellbeing Study follows a cohort of nearly
5,000 children born in the U.S. between 1998 and 2000. The study over
samples births to unmarried couples; and, when weighted, the data are representative
of births in large U.S. cities at the turn of the century. The in-home study
collects information on children's cognitive and emotional development, health,
and home environment, and is conducted by the Center for Health and
Wellbeing (CHW).
The Mexican Migration Project (MMP), an ongoing
multidisciplinary study of migration from Mexico to the United States,
has released data for 118 communities, which includes the original 114
communities (MMP114) plus 4 new additional communities. The new 4 communities
are from the state of Morelos, which is a new state for the MMP.
In addition, these 4 new communities have health variables for the
household head and the spouse. MMP118 has now information on
18,804 Mexican households, 922 U.S. households, and individual-level
data on 128,940 persons. These data contain information on 6,848
household heads with migration experience to the U.S. and information
on 47 household heads with Canadian migration experience. Data are
available in three system formats: SAS, SPSS, and STATA. This project
requires registration to download data. Users of the previous Users
of the previous MMP114 data are encouraged to download the new dataset MMP118.
The Latin American Migration Project (LAMP), which extends the MMP
design to a study of migration flows originating in other Latin American countries, has now released
data for Guatemala and a 5th community in Peru. The data for Guatemala contains information on
3 communities interviewed in the Department of Quetzaltenango. The data offers information about
523 households and information on 2813 persons. As for Peru data, it now has information about
822 households and information on 4,467 persons.
The Texas Higher Education Opportunity Project
(THEOP) is a multi-year study that investigates
college planning and enrollment behavior under a policy that guarantees
admission to any Texas public college or university to high school
seniors who graduate in the top decile of their class. THEOP
recently released the senior cohort second wave public use data and codebook.
This release includes information on how to merge the first and second wave
data and the table of similar questions across the waves.
The New Immigrant Survey(NIS) is a panel survey of
a nationally representative sample of new legal immigrants to
the United States. The first full cohort (NIS2003-1) data are now available for download.
Recent Additions
The Little Village Surveys (LVS), a study of a large
Mexican community in the south side of Chicago, include business and household surveys.
The National Migration Study (NMS) in Thailand
conducted by the Institute for Population and Social Research,
Mahidol University, Thailand.
The PCP Archive-- maintained
by the Central American Population Program
in San José, Costa Rica and OPR-- is
a growing collection of fertility
and health surveys conducted in Central America and Panama.
Historic Datasets
A collection of ten American Fertility Surveys
covering a 35-year period, and including four surveys
designed and nurtured at OPR.
Datasets connected with the Princeton
European Fertility Project,
including the famous Hutterite
fertility data first analyzed by Mindel Sheps and later used to establish
standards for the analysis of the European fertility decline.
U.S. Cohort and Period Fertility Tables 1917-1980,
produced by the National Institute of Child Health and Development, National
Institutes of Health, compiled by Robert L. Heuser.
Population and death statistics tables from developing
countries amassed by the Organisation of Economic Co-operation
and Development (OECD).
The World Fertility Survey (WFS),
a collection of high-quality, internationally comparable
surveys of human fertility conducted in 41 developing countries in the
late seventies and early eighties.
OPR has many more datasets of interest to researchers.
We have a complete list
of all our studies, or,
you can use this form to search all the holdings of the OPR Data Archive. Enter your search terms in the box below.
Phrases should be enclosed in quotes.
Other Resources
The Social Environment and Biomarkers of Aging Study (SEBAS) is an unusually rich,
population-based longitudinal study focusing on the health and well-being of older
persons in Taiwan. SEBAS explores the relationship between life challenges and mental
and physical health, the impact of social environment on the health and well-being of
the elderly, and biological markers of health and stress. For more information about
SEBAS, a joint project of Georgetown University's Center for Population and Health (CPH)
and OPR, visit the project's page at
CPH.
Public use data from the project are
available at ICPSR under
study 3792.
Princeton University's Data Library is maintained by Data and Statistical
Services (DSS), part of
Firestone Library's Social Science Reference Center.
The DSS collection includes more than 24,000 machine-readable data
files, with emphasis on Census Data and the Current Population
Survey.
DSS has a Data Librarian who can assist you in finding the data you
need, and Statistical Consultants who can provide assistance in the
use of statistical packages. Check their site for consulting
hours.
InterUniversity Consortium of Political and
Social Research (ICPSR)
If you can't find the data you need at Princeton, the next step is
the ICPSR Archive
at the University of Michigan.
ICPSR provides a nice web interface for browsing their holdings
in several areas of interest to social and political scientists,
and has a special section with recent additions. Some holdings
have restricted access. Please contact the DSS Data Librarian
if you need to access some of these restricted datasets.
|