Project 90 (Partial Data)
Project 90 was a prospective study of the influence of network structure on the dynamics of HIV transmission in a community of high-risk heterosexuals. The data was collected between 1988 and 1992 in Colorado Springs, CO, and the project was funded by the Centers for Disease Control and Prevention (CDC). For more details of the Project 90 study, please see the references below.
Stephen Muth and John Potterat kindly provided the data to Sharad Goel and Matthew Salganik in 2007, and it was later used in their paper, S. Goel and M. J. Salganik (2010) "Assessing respondent-driven sampling" Proceedings of the National Academy of Sciences (PNAS). The release of these data allows others to replicate the analyses of Goel and Salganik.
Included in this release are two tab-separated files, edges.tsv and nodes.tsv, that describe the structure of the Project 90 network and the individual-level attributes of study participants.
edges.tsv: Each row indicates an edge in the network, specified by a pair of node ids. Edges represent social, sexual, and/or drug affiliation. Each edge is recorded twice. In other words, if there is an edge between 12 and 15, there is also an edge between 15 and 12. There are 43,288 edges in the file.
nodes.tsv: Each row corresponds to a study participant. In addition to node id, the following attributes are listed for each individual:
Missing values are denoted by 'NA' and there are 5,492 individuals in the file. Please direct any questions to Prof. Matthew Salganik.
readme: In addition, this text file gives one-way frequencies of all the individual attributes.
To access these datasets, please login or register as a user of the data archive.
Office of Population Research, Princeton University, Wallace Hall, Princeton NJ 08544-1005
Phone: (609) 258-4870 Fax: (609) 258-1039 Email: webmaster