August 2008
Welcome to the ICPSR Bibliography of Data-related Literature, a searchable database that contains over 45,000 citations of known published and unpublished works resulting from analyses of data held in the ICPSR archive. Developed with support from the National Science Foundation (SES-9977984), the Bibliography represents over 40 years of scholarship in the quantitative social sciences, extending from the inception of ICPSR in 1962 to the present.
In most cases, analyses of numeric data formed the foundation for the literature, although in some cases the data played a less prominent role in supporting the research. The literature includes journal articles, books, book chapters, government and agency reports, working papers, dissertations, conference papers, meeting presentations, unpublished manuscripts, magazine and newspaper articles, and audiovisual materials.
This is an ongoing project that is updated regularly. Additions and corrections are welcome. If you have published work that is based on ICPSR data, please send us the citation using our online Bibliography Citation Submission form.
Please send comments or questions via our Bibliography Comments form.
The Bibliography of Data-related Literature is actually a collection of smaller bibliographies based on individual studies archived at ICPSR. It can be accessed through two avenues on the ICPSR Web site:
To find literature based on a particular study, search for that study using the search engine on our Data page. On the data search results page (below the study title) and at the top of each study description is a link called "Related Literature". This links to a list of publications based on that study and replaces the former ICPSR field called "Related publications".
The Bibliography is also a searchable database. From the Bibliography Home page, a number of options are available: Search for authors, titles, source titles (e.g. journal, book, or conference titles), or publishers (includes sponsoring agencies and organizations), Browse by Author, or Browse by Journal Title. Advanced Search allows more complex searching of the citations, using Boolean search operators (and/or/not) for field specific searches and limiting searches to particular types of publications, such as Conference Proceedings only. On the search results pages, under each citation, is a link to related data in the ICPSR archive. These are the data collections used by the authors to generate their findings. When a publication is freely available online, the citation includes a link to that document.
The purpose of this project is to facilitate the use of the ICPSR holdings by providing a means to investigate previous research based on ICPSR data. Instructors often direct students to begin data-related research projects by reading some of the major works based on the data. Advanced researchers also find reading the existing literature to be an effective way to begin using a dataset. Reporters and policymakers looking for processed statistics look for reports explaining particular studies. Principal investigators and funding agencies often want to track how data are used after they are deposited.
The Bibliography makes it possible to:
Our goal is to collect accurate citations of publications making reference to data archived and distributed by ICPSR. The outcomes of the project, when it is complete, will include:
While items 1 and 2 have already been accomplished, we continue to add to the citations in the database. The vehicles for collecting, organizing, and displaying citations are functioning, and we are in the process of developing more effective database mechanisms for managing the Bibliography.
When this project began in July 2000, we identified its goals and developed a plan to systematically find and gather citations. This task involved developing a collection policy, selecting databases and print publications to search and/or browse, selecting a method for collecting and organizing citations, determining search strategies, and developing mechanisms for providing online access to the citations.
In developing the collection policy, it was helpful to think about what the uses of the collection would be. Primarily, this bibliographic collection is intended to allow researchers to trace uses of data for a variety of purposes. It is also meant to enable researchers to replicate others' findings and to demonstrate to funding agencies the varied uses of data. With these considerations in mind, we decided to capture citations to publications that met at least one of the following criteria:
Since some datasets are available from multiple archives or data providers and we could not always verify that the source of the data used was ICPSR, we made the determination that ICPSR need not be the actual source.
References to all types of literature and media are being collected. Thus far, the collection includes journal articles, books, book chapters, government and agency reports, working papers, dissertations, conference papers, meeting presentations, unpublished manuscripts, audiovisual materials, and electronic resources.
Our primary resource for searching has been the University of Michigan (UM) Libraries Networked Electronic Resources. This online collection is available to UM affiliates and includes a comprehensive selection of full-text databases and index and abstract services in the social sciences and other disciplines. We have made use of the UM Libraries' excellent collection of print publications and its inter-library loan service to browse materials not available online. We have also conducted Internet searches for various studies and publications.
In order to collect and organize citations, we decided to purchase database software that was specifically designed for collecting bibliographic citations. Our primary selection criteria included:
After comparing several options, we selected Biblioscape, which was the only software that allowed multiple simultaneous users and met the other requisites.
ICPSR was established in 1962 and presently archives over 5,000 studies in the social and behavioral sciences, covering a broad array of topics including: elections, education, mental health, criminal justice, aging, gender, race, religion, medical care, war, family, and geography. Given the 40-year time period and the large number of datasets and topics involved, the task of locating related publications was monumental, and we realized at the outset that we could never locate every single publication based on ICPSR data. Yet we wanted to cast a wide enough net to capture as many relevant citations as possible during the allotted three-year grant period. This project involved developing effective search strategies to strike a balance between comprehensiveness and precision. It also meant setting priorities in selecting search resources and formulating search queries.
We decided to use a two-pronged approach, whereby we would search online resources for specific study titles and also identify a list of core journals to browse, either in print or electronic format, for references to any ICPSR dataset. Using the resources available through UM Libraries, we began searching online databases and putting together a list of core journals in subject areas relevant to ICPSR data. Below is a summary of the searches that have been conducted thus far and lists of the electronic and print resources searched and browsed.
Initial searches:
In order to familiarize ourselves with the content and search capabilities of the available
online citation, abstract, and full-text resources, we began searching for the following
terms/phrases:
The following databases were searched (the number of hits is in parentheses):
Most of these searches yielded relevant hits, and we thus began the process of building the collection. Using the results of these searches, as well as recommendations from ICPSR staff, we began to identify the core journals that publish articles based on ICPSR data.
Study Titles:
We searched for the titles, partial titles, and variant titles of studies in the ICPSR archive.
The databases searched are described below. We are using spreadsheets to record the details of
each title search, including study titles, specific databases searched, dates searched, search queries, and
number of hits. To date, we have searched for thousands of titles.
P.I. names:
Authors do not always include the titles of datasets used in published works, or they may use a
variation of the ICPSR title. We made every attempt to formulate queries that would account for
variant versions of study titles. However, we were not always successful due to limitations of
database search mechanisms, spelling errors, and the mere absence of a reference to the name of
the dataset used in a published work. Therefore, in cases where no hits were generated when
searching for variations of a study title, the P.I. names were searched for in the relevant
databases. This occasionally yielded positive results.
The following online databases are being searched for a variety of study titles. Databases were selected based on the relevancy of their contents to the subject and time period of the study being searched. Therefore, every database is not searched for each study. Databases are listed by order of the number of citations found.
We identified a list of core journals in the fields of political and social science, health care, and criminal justice, based on their known relevancy. We also analyzed the references we found through our online searches in order to identify those that contained the most articles based on ICPSR data. We began browsing through articles in the print and online versions of these core journals, looking for references to ICPSR data. From 1974-1984, ICPSR staff had browsed through a number of core journals in the political and social sciences to capture citations to ICPSR data-related articles. For our search, we are omitting those journals in that time period since those citations have already been included in the Bibliography. We are making extensive use of full-text databases for browsing electronic versions of these journals. In particular, we've used JSTOR and Proquest, also ScienceDirect and Periodicals Contents Index (PCI) Full Text. Otherwise, we are browsing the print versions of these journals.
Citations were gathered from other sources as well. When a dataset is deposited with ICPSR, the Principal Investigator is asked to submit citations of publications based on the data. Since 1962, several thousand citations have been submitted in this way. Similarly, researchers requesting ICPSR data are asked to submit copies of work they publish based on that data. Government agencies usually submit a list of reports and other publications related to the data they deposit. These references were previously included in the "related publications" field of study descriptions and have been added to the Bibliography database.
In addition, staff of many of the large ongoing data collections in the social sciences maintain their own bibliographies of works related to their data. We conducted Web searches for major study titles using Google and found a number of these bibliographies, most of which have been included in the Bibliography. The Panel Study of Income Dynamics, Asset and Health Dynamics Among the Oldest-Old (AHEAD), and the Health and Retirement Study (HRS) contributed their citations in a tagged format for inclusion in the ICPSR Bibliography. Smaller online bibliographies were added manually, such as the World Values Survey bibliography. We have provided links to larger bibliographies that are not available in tagged or delimited format and are too large to manually input, such as those for the General Social Survey and the National Longitudinal Surveys.
We also stumbled upon many citations serendipitously. While browsing articles for references to ICPSR data, we often came upon references to other publications that used ICPSR data. In the process of conducting searches or browsing journals, we occasionally discovered bibliographies in books, journals, codebooks, and other publications. Some bibliographies were brought to our attention by researchers and librarians. All of these citations have been included in the database.
We originally collected citations in Biblioscape, a bibliographic database application. In order to link them to the ICPSR Web site, the references were exported from Biblioscape in a delimited format, then reorganized using perl scripts and imported into Oracle tables, from which they are displayed on the ICPSR site. We now enter and manage all the citations in Oracle, using a web-based data entry, management, and reporting tool developed at ICPSR.
We created search interfaces and displays to enable users to locate citations by linking through dataset descriptions or by conducting searches for specific publications. Enhanced search, display, and navigation tools are currently being developed.
Direct links to full-text articles and local library resources are now available to ICPSR Direct members using the Bibliography. Users at institutions that have signed up for this service see an additional link underneath each journal article citation ("Check full-text availability"). This link takes the user to the full-text of the article if it is available at their local institution, and to other local library services in case the article is not available digitally. These links are made possible using OpenURL technology. To sign up for OpenURL linking, the Official Representative at an ICPSR Direct member institution must send the required information to ICPSR using the online Full-Text Linking Information form.
Exporting individual citations is now possible. Underneath each ICPSR citation, there is an "Export citation" link. Clicking the link lets a user export directly into reference management software like EndNote or online tools like Zotero (a free extension to the Internet browser, Firefox).
The Bibliography's citations are exported in RIS file format, a tagged format for expressing bibliographic citations. It is compatible with most reference management software. For advice about exporting ICPSR's RIS-formatted citations, users can click the "information" button next to the citations export link. Users should check their reference software for specific instructions on importing citations in RIS format.
We have only begun to understand the uses of this collection and anticipate that researchers will discover further applications as it continues to grow. We look to developing standards for citing data as a means to facilitate the process of identifying works based on social science data. Our hope is that the existence of the Bibliography itself will inspire researchers to notify us when they use data acquired from ICPSR. These factors, combined with ongoing searches of online and print resources, will continue to expand this valuable social science research tool.
Creating this bibliography and making it accessible is a collaborative task involving many people who have contributed in a variety of ways. The following people in particular have assisted in this task:
Careers | Contact Us | Privacy Policy | Official Representative Site | Accessibility
© 2007 Regents of the University of Michigan. ICPSR is part of the Institute for Social Research at the University of Michigan.
