Using ICPSR's New Deposit Form

Beginning in October 2025, ICPSR is introducing a new and improved deposit form — called “single stream” — for sharing data with ICPSR. Use this page as a field-by-field guide for describing your data.

Create Your Citation

Use these fields to describe how users should cite your study.

Title

The official title that describes what the data collection is about, its geographic scope, and the time period it covered. For more information, see Title in the ICPSR Metadata Documentation Portal.

Use a clear, specific title that includes:

What the study is about (topic, population, or method). Capitalize major words and avoid vague titles like “Survey Data” or “Research Project.”
Where the study covers (city, state, country, or Global if truly worldwide).
What time period the study covers (e.g., 1999; 2001-2003; 1999, 2010, 2015; include months only to distinguish repeated releases). Match the time period to what the data describe (not necessarily when they were collected).

Bridge of Faith: Aim4Peace Community-Based Violence Prevention Project, Kansas City, Missouri, 2014-2017
Health and Relationships Project, United States, 2014-2015
Targeted Interventions to Prevent Chronic Low Back Pain in High Risk Patients: A Multi-Site Pragmatic Randomized Controlled Trial (TARGET Trial), 4 U.S. cities, 2016-2019
Aid Like A Paycheck (ALAP), Texas and California, 2014-2017
COVID-19 Disruptions Disproportionately Affect Female Academics, Global, 2020

Alternate Titles

The alternate name(s) or acronym(s) commonly used to refer to the data collection. For more information, see Alternate Titles in the ICPSR Metadata Documentation Portal.

Enter an alternate title for the data collection if users might search for it using an acronym or shortened version of the name.

Add Health Parent Study
FACES 2009
Survey of Consumers
Eurobarometer 85.2

Principal Investigators

The key people or organizations responsible for the data collection, listed by importance. Each data collection requires at least one PI, either a person or an organization. For more information, see Principal Investigators in the ICPSR Metadata Documentation Portal.

When the PI is a person:

Use “Search for a Person” to enter the name, affiliation, and ORCID ID that appear in the person’s ORCID profile.
If you can’t find an ORCID ID for a PI, enter their full name the way it appears in publications or their curriculum vitae. Enter middle names and initials in the Given Name field and suffixes (e.g. Jr., III) in the Family Name field.
Enter the PI’s affiliation as it appears in the Research Organization Registry (ROR). If the organization doesn’t have a ROR ID, use its full name, avoid acronyms, and do not include departments or colleges.
Enter a PI’s affiliation at the time the research was conducted.

When the PI is an organization:

Enter the organization as it appears in the Research Organization Registry (ROR). If the organization doesn’t have a ROR ID, use its full name and avoid acronyms.

ICPSR encourages using ORCID and ROR whenever possible and may revise Principal Investigator fields to add these values.

People:

Given Name (First Name): Robert J.
Family Name (Last Name): Shiller
ORCID: 0000-0002-0399-662X
Affiliation: Yale University; MacroMarkets
Given Name (First Name): Claudia
Family Name (Last Name): Goldin
ORCID: 0000-0003-3842-1604
Affiliation: Harvard University
Given Name (First Name): Miner P.
Family Name (Last Name): Marchbanks III

Organizations:

Funding Sources

The sources of funding that supported the data collection. For more information, see Funding Sources in the ICPSR Metadata Documentation Portal.

Whenever possible, enter the organization’s name as it appears in the Research Organization Registry (ROR).
Enter the name without any organization hierarchy, for example, “National Institute on Aging” instead of “United States Department of Health and Human Services. National Institutes of Health. National Institute on Aging”.
If the organization doesn’t have a ROR, use its full name and avoid acronyms.
Whenever possible, provide a grant number for the funding award. If one exists, you can also provide a URL, preferably a persistent one like a digital object identifier (DOI).

Organization: Robert Wood Johnson Foundation
Funding Identifier: MDR-8550085
Funding Identifier: MDR-8550204
Organization: Bureau of Justice Statistics
Funding Identifier: SES-1835721
Organization: United States Department of Education

Make Your Study Findable

Use these fields to describe your data collection in ways that will help users discover it.

Summary

A description of the data collection that helps users understand its purpose, substance, and key topics. For more information, see Summary in the ICPSR Metadata Documentation Portal.

Clearly describe what the study is about – even if this information is already in your study documentation.
Write in the third person.
Use past tense to describe how the data were collected. Use present tense to describe the data themselves.
Avoid discussing potential uses, target audiences, or the value of the data.

For a few good examples, see these studies:

Chicago Public Schools “Connect and Redirect to Respect” (CRR) Program, Illinois, 2015-2018 (ICPSR 37180)
Health and Relationships Project, United States, 2014-2015 (ICPSR 37404)

ICPSR Subject Terms

A controlled list of social science terms maintained by ICPSR and used to indicate topics related to the data collection. For more information, see ICPSR Subject Terms in the ICPSR Metadata Documentation Portal.

Choose subject terms from the ICPSR Subject Thesaurus that describe the topic of your data.
Considerations for choosing subject terms:
- What is the study about?
- What would an ICPSR user interested in these data search for?
- What subject terms are used for similar studies in ICPSR’s catalog?
You might need to combine terms to describe your topic, for example, “mothers” and “health” for “maternal health.”
You can also add subject terms in two other fields: JEL Classification Codes (for economics topics) and MeSH Terms (for health and biomedical topics).
Do not add terms for concepts that are described in other fields, like geography and methodology.
If existing terms do not adequately describe the study’s topic, email suggested new terms to [email protected] for possible addition to the ICPSR thesaurus. Most new terms come from the Library of Congress Subject Terms or other social science controlled vocabularies.

To explore all possible values and how they relate to one another, see the ICPSR Subject Thesaurus.

Journal of Economic Literature (JEL) Classification Codes

Classification codes used to categorize economic research. For more information, see JEL Classification Codes in the ICPSR Metadata Documentation Portal.

Add JEL classification codes if you are depositing your data with the Western Economic Alliance International (WEAI) Data and Code Repository or the American Economic Association (AEA) Data and Code Repository.
You may also add JEL classification codes to data related to economic research deposited in any archive.

For a list of all possible codes and how they relate to one another, see the JEL Classification Codes from the American Economic Association.

A12: Relation of Economics to Other Disciplines
B00: History of Economic Thought, Methodology, and Heterodox Approaches
N22: Economic History: Financial Markets and Institutions: U.S.; Canada: 1913-

Medical Subject Headings (MeSH) Terms

Biomedical and health-related terms from the National Library of Medicine that describe the data collection’s topics. For more information, see Medical Subject Headings (MeSH) Terms in the ICPSR Metadata Documentation Portal.

Add MeSH terms if your data are related to health or health care, including clinical care, mental and behavioral health, and public health.
Health conditions, medicines, health care occupations, or disciplines related to your data are good candidates for MeSH terms.

To search all possible values, and to see their definitions and relationships to one another, see Medical Subject Headings.

Time Periods

The time period(s) to which the data refer, regardless of when the data were collected. For more information, see Time Periods in the ICPSR Metadata Documentation Portal.

Specify the time periods covered by the data, rather than the dates of data collection.
Use the Start Date and End Date fields to specify the years and, optionally, months and days, covered by the data.
If the study title contains years, they should match the years in the Time Periods dialog box.
Use the Time Frame field only if needed to add context or when the date cannot be expressed exclusively in numbers.

2008 – 2016
2023-07 – 2024-06
Time Frame: Fiscal Year 2024
2020-03-13 – 2023-05-11
1990
Time Frame: Wave 1

Nationally Representative Sample

Indicates whether the data collection uses a sampling design intended to represent the demographics, behaviors, and/or characteristics of the entire nation. This typically involves probability-based methods that allow generalization. It does not include convenience samples that appear similar to the nation by chance. For more information, see Nationally Representative Sample in the ICPSR Metadata Documentation Portal.

Select a value only for studies that use a probabilistic study sample designed to represent an entire nation.
Describe the sampling strategy in more detail in the Sampling Note and Sampling Procedures fields.
If your data do not use a sample or you are unsure, leave this blank.

Possible values for this field are yes, no, and [blank].

Examples of studies with nationally representative samples:

CRELES: Costa Rican Longevity and Healthy Aging Study – Wave 1, 2005 (Costa Rica Estudio de Longevidad y Envejecimiento Saludable) (ICPSR 26681)
How Couples Meet and Stay Together (HCMST) 2017, 2020, 2022, United States (ICPSR 38873)
Well-Being and Basic Needs Survey, United States, 2021 (ICPSR 38759)

Geographic Coverage Areas

The geographic locations where the data refer or are related. For more information, see Geographic Coverage Areas in the ICPSR Metadata Documentation Portal.

Select the country, state, city, county, region, or continent covered by the study.
Spell out place names completely instead of using acronyms. For example, enter “United States” instead of “USA.”
Type at least four characters to see matches.
Choose only the narrowest level of geographic coverage. For example, if you select “Los Angeles, California, United States,” do not also add “California, United States” and “United States.”
For studies with participants from around the world or that are applicable everywhere, select “Earth.”

Possible values come from the GeoNames database, primarily administrative boundaries and populated places (feature classes A and P). To explore all possible values, see GeoNames.

Smallest Geographic Unit

The smallest geographic unit (e.g., state or census tract) used in the dataset. For more information, see Smallest Geographic Unit in the ICPSR Metadata Documentation Portal.

Choose a value only if the data contain a geographic variable that can be used for analysis, for example, the state or county where participants live.
Do not select a value if the data are all from the same place and not further divided up by geography. For example, if the study took place in New York City, but there are no other geographical variables in the data, leave this field blank. Do not select “City.”
You can enter geographic units that do not appear in the drop-down list. Do this only when the data contain a specific geographic variable that does not appear in the list, such as local court districts or school catchment areas.

For a list of all possible values and their definitions, see the Smallest Geographic Unit controlled vocabulary.

State
Census tract
Precinct

Study Context

Use these fields to help others understand your data collection and how it was conducted.

Study Design

The procedures used to contact participants and gather data. For more information, see Study Design in the ICPSR Metadata Documentation Portal.

Use this field, if needed, to provide more information about the study design than the summary.
Possible topics include survey design, interview methods, how data were obtained, and follow-ups to respondents.

Data on organizational culture in each of the 12 courts (Part 1) were obtained by administering the Court Culture Assessment Instrument (CCAI) to all judges with a felony criminal court docket and to all senior court administrators. A total of 224 respondents completed the questionnaire. The CCAI was used to assess five key dimensions of current court culture orientation: (1) dominant case management style, (2) judicial and court staff relations, (3) change management, (4) courthouse leadership, and (5) internal organization. The determination of what culture judges and court administrators desired to establish in the near future was also obtained through the application of the same instrument (CACI) as practitioners were asked to indicate the type of culture in each work area (or content dimension) they would like to see in their court in the next five years.

Universe

The total group of persons or other entities (e.g., households or organizations) that were the object of research and to which analytic results refer. For more information, see Universe in the ICPSR Metadata Documentation Portal.

Describe the study’s target population in a short phrase or sentence.
Identify the characteristics that determine inclusion in the study. For studies about people, this might be age, nationality, residence, or demographic characteristics.
The universe can also consist of things other than people, such as households, organizations, or geographic units.

All households in the United States with phones.
Part 1: Thirty cities in Massachusetts during 1980-1986. Parts 2-4: All residents in Massachusetts during 1986.
Individuals self-identified as transgender, trans, genderqueer, non-binary, or other identities on the transgender identity spectrum aged 18 and older residing in the fifty U.S. states, the District of Columbia, American Samoa, Guam, Puerto Rico, and U.S. military bases overseas.
All publicly funded medical examiner and coroner offices.
Uncertified ballots for the 2000 United States presidential election in Florida.

Time Methods

The methods used to collect data over time, like snapshots at one point (cross-sectional) or repeatedly (longitudinal) to study changes or trends. For more information, see Time Methods in the ICPSR Metadata Documentation Portal.

Use this field to indicate if and how the data collection repeated over time.
You can enter time methods that do not appear in the drop-down list. Do this only when the data contain a specific time method that does not appear in the list.

For a list of all possible values and their definitions, see the DDI Alliance Controlled Vocabulary for Time Method.

Cross-section: Select this value for data collections that took place at a single point in time. (For repeated cross-sectional data collections, use “Longitudinal: Trend/Repeated cross-section” instead.)
Longitudinal: Panel: Select this value for repeated data collections that use the same sample, for example, surveys of the same group of respondents over time.
Time series: Select this value for repeated data collections, usually of measurable phenomena such as crime rates or gross domestic product.

Methods: Sample Design

Data collections are often based on a sample, or subset of the population being studied. Use these fields to describe the sample for your data collection.

Units of Analysis

The object(s) of analysis for the data collection, such as an organization, individual, or household. For more information, see Units of Analysis in the ICPSR Metadata Documentation Portal.

In quantitative data, units of analysis often correspond to rows or observations in a dataset.
You can enter units of analysis that do not appear in the drop-down list. Do this only when no applicable option is available. For example, if the unit of analysis in your data collection is college students, choose “Individual”; do not enter “college students.”
You can choose more than one unit of analysis. For example, if data about individuals contain multiple observations over time, choose both “Individual” and “Time Unit.”

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Units of Analysis.

Individual: Select this value for data collected about the characteristics of individual people, such as their opinions or health state.
Event/Process/Activity: Select this value for data focused on events or activities that people or organizations engage in, such as elections or infections disease cases.
Household: Select this value for data collected about all individuals occupying the same living space, whether or not they are related.

Sampling Procedures

The type(s) of sample and sample design used to select survey respondents to represent the population. For more information, see Sampling Procedures in the ICPSR Metadata Documentation Portal.

Use this field to describe how the units of analysis in your data were selected from the universe or population of interest.
You can describe the sampling strategy in more detail in the optional Sampling Note field.
You can enter sampling procedures that do not appear in the drop-down list. Do this only when the data collection uses a specific, known sampling strategy that does not appear in the list.
If the sampling procedure resulted in a nationally representative sample, indicate this in the Nationally Representative Sample field.

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Sampling Procedures.

Total universe/Complete enumeration: Select this value when the sample contains all units within the target population, for example, census data.
Probability: Systematic random: Select this value when the sample contains every unit at a fixed interval, for example, every tenth record in a database.
Non-probability: Availability: Select this value for samples based on opportunity or convenience, for example, individuals who volunteered to take a survey.

Sampling Note

Supplemental information about the sampling process that does not fit neatly into the Sampling Procedures field. For more information, see Sampling Note in the ICPSR Metadata Documentation Portal.

Briefly describe the methodology used to select participants from the universe or population of interest.
In addition to completing the Sampling Note, use the Sampling Procedures field to describe the sampling strategy using standardized language.
A detailed discussion of such things as sampling error or other limitations of the sampling methodology is not required here.

National sample of telephone numbers from cell (RDD) sampling frame.
The probability sample selected to represent the universe consists of approximately 71,000 households.

Weight

The weight variables and the criteria for using them in data analysis, or other information about how the data are weighted if no weight variables are present. For more information, see Weight in the ICPSR Metadata Documentation Portal.

Give the name of any weighting variables in the data and explain when they should be used.
If weights are described in documentation provided with the data, indicate where that information can be found.

Both the TransPop and Cisgender datasets have the same variable named WEIGHT as the weighting variable. The combination datasets have a set of three weight variables (WEIGHT_TRANSPOP, WEIGHT_CISGENDER, WEIGHT_CISGENDER_TRANSPOP).
A weight variable with two implied decimal places has been included and must be used in any analysis.

Response Rates

The percentage of respondents in the sample who participated in the data collection. For more information, see Response Rates in the ICPSR Metadata Documentation Portal.

Fill out this field only if the data were collected with a survey instrument and the response rates are provided.
For longitudinal data, describe the response rate across waves of data collection.

The overall response rate for this survey was 20.22 percent; 72.6 percent for existing panelists and 10.4 percent for new panelists, using AAPOR Response Rate 1.
Of the 1,843 Midlife in the United States (MIDUS) respondents that researchers attempted to contact, 1,483 agreed to participate (8 percent refused participation and 11 percent either moved or were difficult to contact), yielding a response rate of approximately 81 percent.

Methods: Collection Design

Use these fields to describe how the data were collected, its sources, and key variables and scales.

Data Source Types

The source(s) of the data as collected by the Principal Investigators. For more information, see Data Source Types in the ICPSR Metadata Documentation Portal.

Use this field to describe what the data are derived from, not their current form.
People, things, and other data can all be data source types.
You can enter data source types that do not appear in the drop-down list. Do this only when no applicable option is available; select from the list whenever possible. For example, if the data are derived from birth certificates, choose “Registers/Records/Accounts: Administrative”; do not enter “birth certificates.”
You can provide more information about the data sources in the External Data Sources field.

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Data Source Types.

Communications: Public: Select this value for data collected from public communications, such as social media posts, press releases, or websites.
Events/Interactions: Select this value for data collected from one-time occurrences, for example, court cases or elections.
Registers/Records/Accounts: Medical/Clinical: Select this value for data collections based on health-related data collected in a health care setting, for example, blood pressure readings or diagnoses of health conditions.

External Data Sources

The source of the data, when that source is external to the data collection and can be independently cited. For more information, see External Data Sources in the ICPSR Metadata Documentation Portal.

Cite any websites, datasets, books, journal articles, or other sources used to compile the data.
At minimum, include the title, author, publication year, and journal (if applicable) of each external data source. Optionally, describe how each data source was used to create or compile the data.
For online sources, include the URL to access the data. A persistent identifier such as a digital object identifier (DOI) is preferable whenever possible.
Any citation format is accepted.

‘Voting Scores.’ Congressional Quarterly Almanac 33 (1977), 487-498
Multi-Resolution Land Characteristics Consortium. “National Land Cover Database (CONUS), All Years,” 2016. https://www.mrlc.gov/data/nlcd-land-cover-conus-all-years
Bureau of Transportation Statistics. “NTM Stops,” April 2018. https://geodata.bts.gov/datasets/usdot::national-transit-map-stops/about.

Collection Modes

The method(s) or procedure(s) used to collect the data. For more information, see Collection Modes in the ICPSR Metadata Documentation Portal.

Use this field to indicate how data were collected using standardized terminology.
Select the most specific value that applies to the data. For example, if data were collected via computer-assisted telephone interview, select “Telephone interview: Computer-assisted (CATI)” rather than “Telephone interview” or “Interview.”
You can enter collection modes that do not appear in the drop-down list. Do this only when no applicable option is available.
For studies that collected data in more than one way, select all collection modes that apply.

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Mode of Collection.

Face-to-face interview: Computer-assisted (CAPI/CAMI): Select this value for in-person surveys in which an interviewer asked questions and entered responses into a computer or mobile device.
Measurements and tests: Physical: Select this value for data collections derived from physical measurements or medical tests, for example, blood pressure readings.
Computer-based observation: Select this value for data collected on users’ interaction with technology or software, such as counts of clicks and downloads on a website.

Collection Dates

The date(s) data collection took place. For more information, see Collection Dates in the ICPSR Metadata Documentation Portal.

Collection Dates refer to when data collection occurred, while Time Periods describe when the data refer for. For example, for data about population growth between 1950 and 2000 collected in 2020, the collection date would be 2020.
Enter dates in YYYY-MM-DD, YYYY-MM, or YYYY format.
Use the Collection Time Frame for non-numeric dates (e.g., Fall 2018) or to provide context on the collection date (e.g. “Wave 3” in a longitudinal data collection). Do not restate the date in words.

2018 – 2018
Time Frame: Wave 1
2020-10 – 2020-10
Time Frame: Wave 2
2003-01-01 – 2003-12-31

Variable Description

Significant variables (particularly demographic variables) in the data files. For more information, see Variable Description in the ICPSR Metadata Documentation Portal.

Use this field to provide more detailed information than the Summary, including a review of variables that are important for users to know about.
List the key topics or groups of variables covered in the data collection.

The data includes variables about participants’ and their parents’ moods, interviewer observations, families’ activities, families’ health history, participants’ school records, and parents’ substance use. Demographic variables include race, religion, annual household income, and the participants’ parents’ employment statuses.
The LGBTQ Hate Crimes Interviews dataset contains more in-depth information, including victim demographic information, substance abuse history, information on whether the victim is open about their LGBTQ identification, the victim’s job status, and information about how the victim reacted to the crime, such as whether or not they reported the crime to the police and their level of cooperation with the police and prosecution.

Scales

Any commonly known scales, measures, or inventories used in the data collection. For more information, see Scales in the ICPSR Metadata Documentation Portal.

Scales are shared measurement tools that combine multiple variables or data elements into a single score. Common examples include the Minnesota Multiphasic Personality Inventory (MMPI), the Consumer Price Index (CPI), and the Beck Depression Inventory.
Provide a citation for each scale whenever possible. At minimum, include the title, author, publication year, and journal (if applicable). Include a DOI or URL if one is available.
Cite scales as a list or describe them in full sentences.
If the data contain Likert scales, it is allowed but not necessary to note them here.

A Scales field with full sentences:

The baseline data collection included one scale – the CES-D index for maternal depression [Cole, J. C., Rabin, A. S., Smith, T. L., and Kaufman, A. S. (2004). Development and validation of a Rasch-derived CES-D short form. Psychological assessment, 16(4), 360. https://doi.org/10.1037/1040-3590.16.4.360]. All scales used for outcomes at ages 1 through 3 are listed in Appendix Tables 1 and 2 in the User Guide. Please refer to the User Guide and P.I. Codebook, available under the ‘Data and Documentation’ tab, for details.

A Scales field containing a list of citations:

Squires, J., Bricker, D. D., and Twombly, E. (2009). Ages and stages questionnaires. Baltimore, MD: Paul H. Brookes.
Briggs-Gowan, M. J., Carter, A. S., Irwin, J. R., Wachtel, K., and Cicchetti, D. V. (2004). The Brief Infant-Toddler Social and Emotional Assessment: screening for social-emotional problems and delays in competence. Journal of pediatric psychology, 29(2), 143-155. https://doi.org/10.1093/jpepsy/jsh017
Yu, L., Buysse, D. J., Germain, A., Moul, D. E., Stover, A., Dodds, N. E., … and Pilkonis, P. A. (2012). Development of short forms from the PROMIS sleep disturbance and sleep-related impairment item banks. Behavioral sleep medicine, 10(1), 6-24. https://doi.org/10.1080/15402002.2012.636266

Research: Transparency & Impact

Use these fields to describe publications, software, and other resources related to this data collection.

Data-related Publications

Citations of publications that use or analyze the data.

Cite any published articles, reports, preprints, presentations, or other works that analyze, critique, or discuss the data and its collection process or methodology.
Exclude publications that mention, but do not analyze, the data collection.
For online resources, include the URL to access the publication. A persistent identifier such as a digital object identifier (DOI) is preferable whenever possible.
If a DOI is available, use the Import via DOI option to enter publication details. If the publication does not have a DOI, select “Manually Enter Citation” to provide the title, author, publication date, and other information manually.

Kirksey, J. Jacob. “Academic Harms of Missing High School and the Accuracy of Current Policy Thresholds: Analysis of Preregistered Administrative Data From a California School District.” AERA Open 5, no. 3 (July 2019): 233285841986769. https://doi.org/10.1177/2332858419867692.
Palmer, Ashley N., Mansi Patel, Sarah C. Narendorf, Shellye Sledge, and Katherine Sanchez. “Changes in Flourishing from Adolescence to Young Adulthood: An 8-year Follow-up.” Child & Family Social Work 28, no. 1 (February 2023): 194–209. https://doi.org/10.1111/cfs.12953.

Data Management Plan

A link to the data management plan (preferably a persistent identifier such as a DOI). For more information, see Data Management Plan in the ICPSR Metadata Documentation Portal.

Some funding agencies require grant recipients to follow a data management plan for sharing data. Sites like DMPTool and DMPonline support creating and publishing data management plans. Use this field to provide a link to a published data management plan.
Only URLs can be entered in this field. To provide a copy of a data management plan that is not available online, upload it along with the data and documentation.

Preregistration

A link to a research plan for the data collection (preferably a persistent identifier such as a DOI). For more information, see Preregistration in the ICPSR Metadata Documentation Portal.

Preregistration lets researchers publicly share hypotheses and data analysis plans before data collection begins. Sites like the Open Science Framework and the AEA RCT Registry support preregistering studies and sharing plans publicly.
Only URLs can be entered in this field. To provide a copy of a preregistration that is not available online, upload it along with the data and documentation.

Software Applications

Software used by the principal investigator(s) to collect or analyze data, required to understand how the data were obtained or to reproduce results. For more information, see Software Applications in the ICPSR Metadata Documentation Portal.

Use the fields in this dialog box for data collections that include code or depend on an uncommon or custom application.
If the data can be analyzed with commonly available statistical software, this field is usually not needed.
Provide as much information about the software application as is needed to access and run it. Depending on the application, this may include a version, download link, operating system information, and shared libraries.

This is an example of the Software Applications dialog box with several fields filled in.

Software Name: siegfried
Software Description: Siegfried is a signature-based file format identification tool, implementing the National Archives UK’s PRONOM file format signatures; freedesktop.org’s MIME-info file format signatures; the Library of Congress’s FDD file format signatures (beta); and Wikidata (beta).
Programming Languages: go; javascript; other
Operating Systems: Mac, Linux, Windows
License: https://www.apache.org/licenses/LICENSE-2.0
Download URL: https://github.com/richardlehane/siegfried/archive/refs/heads/main.zip

Additional Information

Use these fields to provide any other information that users should know about the data collection.

General Data Formats

The file format types present in the data collection. For more information, see General Data Formats in the ICPSR Metadata Documentation Portal.

Use this field to indicate broadly whether the data contain numeric information, text, images, software code, or some other format.
Select more than one value for data collections with multiple formats, for example, a mixed methods study that includes survey responses (Numeric) and interview transcripts (Text).

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for General Data Formats.

Numeric: Select this value for most quantitative data, such as survey responses and administrative statistics.
Text: Select this value for text-based qualitative data such as interview transcripts or social media posts.
Still image: Select this value for static images such as photographs, data visualizations, or X-rays.

Notes

Important details about the data collection (like unique authoring, discrepancies, or processing information) that can’t be recorded in other metadata elements. For more information, see Notes in the ICPSR Metadata Documentation Portal.

Use this field to provide any additional information that users need to know about the data collection.
Linking to a project website in a note is acceptable. However, links break and websites change over time. To ensure that users will be able to understand the data collection long into the future, include key details in the appropriate fields or in the documentation.

Information on the Index of Consumer Sentiment, the Index of Current Economic Conditions, and the Index of Consumer Expectations and how they were created can be found in the P.I. Codebook.
Dataset 1 should be attributed to Jane Doe while datasets 2-6 should be attributed to John Doe.
Additional information on the Survey of Consumers can be found by visiting the Survey of Consumers Website.

Deposit Settings

ICPSR Archive Collection

The ICPSR archive (also known as a thematic collection) to which the data collection should belong.

Most archives have collection criteria that determine what type of data collections can be deposited. To learn more about archive collections and their collection criteria, see Does ICPSR Accept My Data?.
The archive you select determines whether the data collection can be curated or self-published.
If you are not sure, select “Inter-university Consortium for Political and Social Research.” Our team at ICPSR will review the submission and make sure it gets in the right hands.

National Archive of Computerized Data on Aging (NACDA)
National Addiction and Health Data Archive Program (NAHDAP)
DataLumos

Study Curation Request

Indicates whether ICPSR staff should curate the data collection.

For detailed information about ICPSR curation, see Professional Curation vs Self-Publishing.
Whether you can request curation depends on the ICPSR archive collection you choose for your data. Some archives accept only data collections to be curated, while others accept only self-published data collections. A few accept both. For more information, see Does ICPSR Accept My Data?

Yes, I want my study to be curated.
No, I will curate the study myself and publish it immediately.

Disclosure Risk

Use these fields to indicate any privacy or sensitivity concerns with the data collection.

Can individuals be identified from this Data Collection?

Indicates whether the data contain personally identifying information such as names, telephone numbers, and birth dates.

To recognize personally identifiable information, consider: if the data were made public, could someone find individuals through direct or indirect identifiers?
Currently, ICPSR’s single stream deposit form can be used only for data collections that do not contain personally identifiable information. Data containing personally identifiable information can be deposited with ICPSR’s legacy deposit form.
For more information about identifying information and how to recognize it, see ICPSR’s Approaches to Confidentiality.

Does this Data Collection include sensitive information?

Indicates whether the data contain information that creates a risk of harm to individuals within the data collection.

To recognize sensitive information, consider: if the data were made public, would that create a risk of harm (e.g., psychological distress, social embarrassment, financial loss) greater than the risk that people experience in everyday life?
Currently, ICPSR’s new single stream deposit form can be used only for data collections that do not contain personally identifiable information. Data containing personally identifiable information can be deposited with ICPSR’s legacy deposit form.

ICPSR Review

Embargo Release Date

The date on which the data collection will become available to others.

If you would like to delay public release of your data, please enter an embargo release date. The embargo period may be set for up to three years from the date of your submission. After this date, your data will automatically become publicly available.

2027-01-01
2025-09-30

Additional Fields

Manuscript Number

A unique identifier that associates the data collection with a manuscript submitted to a journal. For more information, see Manuscript Number in the ICPSR Metadata Documentation Portal.

Fill out this field only for deposits to archive collections supported by journals, such as the American Economic Association (AEA) Data and Code Repository.
Include details about the related article in the Data-related Publications field.

ECIN-Mar-2025-0078.R2
AER-2019-0000

Accessibility Statement

ADA Accessibility

Indicates whether the data collection is ADA accessible, conforming to WCAG 2.1 AA standards, or qualifies for the ADA archival exception. For more information, see ADA Accessibility in the ICPSR Metadata Documentation Portal.

For guidance on making your deposit accessible, see Depositor Guidelines for Sharing Accessible Study Materials using ICPSR.

Review License

This is the license that sets the rules for how others will be able to use, share, or modify your data collection. For more information, see License in the ICPSR Metadata Documentation Portal.

Depositors select from a variety of licenses to distribute their data, including Creative Commons licenses and a Public Domain Mark.
Available software licenses include: Apache License 2.0; BSD 2-Clause “Simplified” or “FreeBSD” License; BSD 3-Clause “New” or “Revised” License; GPL 3.0; LGPL 3.0; MIT License; Mozilla Public License 2.0; Common Development and Distribution License; and the Eclipse Public License.
If depositors would like to use multiple licenses or create a customized license, they may upload a LICENSE file alongside the data and documentation within the project workspace.
ICPSR requires a license for the distribution of data, but copyright remains with the author.

Creative Commons Attribution 4.0 International
Apache License 2.0

Using ICPSR's New Deposit Form

Create Your Citation

Title

Tips for Entering a Title

Title Examples

Alternate Titles

Tips for Entering Alternate Titles

Alternate Title Examples

Principal Investigators

Tips for Entering Principal Investigators

Principal Investigator Examples

Funding Sources

Tips for Entering Funding Sources

Funding Source Examples

Make Your Study Findable

Summary

Tips for Writing a Summary

Summary Examples

ICPSR Subject Terms

Tips for Choosing Subject Terms

ICPSR Subject Term Possible Values

ICPSR Subject Term Examples

Journal of Economic Literature (JEL) Classification Codes

Tips for Choosing JEL Classification Codes

JEL Classification Code Possible Values

JEL Classification Code Examples

Medical Subject Headings (MeSH) Terms

Tips for Choosing Medical Subject Headings

Medical Subject Heading Possible Values

Medical Subject Heading Examples

Time Periods

Tips for Entering Time Periods

Time Period Examples

Nationally Representative Sample

Tips for Indicating a Nationally Representative Sample

Nationally Representative Sample Possible Values

Nationally Representative Sample Examples

Geographic Coverage Areas

Tips for Choosing Geographic Coverage Areas

Geographic Coverage Area Possible Values

Geographic Coverage Area Examples

Smallest Geographic Unit

Tips for Choosing a Smallest Geographic Unit

Smallest Geographic Unit Possible Values

Smallest Geographic Unit Examples

Study Context

Study Design

Tips for Describing the Study Design

Study Design Examples

Universe

Tips for Describing the Universe

Universe Examples

Time Methods

Tips for Choosing Time Methods

Time Method Possible Values

Time Method Examples and Guidance

Methods: Sample Design

Units of Analysis

Tips for Choosing Units of Analysis

Unit of Analysis Possible Values

Unit of Analysis Examples and Guidance

Sampling Procedures

Tips for Choosing Sampling Procedures

Sampling Procedure Possible Values

Sampling Procedure Examples and Guidance

Sampling Note

Tips for Writing a Sampling Note

Sampling Note Examples

Weight

Tips for Describing Weights

Weight Examples

Response Rates

Tips for Describing Response Rates

Response Rates Examples

Methods: Collection Design

Data Source Types

Tips for Choosing Data Source Types

Data Source Type Possible Values

Data Source Type Examples and Guidance

External Data Sources