Using ICPSR's New Deposit Form

Beginning in October 2025, ICPSR is introducing a new and improved deposit form — called “single stream” — for sharing data with ICPSR. Use this page as a field-by-field guide for describing your data.

Create Your Citation

Use these fields to describe how users should cite your study.

 

Title

The official title that describes what the data collection is about, its geographic scope, and the time period it covered.

Use a clear, specific title that includes:

  • What the study is about (topic, population, or method). Capitalize major words and avoid vague titles like “Survey Data” or “Research Project.”
  • Where the study covers (city, state, country, or Global if truly worldwide).
  • What time period the study covers (e.g., 1999; 2001-2003; 1999, 2010, 2015; include months only to distinguish repeated releases). Match the time period to what the data describe (not necessarily when they were collected).

  • Bridge of Faith: Aim4Peace Community-Based Violence Prevention Project, Kansas City, Missouri, 2014-2017
  • Health and Relationships Project, United States, 2014-2015
  • Targeted Interventions to Prevent Chronic Low Back Pain in High Risk Patients: A Multi-Site Pragmatic Randomized Controlled Trial (TARGET Trial), 4 U.S. cities, 2016-2019
  • Aid Like A Paycheck (ALAP), Texas and California, 2014-2017
  • COVID-19 Disruptions Disproportionately Affect Female Academics, Global, 2020

 

Alternate Titles

The alternate name(s) or acronym(s) commonly used to refer to the data collection.

Enter an alternate title for the data collection if users might search for it using an acronym or shortened version of the name.

  • Add Health Parent Study
  • FACES 2009
  • Survey of Consumers
  • Eurobarometer 85.2

 

Principal Investigators

The key people or organizations responsible for the data collection, listed by importance. Each data collection requires at least one PI, either a person or an organization.

When the PI is a person:

  • Use “Search for a Person” to enter the name, affiliation, and ORCID ID that appear in the person’s ORCID profile.
  • If you can’t find an ORCID ID for a PI, enter their full name the way it appears in publications or their curriculum vitae. Enter middle names and initials in the Given Name field and suffixes (e.g. Jr., III) in the Family Name field.
  • Enter the PI’s affiliation as it appears in the Research Organization Registry (ROR). If the organization doesn’t have a ROR ID, use its full name, avoid acronyms, and do not include departments or colleges.
  • Enter a PI’s affiliation at the time the research was conducted.

When the PI is an organization:

ICPSR encourages using ORCID and ROR whenever possible and may revise Principal Investigator fields to add these values.

People:

Organizations:

 

Funding Sources

The sources of funding that supported the data collection.

  • Whenever possible, enter the organization’s name as it appears in the Research Organization Registry (ROR).
  • Enter the name without any organization hierarchy, for example, “National Institute on Aging” instead of “United States Department of Health and Human Services. National Institutes of Health. National Institute on Aging”.
  • If the organization doesn’t have a ROR, use its full name and avoid acronyms.
  • Whenever possible, include a unique identifier for the funding award. This can be a grant number; a URL, preferably a persistent one like a digital object identifier (DOI); or both.

Make Your Study Findable

Use these fields to describe your data collection in ways that will help users discover it.

 

Summary

A description of the data collection that helps users understand its purpose, substance, and key topics.

  • Clearly describe what the study is about – even if this information is already in your study documentation.
  • Write in the third person.
  • Use past tense to describe how the data were collected. Use present tense to describe the data themselves.
  • Avoid discussing potential uses, target audiences, or the value of the data.

 

ICPSR Subject Terms

A controlled list of social science terms maintained by ICPSR and used to indicate topics related to the data collection.

  • Choose subject terms from the ICPSR Subject Thesaurus that describe the topic of your data. 
  • Considerations for choosing subject terms:
    • What is the study about?
    • What would an ICPSR user interested in these data search for?
    • What subject terms are used for similar studies in ICPSR’s catalog?
  • You might need to combine terms to describe your topic, for example, “mothers” and “health” for “maternal health.”
  • You can also add subject terms in two other fields: JEL Classification Codes (for economics topics) and MeSH Terms (for health and biomedical topics).
  • Do not add terms for concepts that are described in other fields, like geography and methodology.
  • If existing terms do not adequately describe the study’s topic, email suggested new terms to icpsr-help@umich.edu for possible addition to the ICPSR thesaurus. Most new terms come from the Library of Congress Subject Terms or other social science controlled vocabularies.

To explore all possible values and how they relate to one another, see the ICPSR Subject Thesaurus.

 

Journal of Economic Literature (JEL) Classification Codes

Classification codes used to categorize economic research.

For a list of all possible codes and how they relate to one another, see the JEL Classification Codes from the American Economic Association.

  • A12: Relation of Economics to Other Disciplines
  • B00: History of Economic Thought, Methodology, and Heterodox Approaches
  • N22: Economic History: Financial Markets and Institutions: U.S.; Canada: 1913-

 

Medical Subject Headings (MeSH) Terms

Biomedical and health-related terms from the National Library of Medicine that describe the data collection’s topics.

  • Add MeSH terms if your data are related to health or health care, including clinical care, mental and behavioral health, and public health.
  • Health conditions, medicines, health care occupations, or disciplines related to your data are good candidates for MeSH terms.

To search all possible values, and to see their definitions and relationships to one another, see Medical Subject Headings.

 

Time Periods

The time period(s) to which the data refer, regardless of when the data were collected.

  • Specify the time periods covered by the data, rather than the dates of data collection.
  • Use the Start Date and End Date fields to specify the years and, optionally, months and days, covered by the data.
  • If the study title contains years, they should match the years in the Time Periods dialog box.
  • Use the Time Frame field only if needed to add context or when the date cannot be expressed exclusively in numbers.

  • 2008 – 2016
  • 2023-07 – 2024-06
    Time Frame: Fiscal Year 2024
  • 2020-03-13 – 2023-05-11
  • 1990
    Time Frame: Wave 1

 

Nationally Representative Sample

Indicates whether the data collection uses a sampling design intended to represent the demographics, behaviors, and/or characteristics of the entire nation. This typically involves probability-based methods that allow generalization. It does not include convenience samples that appear similar to the nation by chance.

  • Select a value only for studies that use a probabilistic study sample designed to represent an entire nation.
  • Describe the sampling strategy in more detail in the Sampling Note and Sampling Procedures fields.
  • If your data do not use a sample or you are unsure, leave this blank.

Possible values for this field are yes, no, and [blank].

 

Geographic Coverage Areas

The geographic locations where the data refer or are related.

  • Select the country, state, city, county, region, or continent covered by the study.
  • Spell out place names completely instead of using acronyms. For example, enter “United States” instead of “USA.”
  • Type at least four characters to see matches.
  • Choose only the narrowest level of geographic coverage. For example, if you select “Los Angeles, California, United States,” do not also add “California, United States” and “United States.”
  • For studies with participants from around the world or that are applicable everywhere, select “Earth.”

Possible values come from the GeoNames database, primarily administrative boundaries and populated places (feature classes A and P). To explore all possible values, see GeoNames.

 

Smallest Geographic Unit

The smallest geographic unit (e.g., state or census tract) used in the dataset.

  • Choose a value only if the data contain a geographic variable that can be used for analysis, for example, the state or county where participants live.
  • Do not select a value if the data are all from the same place and not further divided up by geography. For example, if the study took place in New York City, but there are no other geographical variables in the data, leave this field blank. Do not select “City.”
  • You can enter geographic units that do not appear in the drop-down list. Do this only when the data contain a specific geographic variable that does not appear in the list, such as local court districts or school catchment areas.

  • State
  • Census tract
  • Precinct

Study Context

Use these fields to help others understand your data collection and how it was conducted.

 

Study Design

The procedures used to contact participants and gather data.

  • Use this field, if needed, to provide more information about the study design than the summary.
  • Possible topics include survey design, interview methods, how data were obtained, and follow-ups to respondents.

Data on organizational culture in each of the 12 courts (Part 1) were obtained by administering the Court Culture Assessment Instrument (CCAI) to all judges with a felony criminal court docket and to all senior court administrators. A total of 224 respondents completed the questionnaire. The CCAI was used to assess five key dimensions of current court culture orientation: (1) dominant case management style, (2) judicial and court staff relations, (3) change management, (4) courthouse leadership, and (5) internal organization. The determination of what culture judges and court administrators desired to establish in the near future was also obtained through the application of the same instrument (CACI) as practitioners were asked to indicate the type of culture in each work area (or content dimension) they would like to see in their court in the next five years.

 

Universe

The total group of persons or other entities (e.g., households or organizations) that were the object of research and to which analytic results refer.

  • Describe the study’s target population in a short phrase or sentence.
  • Identify the characteristics that determine inclusion in the study. For studies about people, this might be age, nationality, residence, or demographic characteristics.
  • The universe can also consist of things other than people, such as households, organizations, or geographic units.

  • All households in the United States with phones.
  • Part 1: Thirty cities in Massachusetts during 1980-1986. Parts 2-4: All residents in Massachusetts during 1986.
  • Individuals self-identified as transgender, trans, genderqueer, non-binary, or other identities on the transgender identity spectrum aged 18 and older residing in the fifty U.S. states, the District of Columbia, American Samoa, Guam, Puerto Rico, and U.S. military bases overseas.
  • All publicly funded medical examiner and coroner offices.
  • Uncertified ballots for the 2000 United States presidential election in Florida.

 

Time Methods

The methods used to collect data over time, like snapshots at one point (cross-sectional) or repeatedly (longitudinal) to study changes or trends.

  • Use this field to indicate if and how the data collection repeated over time.
  • You can enter time methods that do not appear in the drop-down list. Do this only when the data contain a specific time method that does not appear in the list.

For a list of all possible values and their definitions, see the DDI Alliance Controlled Vocabulary for Time Method.

  • Cross-section: Select this value for data collections that took place at a single point in time. (For repeated cross-sectional data collections, use “Longitudinal: Trend/Repeated cross-section” instead.)
  • Longitudinal: Panel: Select this value for repeated data collections that use the same sample, for example, surveys of the same group of respondents over time.
  • Time series: Select this value for repeated data collections, usually of measurable phenomena such as crime rates or gross domestic product.

Methods: Sample Design

Data collections are often based on a sample, or subset of the population being studied. Use these fields to describe the sample for your data collection.

 

Units of Analysis

The object(s) of analysis for the data collection, such as an organization, individual, or household.

  • In quantitative data, units of analysis often correspond to rows or observations in a dataset.
  • You can enter units of analysis that do not appear in the drop-down list. Do this only when no applicable option is available. For example, if the unit of analysis in your data collection is college students, choose “Individual”; do not enter “college students.”
  • You can choose more than one unit of analysis. For example, if data about individuals contain multiple observations over time, choose both “Individual” and “Time Unit.”

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Units of Analysis.

  • Individual: Select this value for data collected about the characteristics of individual people, such as their opinions or health state.
  • Event/Process/Activity: Select this value for data focused on events or activities that people or organizations engage in, such as elections or infections disease cases.
  • Household: Select this value for data collected about all individuals occupying the same living space, whether or not they are related.

 

Sampling Procedures

The type(s) of sample and sample design used to select survey respondents to represent the population.

  • Use this field to describe how the units of analysis in your data were selected from the universe or population of interest.
  • You can describe the sampling strategy in more detail in the optional Sampling Note field.
  • You can enter sampling procedures that do not appear in the drop-down list. Do this only when the data collection uses a specific, known sampling strategy that does not appear in the list.
  • If the sampling procedure resulted in a nationally representative sample, indicate this in the Nationally Representative Sample field.

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Sampling Procedures.

  • Total universe/Complete enumeration: Select this value when the sample contains all units within the target population, for example, census data.
  • Probability: Systematic random: Select this value when the sample contains every unit at a fixed interval, for example, every tenth record in a database.
  • Non-probability: Availability: Select this value for samples based on opportunity or convenience, for example, individuals who volunteered to take a survey.

 

Sampling Note

Supplemental information about the sampling process that does not fit neatly into the Sampling Procedures field.

  • Briefly describe the methodology used to select participants from the universe or population of interest.
  • In addition to completing the Sampling Note, use the Sampling Procedures field to describe the sampling strategy using standardized language.
  • A detailed discussion of such things as sampling error or other limitations of the sampling methodology is not required here.

  • National sample of telephone numbers from cell (RDD) sampling frame.
  • The probability sample selected to represent the universe consists of approximately 71,000 households.

 

Weight

The weight variables and the criteria for using them in data analysis, or other information about how the data are weighted if no weight variables are present.

  • Give the name of any weighting variables in the data and explain when they should be used.
  • If weights are described in documentation provided with the data, indicate where that information can be found.

  • Both the TransPop and Cisgender datasets have the same variable named WEIGHT as the weighting variable. The combination datasets have a set of three weight variables (WEIGHT_TRANSPOP, WEIGHT_CISGENDER, WEIGHT_CISGENDER_TRANSPOP).
  • A weight variable with two implied decimal places has been included and must be used in any analysis.

 

Response Rates

The percentage of respondents in the sample who participated in the data collection.

  • Fill out this field only if the data were collected with a survey instrument and the response rates are provided.
  • For longitudinal data, describe the response rate across waves of data collection.

  • The overall response rate for this survey was 20.22 percent; 72.6 percent for existing panelists and 10.4 percent for new panelists, using AAPOR Response Rate 1.
  • Of the 1,843 Midlife in the United States (MIDUS) respondents that researchers attempted to contact, 1,483 agreed to participate (8 percent refused participation and 11 percent either moved or were difficult to contact), yielding a response rate of approximately 81 percent.

Methods: Collection Design

Use these fields to describe how the data were collected, its sources, and key variables and scales.

 

Data Source Types

The source(s) of the data as collected by the Principal Investigators.

  • Use this field to describe what the data are derived from, not their current form.
  • People, things, and other data can all be data source types.
  • You can enter data source types that do not appear in the drop-down list. Do this only when no applicable option is available; select from the list whenever possible. For example, if the data are derived from birth certificates, choose “Registers/Records/Accounts: Administrative”; do not enter “birth certificates.”
  • You can provide more information about the data sources in the External Data Sources field.

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Data Source Types.

  • Communications: Public: Select this value for data collected from public communications, such as social media posts, press releases, or websites.
  • Events/Interactions: Select this value for data collected from one-time occurrences, for example, court cases or elections.
  • Registers/Records/Accounts: Medical/Clinical: Select this value for data collections based on health-related data collected in a health care setting, for example, blood pressure readings or diagnoses of health conditions.

 

External Data Sources

The source of the data, when that source is external to the data collection and can be independently cited.

  • Cite any websites, datasets, books, journal articles, or other sources used to compile the data.
  • At minimum, include the title, author, publication year, and journal (if applicable) of each external data source. Optionally, describe how each data source was used to create or compile the data.
  • For online sources, include the URL to access the data. A persistent identifier such as a digital object identifier (DOI) is preferable whenever possible.
  • Any citation format is accepted.

 

Collection Modes

The method(s) or procedure(s) used to collect the data.

  • Use this field to indicate how data were collected using standardized terminology.
  • Select the most specific value that applies to the data. For example, if data were collected via computer-assisted telephone interview, select “Telephone interview: Computer-assisted (CATI)” rather than “Telephone interview” or “Interview.”
  • You can enter collection modes that do not appear in the drop-down list. Do this only when no applicable option is available.
  • For studies that collected data in more than one way, select all collection modes that apply.

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for Mode of Collection.

  • Face-to-face interview: Computer-assisted (CAPI/CAMI): Select this value for in-person surveys in which an interviewer asked questions and entered responses into a computer or mobile device.
  • Measurements and tests: Physical: Select this value for data collections derived from physical measurements or medical tests, for example, blood pressure readings.
  • Computer-based observation: Select this value for data collected on users’ interaction with technology or software, such as counts of clicks and downloads on a website.

 

Collection Dates

The date(s) data collection took place.

  • Collection Dates refer to when data collection occurred, while Time Periods describe when the data refer for. For example, for data about population growth between 1950 and 2000 collected in 2020, the collection date would be 2020.
  • Enter dates in YYYY-MM-DD, YYYY-MM, or YYYY format.
  • Use the Collection Time Frame for non-numeric dates (e.g., Fall 2018) or to provide context on the collection date (e.g. “Wave 3” in a longitudinal data collection). Do not restate the date in words.

  • 2018 – 2018
    Time Frame: Wave 1
  • 2020-10 – 2020-10
    Time Frame: Wave 2
  • 2003-01-01 – 2003-12-31

 

Variable Description

Significant variables (particularly demographic variables) in the data files.

  • Use this field to provide more detailed information than the Summary, including a review of variables that are important for users to know about.
  • List the key topics or groups of variables covered in the data collection.

  • The data includes variables about participants’ and their parents’ moods, interviewer observations, families’ activities, families’ health history, participants’ school records, and parents’ substance use. Demographic variables include race, religion, annual household income, and the participants’ parents’ employment statuses.
  • The LGBTQ Hate Crimes Interviews dataset contains more in-depth information, including victim demographic information, substance abuse history, information on whether the victim is open about their LGBTQ identification, the victim’s job status, and information about how the victim reacted to the crime, such as whether or not they reported the crime to the police and their level of cooperation with the police and prosecution.

 

Scales

Any commonly known scales, measures, or inventories used in the data collection.

  • Scales are shared measurement tools that combine multiple variables or data elements into a single score. Common examples include the Minnesota Multiphasic Personality Inventory (MMPI), the Consumer Price Index (CPI), and the Beck Depression Inventory.
  • Provide a citation for each scale whenever possible. At minimum, include the title, author, publication year, and journal (if applicable). Include a DOI or URL if one is available.
  • Cite scales as a list or describe them in full sentences.
  • If the data contain Likert scales, it is allowed but not necessary to note them here.

A Scales field with full sentences:

  • The baseline data collection included one scale – the CES-D index for maternal depression [Cole, J. C., Rabin, A. S., Smith, T. L., and Kaufman, A. S. (2004). Development and validation of a Rasch-derived CES-D short form. Psychological assessment, 16(4), 360. https://doi.org/10.1037/1040-3590.16.4.360]. All scales used for outcomes at ages 1 through 3 are listed in Appendix Tables 1 and 2 in the User Guide. Please refer to the User Guide and P.I. Codebook, available under the ‘Data and Documentation’ tab, for details.

A Scales field containing a list of citations:

  • Squires, J., Bricker, D. D., and Twombly, E. (2009). Ages and stages questionnaires. Baltimore, MD: Paul H. Brookes.
  • Briggs-Gowan, M. J., Carter, A. S., Irwin, J. R., Wachtel, K., and Cicchetti, D. V. (2004). The Brief Infant-Toddler Social and Emotional Assessment: screening for social-emotional problems and delays in competence. Journal of pediatric psychology, 29(2), 143-155. https://doi.org/10.1093/jpepsy/jsh017
  • Yu, L., Buysse, D. J., Germain, A., Moul, D. E., Stover, A., Dodds, N. E., … and Pilkonis, P. A. (2012). Development of short forms from the PROMIS sleep disturbance and sleep-related impairment item banks. Behavioral sleep medicine, 10(1), 6-24. https://doi.org/10.1080/15402002.2012.636266

Research: Transparency & Impact

Use these fields to describe publications, software, and other resources related to this data collection.

 

Citations of publications that use or analyze the data.

  • Cite any published articles, reports, preprints, presentations, or other works that analyze, critique, or discuss the data and its collection process or methodology.
  • Exclude publications that mention, but do not analyze, the data collection.
  • For online resources, include the URL to access the publication. A persistent identifier such as a digital object identifier (DOI) is preferable whenever possible.
  • If a DOI is available, use the Import via DOI option to enter publication details. If the publication does not have a DOI, select “Manually Enter Citation” to provide the title, author, publication date, and other information manually.

  • Kirksey, J. Jacob. “Academic Harms of Missing High School and the Accuracy of Current Policy Thresholds: Analysis of Preregistered Administrative Data From a California School District.” AERA Open 5, no. 3 (July 2019): 233285841986769. https://doi.org/10.1177/2332858419867692.
  • Palmer, Ashley N., Mansi Patel, Sarah C. Narendorf, Shellye Sledge, and Katherine Sanchez. “Changes in Flourishing from Adolescence to Young Adulthood: An 8-year Follow-up.” Child & Family Social Work 28, no. 1 (February 2023): 194–209. https://doi.org/10.1111/cfs.12953.

 

Data Management Plan

A link to the data management plan (preferably a persistent identifier such as a DOI).

  • Some funding agencies require grant recipients to follow a data management plan for sharing data. Sites like DMPTool and DMPonline support creating and publishing data management plans. Use this field to provide a link to a published data management plan.
  • Only URLs can be entered in this field. To provide a copy of a data management plan that is not available online, upload it along with the data and documentation.

 

Preregistration

A link to a research plan for the data collection (preferably a persistent identifier such as a DOI).

  • Preregistration lets researchers publicly share hypotheses and data analysis plans before data collection begins. Sites like the Open Science Framework and the AEA RCT Registry support preregistering studies and sharing plans publicly.
  • Only URLs can be entered in this field. To provide a copy of a preregistration that is not available online, upload it along with the data and documentation.

 

Software Applications

Software used by the principal investigator(s) to collect or analyze data, required to understand how the data were obtained or to reproduce results.

  • Use the fields in this dialog box for data collections that include code or depend on an uncommon or custom application.
  • If the data can be analyzed with commonly available statistical software, this field is usually not needed.
  • Provide as much information about the software application as is needed to access and run it. Depending on the application, this may include a version, download link, operating system information, and shared libraries.

This is an example of the Software Applications dialog box with several fields filled in.

Software Name: siegfried
Software Description: Siegfried is a signature-based file format identification tool, implementing the National Archives UK’s PRONOM file format signatures; freedesktop.org’s MIME-info file format signatures; the Library of Congress’s FDD file format signatures (beta); and Wikidata (beta).
Programming Languages: go; javascript; other
Operating Systems: Mac, Linux, Windows
License: https://www.apache.org/licenses/LICENSE-2.0
Download URL: https://github.com/richardlehane/siegfried/archive/refs/heads/main.zip

Additional Information

Use these fields to provide any other information that users should know about the data collection.

 

General Data Formats

The file format types present in the data collection.

  • Use this field to indicate broadly whether the data contain numeric information, text, images, software code, or some other format.
  • Select more than one value for data collections with multiple formats, for example, a mixed methods study that includes survey responses (Numeric) and interview transcripts (Text).

For a list of all possible values and their definitions, see the DDI Alliance controlled vocabulary for General Data Formats.

  • Numeric: Select this value for most quantitative data, such as survey responses and administrative statistics.
  • Text: Select this value for text-based qualitative data such as interview transcripts or social media posts.
  • Still image: Select this value for static images such as photographs, data visualizations, or X-rays.

 

Notes

Important details about the data collection (like unique authoring, discrepancies, or processing information) that can’t be recorded in other metadata elements.

  • Use this field to provide any additional information that users need to know about the data collection.
  • Linking to a project website in a note is acceptable. However, links break and websites change over time. To ensure that users will be able to understand the data collection long into the future, include key details in the appropriate fields or in the documentation.

  • Information on the Index of Consumer Sentiment, the Index of Current Economic Conditions, and the Index of Consumer Expectations and how they were created can be found in the P.I. Codebook.
  • Dataset 1 should be attributed to Jane Doe while datasets 2-6 should be attributed to John Doe.
  • Additional information on the Survey of Consumers can be found by visiting the Survey of Consumers Website.

Deposit Settings

 

ICPSR Archive Collection

The ICPSR archive (also known as a thematic collection) to which the data collection should belong.

  • Most archives have collection criteria that determine what type of data collections can be deposited. To learn more about archive collections, see Thematic Data Collections at ICPSR.
  • The archive you select determines whether the data collection can be curated or self-published.
  • If you are not sure, select “Inter-university Consortium for Political and Social Research.” Our team at ICPSR will review the submission and make sure it gets in the right hands.

  • National Archive of Computerized Data on Aging (NACDA)
  • National Addiction and Health Data Archive Program (NAHDAP)
  • DataLumos

Disclosure Risk

Use these fields to indicate any privacy or sensitivity concerns with the data collection.

 

Can individuals be identified from this Data Collection?

Indicates whether the data contain personally identifying information such as names, telephone numbers, and birth dates.

  • To recognize personally identifiable information, consider: if the data were made public, could someone find individuals through direct or indirect identifiers?
  • Currently, ICPSR’s single stream deposit form can be used only for data collections that do not contain personally identifiable information. Data containing personally identifiable information can be deposited with ICPSR’s legacy deposit form.
  • For more information about identifying information and how to recognize it, see ICPSR’s Approaches to Confidentiality.

 

Does this Data Collection include sensitive information?

Indicates whether the data contain information that creates a risk of harm to individuals within the data collection.

  • To recognize sensitive information, consider: if the data were made public, would that create a risk of harm (e.g., psychological distress, social embarrassment, financial loss) greater than the risk that people experience in everyday life?
  • Currently, ICPSR’s new single stream deposit form can be used only for data collections that do not contain personally identifiable information. Data containing personally identifiable information can be deposited with ICPSR’s legacy deposit form.

ICPSR Review

 

Embargo Release Date

The date on which the data collection will become available to others.

If you would like to delay public release of your data, please enter an embargo release date. The embargo period may be set for up to three years from the date of your submission. After this date, your data will automatically become publicly available.

  • 2027-01-01
  • 2025-09-30

Additional Fields

 

Manuscript Number

A unique identifier that associates the data collection with a manuscript submitted to a journal.

  • Fill out this field only for deposits to archive collections supported by journals, such as the American Economic Association (AEA) Data and Code Repository.
  • Include details about the related article in the Data-related Publications field.

  • ECIN-Mar-2025-0078.R2
  • AER-2019-0000

Accessibility Statement

 

ADA Accessibility

Indicates whether the data collection is ADA accessible, conforming to WCAG 2.1 AA standards, or qualifies for the ADA archival exception.

For guidance on making your deposit accessible, see Depositor Guidelines for Sharing Accessible Study Materials using ICPSR.

  • Yes
  • No

 

Review License

This is the license that sets the rules for how others will be able to use, share, or modify your data collection.

  • Depositors select from a variety of licenses to distribute their data, including Creative Commons licenses and a Public Domain Mark.
  • Available software licenses include: Apache License 2.0; BSD 2-Clause “Simplified” or “FreeBSD” License; BSD 3-Clause “New” or “Revised” License; GPL 3.0; LGPL 3.0; MIT License; Mozilla Public License 2.0; Common Development and Distribution License; and the Eclipse Public License.
  • If depositors would like to use multiple licenses or create a customized license, they may upload a LICENSE file alongside the data and documentation within the project workspace.
  • ICPSR requires a license for the distribution of data, but copyright remains with the author.

  • Creative Commons Attribution 4.0 International
  • Apache License 2.0