ICPSR Data Fair 2016

ICPSR - Navigating the Data (R)Evolution!

Data are everywhere! There are big data, little data, organic data, data by design, public- and restricted-use data, in all kinds of formats and shared from all kinds of places. It's a fascinating and challenging time to be in the data sharing space.

In addition to an evolving external data environment, ICPSR is evolving internally. Maggie Levenstein began her term as ICPSR's new director on July 1, 2016; ICPSR is in the midst of updating its data enterprise system, improving its ingest, curation, sharing, and preservation process from end to end; and the ICPSR website has been refreshed to improve usability no matter the device. In addition, ICPSR is hosting a number of new data collections and datasets, looking for collaboration opportunities, and continuing to push to credit researchers for their data deposits by increasing the research impact of their data.

ICPSR's 2016 Data Fair aims to introduce, engage, and help the data community navigate the ongoing Data (R)Evolution. Come join us for one, some, or all of the ICPSR 2016 Data Fair!

ICPSR Data Fair Program

ICPSR is pleased to present its program (below) for the 2016 ICPSR Data Fair, which will take place Monday through Thursday, September 26-29, 2016.

  • Webinar broadcast times are listed in Eastern Daylight Time (EDT).
  • Webinars are free and open to the public, and it is permissible (even encouraged!) for organizations to broadcast these webcasts to groups of attendees.
  • Attendees must register for each webinar they want to attend.
  • Please note that the 1:00 pm and 3:00 pm sessions are limited to the first 100 participants to log into the session.
  • Recordings and slide decks (when made available) will be placed on ICPSR's YouTube Channel. Look for the playlist titled "2016 Data Fair."

ICPSR Data Fair at a Glance

Webinar Title | Time (EDT) | Registration Link
Monday, September 26
Meet the ICPSR Director - Ask the ICPSR Director! | 12:00-12:50 | https://attendee.gotowebinar.com/register/6266429404687288836
Archonnex at ICPSR - Data Science Management for All | 1:00-1:50 | https://attendee.gotowebinar.com/register/4902297213962994178
Orientation to ICPSR with a Fresh New Look! | 2:00-2:50 | https://attendee.gotowebinar.com/register/2843861913165149954
The Quest for Data: Acquisitions Activities and Archiving Options at ICPSR | 3:00-3:50 | https://attendee.gotowebinar.com/register/6514314301189837828
Manage your team's data, attach metadata, and publish to ICPSR using SEAD | 4:00-4:50 | https://attendee.gotowebinar.com/register/8516164335562360066
Tuesday, September 27
ICPSR in Education - Teaching, Learning, & Research Experience! | 12:00-12:50 | https://attendee.gotowebinar.com/register/5831386738437292290
The Mandate to go Public & the Pressure to Publish First, Big, & Fast | 1:00-1:50 | https://attendee.gotowebinar.com/register/5791329330834185476
A Thoroughly Gentle Introduction to Methods Metadata! | 2:00-2:50 | https://attendee.gotowebinar.com/register/1835957196169153795
Data Karma: How to deposit data that stands the test of time | 3:00-3:50 | https://attendee.gotowebinar.com/register/2332602200409159681
Wednesday, September 28
Open (Data) is Not Enough - Research Data Curation for Data Reuse | 12:00-12:50 | https://attendee.gotowebinar.com/register/1050903694916268547
Assisting Researchers Demonstrate Impact Using Data-Related Publications: How ICPSR Does It and How You Can Help | 1:00-1:50 | https://attendee.gotowebinar.com/register/2413997946701932803
Collaborating for Open Data Access AND Data Reuse - How do we do it? | 2:00-2:50 | https://attendee.gotowebinar.com/register/8355536682010163204
A DDI Primer: An overview and examples of DDI in action | 3:00-3:50 | https://attendee.gotowebinar.com/register/3068790106391445507
Thursday, September 29
Biomedical Data: What is it? Who is Involved? What Data are Available? | 12:00-12:50 | https://attendee.gotowebinar.com/register/3694496884973564161
Many Disciplines, One Topic: CivicLEADS and the Potential for Multi-Disciplinary Research Data Archiving | 1:00-1:50 | https://attendee.gotowebinar.com/register/6945380033883318785
Cultural Participation of U.S. Adults: Featured NADAC Data Highlights | 2:00-2:50 | https://attendee.gotowebinar.com/register/6281748900200358148
Understanding the Data and Access Options Available in NAHDAP (Featuring the Population Assessment of Tobacco and Health (PATH) Study) | 3:00-3:50 | https://attendee.gotowebinar.com/register/6201232763227864577

Monday, September 26, 2016

Meet the ICPSR Director - Ask the ICPSR Director!

YouTube Video

PowerPoint Download

Margaret Levenstein, appointed the director of the ICPSR after a national search, began her five-year term on July 1, 2016. "Maggie's vision for ICPSR's future is exciting," said ICPSR Governing Council Chair Chandra Muller. "She has a strong academic research background, understands the potential of data science for current and future social science researchers, and has a stellar reputation both nationally and internationally. We were impressed by her skill in bringing together experts from diverse fields for the advancement of social science research goals."

Join Maggie as she articulates her vision for ICPSR during this evolving and exciting time in data and in data sharing. Ample time will be available for live Q&A, but why wait? Have a question for the new ICPSR Director? Submit it here now for response on September 26: Ask the Director Link - https://www.surveymonkey.com/r/ASKICPSR

Presenter: Maggie Levenstein


Archonnex at ICPSR - Data Science Management for All

YouTube Video

PowerPoint Download

Archonnex is a Digital Asset Management System (DAMS) architecture designed to move ICPSR to a newer technology stack that meets the core and emerging business needs of the organization and the industry. It aims to build a digital technology platform that leverages ICPSR expertise and proven open source technologies that are well supported by strong open source communities. This component-based design identifies reusable, self-contained services as components. These components will be integrated and orchestrated using an Enterprise Service Bus and Message Broker to deliver complex business functions. All components start as a Minimum Viable Product (MVP) and are improved in iterative development phases.

This presentation will identify the operational components involved in running a data science repository and their technology counterparts. It will start with upfront integration with the researcher, which allows better-managed data collection, dissemination, and management during research (e.g., SEAD), and then follow the workflow technologically from ingestion of data into the repository through curation, archiving, publication, and re-use of the research data, including citation and bibliography management along the way. The integration of data management plans and their impact on this workflow should become apparent with this ground-up architecture designed for the data science industry. Participants will leave with an understanding of how the Archonnex architecture at ICPSR is strengthening our data services offering for new researchers as well as for data re-use, and how repository brokering may be leveraged.
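As a rough illustration of the idea of self-contained components orchestrated through a message broker, here is a minimal Python sketch. It is not Archonnex code: the `MessageBroker` class, the stage names in `STAGES`, and the `run_deposit` helper are all hypothetical, and a real deployment would rely on an actual Enterprise Service Bus and broker rather than an in-memory dictionary.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Optional

# Hypothetical stage names loosely following the workflow described above.
STAGES = ["ingest", "curate", "archive", "publish"]


class MessageBroker:
    """Tiny in-memory publish/subscribe broker (illustrative only)."""

    def __init__(self) -> None:
        self._subscribers: Dict[str, List[Callable[[dict], None]]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Callable[[dict], None]) -> None:
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, message: dict) -> None:
        # Deliver the message to every component listening on this topic.
        for handler in list(self._subscribers[topic]):
            handler(message)


def register_components(broker: MessageBroker) -> None:
    """Register one self-contained component per stage.

    Each component only records its own work and hands the message to the
    next topic; it knows nothing about the other components.
    """
    for index, stage in enumerate(STAGES):
        next_stage: Optional[str] = STAGES[index + 1] if index + 1 < len(STAGES) else None

        def handler(message: dict, stage: str = stage,
                    next_stage: Optional[str] = next_stage) -> None:
            message["history"].append(stage)
            if next_stage is not None:
                broker.publish(next_stage, message)

        broker.subscribe(stage, handler)


def run_deposit(name: str) -> List[str]:
    """Send one hypothetical deposit through the pipeline and return its history."""
    broker = MessageBroker()
    register_components(broker)
    deposit = {"name": name, "history": []}
    broker.publish("ingest", deposit)
    return deposit["history"]


if __name__ == "__main__":
    print(run_deposit("example-study"))
```

Because each component talks only to the broker, a stage could in principle be replaced or improved iteratively without touching the others, which is the property the component-based description emphasizes.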

Presenters: Harsha Ummerpillai; Thomas Murphy


Orientation to ICPSR with a Fresh New Look!

YouTube Video

PowerPoint Download

New to ICPSR or considering membership in ICPSR? New to the Official or Designated Representative (OR/DR) role? Tasked with educating (training) your institution about what is available from ICPSR and its hosted archives and need tools and content to assist? Need to better understand the benefits of membership in ICPSR? This session is for you!

Bonus Feature: ICPSR has updated its website with a fresh new look, streamlined navigation, and other functional improvements. We'll begin this session with an overview of the updates and improvements.

Learning Objectives:

  1. Understand and explain what ICPSR offers to the data community, including faculty, students, and others affiliated with your organization, across numerous disciplines.
  2. Understand what options are available to individuals who need:
    • To deposit research data
    • To find and analyze data for articles or papers
    • Training in or teaching quantitative methods or data curation
    • Data management plans and quotes for inclusion in grant proposals and budgets
  3. Understand, in an era of public access data, why membership in ICPSR matters!

Presenters: Linda Detterman; Wendi Fornoff


The Quest for Data: Acquisitions Activities and Archiving Options at ICPSR

YouTube Video

PowerPoint Download

In this session, attendees will learn about the criteria that ICPSR uses to seek and select potential datasets. The Acquisitions Team will review ICPSR's Collection Development Policy and explore the various archiving options available to depositors. The presenters will also describe high-priority data areas for ICPSR as well as highlight past, ongoing, and future outreach efforts to identify and acquire data for ICPSR's collection of holdings.

Presenters: Justin Noble; Piper Simmons


Manage your team's data, attach metadata, and publish to ICPSR using SEAD

YouTube Video

PowerPoint Download

Join us to learn about new data services offered by the SEAD project and designed to support researchers through each step of the data life cycle from collection to publication. SEAD offers secure, team-controlled Project Spaces which can address all your data needs, even for large and heterogeneous data collections, by offering powerful tools for easy management, metadata, team collaboration, and sharing. And, you will be amazed to learn how easy it is to submit your datasets with a click of a button directly from your SEAD Project Space for archiving and publication in a longer-term repository like ICPSR.

Presenter: Anna Ovchinnikova


Tuesday, September 27, 2016

ICPSR in Education - Teaching, Learning, & Research Experience!

YouTube Video

PowerPoint Download

While any of ICPSR's data and tools can be used in the classroom, ICPSR has created resources to make it easy for instructors to set up intriguing data-driven learning experiences, particularly for undergraduate students. This session will introduce and demonstrate ICPSR's teaching and learning resources, including:

  1. Data-Driven Learning Guides & Instructor Tools
  2. Student Research Paper Competitions
  3. ICPSR Undergraduate Internship
  4. ICPSR Summer Program

Presenter: Lynette Hoelter


The Mandate to go Public & the Pressure to Publish First, Big, & Fast

YouTube Video

PowerPoint Download

An openICPSR Case Study (Political Persuasion and Attitude Change Study)

Presenter: Amy Pienta


A Thoroughly Gentle Introduction to Methods Metadata!

YouTube Video

PowerPoint Download

Weights? Panel study? Universe? Ever wonder what the terms in the "Methodology" metadata mean? ICPSR datasets are described in detail by the research methods employed while collecting and analyzing the data. The metadata can be somewhat of a mystery or even intimidating for those not trained in data collection (methods) or analysis techniques. This session will give you a gentle but thorough introduction to the vocabulary of social research so that you can use this information confidently! We will walk through a number of studies, using the ICPSR Study Home Page, to explain project metadata including the contents of the Scope of Study, Methodology, Versions, and Variables.

Presenter: Lynette Hoelter


Data Karma: How to deposit data that stands the test of time

YouTube Video

PowerPoint Download

This session will illustrate how providing complete, quality data and documentation is better not only for long-term preservation but also for quicker turnaround time at ICPSR.

  • What does a "good" data deposit look like? A visual comparison and discussion.
  • What happens after it is deposited? The quick answer (hopefully).
  • What happens if I realize I need to resubmit materials?

Presenter: Kathryn Frania


Wednesday, September 28, 2016

Open (Data) is Not Enough - Research Data Curation for Data Reuse

YouTube Video

PowerPoint Download

Imagine you are a scientist and you find data from what seems like an interesting study but you have little documentation or context. Which files should be used? What should you do about values in the data that do not make sense? How will you open the data files with current statistical software? What analyses have already been conducted and published using these data? How will you interpret the results? How will you tell the story?

Increasing data sharing is a start, but it is not enough. Well-described, complete, and self-explanatory data collections are the ultimate goal -- beyond simply storing and sharing files in a repository. This session will explore the unique opportunities and challenges of open research data for re-use and discuss how data stewards can ensure that data are both shared and re-usable. Specific topics will include: a brief, non-technical explanation of research methods metadata and its value to data discovery, analysis, and interpretation; how to deal with sensitive and confidential data; and policy strategies to identify selected data files and documentation for curation. The talk will propose that indeed, open data is not enough, and metadata rocks because data have stories to tell!

Presenter: Linda Detterman


Assisting Researchers Demonstrate Impact Using Data-Related Publications: How ICPSR Does It and How You Can Help

YouTube Video

PowerPoint Download

Researchers who deposit their data for reuse should get credit for its impact on future scholarship. Beyond knowing the number of times their data files are downloaded, PIs can measure impact by counting how often findings based on their data are published in the scholarly literature. This is why ICPSR makes a concerted effort to link data deposited here with publications using those data. How do we do this? It's more difficult than you might think. This session offers a look into the current practice of data citation and outlines the technical and cultural challenges associated with identifying and systematically collecting examples of data reuse. Luckily, you are in a position to improve practices, and we will discuss how you can.

Presenter: Elizabeth Moss


Collaborating for Open Data Access AND Data Reuse - How do we do it?

YouTube Video

PowerPoint Download

The 2013 OSTP memorandum on "Increasing Access to the Results of Federally Funded Scientific Research" required government agencies to develop and enact a plan for ensuring public access to data produced with their financial support. Quickly, researchers and institutions receiving agency funding began to identify and/or build infrastructures to provide open access to research outputs to demonstrate compliance. As a 54-year supporter of research data reuse, ICPSR has lauded the movement toward open access.

On the research data front, the movement has produced a myriad of data platforms in which research data can be shared including numerous hosted data sharing services, open data initiatives, and the launch of university-sponsored institutional repositories. Millions of research-related files, including data files, have been opened to the public. "Open" is definitely in!

As data stewards, we know that data access starts with discovery and that data reuse requires working data files and documentation, including information on study methodology and provenance. We also know that not all data files are of interest or usable to those who want to analyze them. And we know that even in this open environment, we must ensure the confidentiality of research subjects. So how do data sharing services, institutional repositories, and domain repositories work together to open data, identify and select data with potential for reuse, curate it for the long term, and provide linkages to give researchers credit? In this session, ICPSR will lead an online discussion of these topics. Come ready to contribute and collaborate!

Presenters: Linda Detterman; Lynette Hoelter; David Thomas


A DDI Primer: An overview and examples of DDI in action

YouTube Video

PowerPoint Download

The Data Documentation Initiative (DDI) is an international standard for describing survey and other social science data. Documenting data with DDI facilitates interpretation and understanding -- both by humans and computers. Learn how DDI is helping researchers and organizations, including ICPSR, better document, discover, and interoperate!
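To make the "both by humans and computers" point a little more concrete, here is a small Python sketch that writes and then reads back a fragment of study and variable documentation. The element names (`codeBook`, `stdyDscr`, `citation`, `titlStmt`, `titl`, `dataDscr`, `var`, `labl`) are borrowed from DDI Codebook, but the fragment is deliberately simplified: it omits the namespace and most required content, so it should not be taken as a valid DDI instance. The study title, variable names, and helper functions are made up for illustration.

```python
import xml.etree.ElementTree as ET
from typing import Dict


def build_codebook(title: str, variables: Dict[str, str]) -> ET.Element:
    """Build a simplified, DDI-Codebook-style fragment (not schema-valid)."""
    code_book = ET.Element("codeBook")

    # Study-level description: just the title in this sketch.
    study = ET.SubElement(code_book, "stdyDscr")
    citation = ET.SubElement(study, "citation")
    title_stmt = ET.SubElement(citation, "titlStmt")
    ET.SubElement(title_stmt, "titl").text = title

    # Variable-level description: one <var> per variable, with a label.
    data_dscr = ET.SubElement(code_book, "dataDscr")
    for name, label in variables.items():
        var = ET.SubElement(data_dscr, "var", {"name": name})
        ET.SubElement(var, "labl").text = label

    return code_book


def variable_labels(code_book: ET.Element) -> Dict[str, str]:
    """Read variable names and labels back out, as software might."""
    return {var.get("name"): var.findtext("labl") for var in code_book.iter("var")}


if __name__ == "__main__":
    # Hypothetical study and variables, for illustration only.
    book = build_codebook(
        "Example Study",
        {"AGE": "Age of respondent", "EDUC": "Highest level of education"},
    )
    xml_text = ET.tostring(book, encoding="unicode")
    print(xml_text)  # readable by a person
    print(variable_labels(ET.fromstring(xml_text)))  # readable by a program
```

The same structured description serves both audiences: a person can read the labels in the XML, and a program can extract them without guessing at the layout.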

Presenters: Jon Johnson (CLOSER), Jared Lyle (ICPSR), and Barry Radler (MIDUS)


Thursday, September 29, 2016

Biomedical Data: What is it? Who is Involved? What Data are Available?

YouTube Video

PowerPoint Download

The social science research community is adding various biomedical information to social science surveys in order to investigate how genetic variations and other human biological measurements affect particular human behaviors. Because of this, social science researchers need to understand how biomedical data can be used for social science research, learn a new set of terminology, learn about organizations involved in this area, and become familiar with biomedical data available for future research projects.

Learning Objectives

  1. Learn why there is interest in biomedical data for social science research.
  2. Understand the differences in types of data:
    • Phenotypes vs. genotypes
    • Biomeasures vs. bioindicators vs. biomarkers vs. biospecimens
    • HIPAA data vs. social science research data with health information
  3. Learn about ICPSR's partnerships with other data repositories and the biomedical community
  4. Understand what biomedical data are available from ICPSR and potential outcomes that can be investigated with these data

Presenter: TBD


Many Disciplines, One Topic: CivicLEADS and the Potential for Multi-Disciplinary Research Data Archiving

YouTube Video

PowerPoint Download

Research into civic education, engagement, and action - as well as the relationships between these areas - is undertaken in a variety of fields including Political Science, Sociology, Education, and Communications. Researchers examine countless topics related to individuals' interactions with their communities. The breadth of approaches and disciplines can make finding data and collaborating with colleagues around shared research topics difficult. One of ICPSR's newest archives, CivicLEADS, provides an infrastructure for sharing, discovering, and inspiring cross-disciplinary research on civic learning, engagement, and action. This session will provide an overview of CivicLEADS, a brief background on the need the archive addresses, and a discussion of the potential cross-disciplinary utility of data archived across multiple disciplines.

Presenters: David Bleckley; Johanna Bleckman


Cultural Participation of U.S. Adults: Featured NADAC Data Highlights

YouTube Video

PowerPoint Download

This session, led by staff managing ICPSR's National Archive of Data on Arts and Culture, will take attendees into the exciting world of arts and culture and explore data on some types of cultural activities in which Americans participate. The session will offer helpful ways for data users to find fascinating statistics and get the big picture of what's happening in activities pertaining to reading, music, and the performing arts. This session will highlight studies like the American Time Use Survey 2003-2014 and the Consumer Expenditure Survey tables as well as some wonderful new infographics and other forms of data visualization.

Presenters: Jai Holt; Alison Stroud


Understanding the Data and Access Options Available in NAHDAP (Featuring the Population Assessment of Tobacco and Health (PATH) Study)

YouTube Video

PowerPoint Download

The National Addiction & HIV Data Archive Program (NAHDAP) acquires, preserves and disseminates data relevant to drug addiction and HIV research. The scope of the data housed at NAHDAP covers a wide range of legal and illicit drugs and the trajectories, patterns, and consequences of drug use. This session will discuss the types of research studies available from NAHDAP and the different access options to those studies. The Population Assessment of Tobacco and Health (PATH) Study will be featured as a demonstration of a longitudinal study with public-use and restricted-use data available.

Presenter: Robert Choate