Burton Tower IASSIST 2006: Data in a World of Networked Knowledge

IASSIST Program

Monday, May 22

9:00am-5:00pmAdmin Committee Meeting

Tuesday, May 23

8:30am-5:00pmRegistration
9:30am-12:30pm Morning Workshops
12:30pm-2:00pmLunch on your own
2:00pm-5:00pmAfternoon Workshops
6:00pm-8:00pmReception: Reference Reading Room, Harlan Hatcher Library, University of Michigan

Welcome by: Ernie Boyko, IASSIST President, and Dr. James Hilton, Associate Provost for Academic, Information and Instructional Technology Affairs, University of Michigan

Wednesday, May 24

8:30am-5:00pmRegistration
9:00am-10:00amPlenary 1: Cyberinfrastructure and the Social Sciences (Pendleton Room)

Speaker: Daniel Atkins, School of Information, University of Michigan; Discussant: Bjorn Henrichsen, Norwegian Social Science Data Services; Moderator: Myron Gutmann, ICPSR, University of Michigan

10:00am-10:30amBreak
10:30am-12:30pmConcurrent A
  • A1: Leading Users to Knowledge: Data Librarians to the Rescue (Pendleton Room)

    • Keeping Current in Social Science Data (Without Paddling Upstream)
      Joanne Juhnke, University of Wisconsin-Madison

    • Education on the Fly for the Accidental Library Data Professional: Design Your Professional Publication
      Michele Hayslett, North Carolina State University Libraries

    • Social Science Data Librarianship: A University Curriculum
      Fredric Gey, University of California at Berkeley, and Frank Olken, Lawrence Berkeley National Laboratory

    • Blending Traditional and Data Librarianship
      Jennifer Darragh, Pennsylvania State University; Paula Lackie, Carleton College; and Tiffani Conner, University of Connecticut

  • A2: The Essential Role of Metadata in Resource Discovery (Kuenzel Room)

    • Everything but the Kitchen Sink: Building a Metadata Repository for Time Series Data at the Federal Reserve Board
      San Cannon and Meredith Krug, Federal Reserve Board

    • Research-Based Metadata Requirements for a BLS Reports Archive
      Scott C. Berridge, John J. Bosley, and Daniel W. Gillman, U.S. Bureau of Labor Statistics

    • The Madiera Portal: Unified Access to European Data Resources
      Alette Gilhus Mykkeltvedt, Norwegian Social Science Data Services

    • Enabling Discovery, Integration, and Understanding of Criminal Justice Statistical Information: Developing a Metadata Application Profile
      Carol A. Hert, University of Washington, Tacoma, and Sheila O. Denn, University of North Carolina

  • A3: Innovations in Data Dissemination (Art Lounge)

    • What's New With SDA?
      Tom Piazza, University of California at Berkeley

    • Sociometrics
      Josefina J. Card, Sociometrics Corporation

    • University Information System RUSSIA: Database and Value-Added Service for Investigations of Life Quality and Economic Welfare of Households and Individuals in Russia
      Tatyana Yudina, Moscow State University

    • University Information System RUSSIA: Database and Value-Added Service for Investigations of Life Quality and Economic Welfare of Households and Individuals in Russia
      Anna Bogomolova, Moscow State University

12:30pm-1:30pmLunch
1:30pm-3:30pmConcurrent B
  • B1: Institutional Repositories and Social Science Data: Supporting the Data Life Cycle (Pendleton Room)

  • B2: Managing Metadata: Archival Processing (Kuenzel Room)

    • Efficient Ingest of Datasets in a Two-Stage Archival Process: The First Phase - Easy-Store
      Marion Wittenberg and Rutger Kramer, DANS - Data Archiving and Networked Services, The Hague, The Netherlands

    • Metadata Management: The Forgotten World of the Back Office
      Anne Etheridge, Economic and Social Data Service (ESDS)

    • Smart Qualitative Data: Methods and Community Tools for Data Mark-Up (SQUAD)
      Louise Corti and Libby Bishop, UK Data Archive

    • Building Infrastructure and Alliances to Meet Common Goals: "The Creation of a Canadian Public Opinion Data Index"
      Laine Ruus, University of Toronto, and Lois Timms-Ferrara, Roper Center for Public Opinion Research, University of Connecticut

  • B3: Compare and Contrast: Using Cross-National Data (Art Lounge)

    • International Comparative Data: Advice to Neophytes
      Susan Hook Czarnocki, McGill University

    • Evaluating the Quantity and Quality of Publicly Available Cross-National Crime Data
      Janet P. Stamatel, University at Albany

    • Let's Qualify What is Quantified: The Language of Change - Teachers and Their Expressions of Change in Six Countries
      Nora Arato, University of Michigan School of Nursing

    • Association of Religion Data Archives
      Brian J. Grim, Pennsylvania State University Survey Research Center and the ARDA (Association of Religion Data Archives)

3:30pm-4:00pmBreak
4:00pm-5:30pmConcurrent C
  • C1: Data Issues in the Sciences: An Environmental Scan (Pendleton Room)

    • Data Access and Preservation across the Sciences: New Ideas and Initiatives
      Bob Chen, Center for International Earth Science Information Network (CIESIN)

    • The Science Commons Data Project
      John Wilbanks, Science Commons

    • The Scientific Data Commons and Non-conventional Sources
      Harlan Onsrud, University of Maine

  • C2: Effective Design for Data-Rich Web Sites (Kuenzel Room)

    • Evaluation of Web Sites: What Works and What Doesn't
      Sue Ellen Hansen, Survey Research Operations, University of Michigan, and Matthew Richardson, ICPSR, University of Michigan

    • Building Data-Rich Web Sites: The Integration Projects of the Minnesota Population Center
      Bill Block, University of Minnesota Population Studies Center

    • Best Practices for Designing and Building Highly Interactive and Data-Aware Web Sites
      Mark Gregor, Velir Studios

  • C3: Effective Strategies for Metadata Management (Art Lounge)

    • International Household Survey Network: Microdata Management Toolkit
      Pascal Heus, International Household Survey Network

    • Implementing a National Data Archive in Ethiopia: Challenges and Experience
      Yakob Mudesir Seid, Ethiopia Central Statistical Agency

    • Microdata Information System MISSY
      Andrea Janssen, Jeanette Bohr, and Joachim Wackerow, GESIS/ZUMA (Centre for Survey Research and Methodology)

5:30pm-Dinner on your own
6:00pm-8:00pmRoadmap for DDI Tools Development: "Birds of a Feather" Meeting (G150 Perry)

Thursday, May 25

8:30am-5:00pmRegistration
9:00am-10:00amPlenary 2: Disclosure Risk Limitation in Social Science Data: A Plenary in Honor of Pat Doyle (Pendleton Room)

Speaker: Julia Lane, National Opinion Research Center; Discussants: Robert Groves, University of Michigan, and Cynthia Cook, Statistics Canada; Moderator: Judith Rowe, Princeton University (retired)

10:00am-10:30amBreak
10:30am-12:00pmConcurrent D
  • D1: Data Life Cycle Management and the Digital Repository: FEDORA-Based Initiatives (Pendleton Room)

    • A FEDORA-Based Institutional Repository to Support Multidisciplinary Collections
      Ron Jantz, Rutgers University Libraries

    • Exploring FEDORA's Possibilities to Create a Research Space for the Sciences
      Donna J. Tolson, University of Virginia

    • Migrating Numeric Data Collections into FEDORA
      Gretchen Gano, Yale University

  • D2: Metadata Models: Mining and Retrieval (Kuenzel Room)

    • Mine Your Data: Contrasting Data Mining Approaches to Numeric and Textual Data Sources
      Louise Corti, UK Data Archive, and Karsten Boye Rasmussen, University of Southern Denmark

    • Metadata by Design and Fielded Metadata: The Poles of a Space in Which Data Processing Takes Place
      Reto Hadorn, Swiss Information and Data Archive Service for the Social Sciences

    • The Nature of Data
      Frank Farance, Farance Inc., and Daniel W. Gillman, U.S. Bureau of Labor Statistics

  • D3: Enabling Access to Data: Promising Approaches (Anderson Room ABC)

    • The Special Licence Model for Access to More Detailed Microdata
      Karen Dennison, UK Data Archive

    • UK 2001 Census Microdata: Providing Access to Data Subject to Confidentiality Constraints
      Jo Wathan, University of Manchester

    • Open Access Movement and Data
      Jeffrey S. Bullington, University of Kansas Libraries

12:00pm-1:30pmIASSIST Business Lunch
1:30pm-3:00pmConcurrent E
  • E1: DDI for the Next Decade: Toward Version 3.0 (Part 1) (Pendleton Room)

    • Locating the Geographic Center of DDI 3.0
      Wendy Thomas, University of Minnesota

    • Problems of Comparability in the German Microcensus Over Time and the New DDI Version 3.0
      Jeanette Bohr, Andrea Janssen, and Joachim Wackerow, GESIS/ZUMA (Centre for Survey Research and Methodology)

    • DDI Version 3 and Instrument Documentation
      Karl Dinkelmann, Institute for Social Research, University of Michigan

  • E2: Archival Partners: Handling "Born Digital" Materials (Kuenzel Room)

  • E3: Applications for Managing and Distributing Geospatial Data (Anderson Room ABC)

    • An Update from Statistics Canada
      Bernie Gloyn, Statistics Canada

    • Leveraging Resources through Partnerships: A Case Study of a Distributed Web Mapping Service
      Michele Hayslett, North Carolina State University Libraries

    • Got Data? Google Map It!
      Paul Bern, Syracuse University

3:00pm-5:00pmBreak & Poster Session (Michigan Union Ballroom)
6:15pm-10:00pmBanquet, Weller's*
10:00pm-Turkey Action Group

Friday, May 26

8:30am-5:00pmRegistration
9:00am-10:00amPlenary 3: Toward a System of Trusted Digital Repositories (Pendleton Room)

Speaker: Robin Dale, Research Libraries Group; Discussant: David McMillen, National Archives and Records Administration; Moderator: John P. Wilkin, University of Michigan Library

10:00am-10:30amBreak
10:30am-12:30pmConcurrent F
  • F1: We All Count: Quantitative Literacy Efforts and Approaches (Pendleton Room)

    • Developing a Framework for Quantitative Literacy: Counting on IASSIST
      Wendy Watkins, Carleton University

    • Creating a Repository of Training Materials: The Canadian Experience
      Jane Fry, Carleton University

    • Statistical Literacy Survey Results
      Milo Schield, W.M. Keck Statistical Literacy Project, Augsburg College

    • European Social Survey Education Net: Research-Like Learning in the Social Sciences
      Atle Jastad, Norwegian Social Science Data Services

  • F2: Catch and Release: Best Practice Across the Data Life Cycle (Kuenzel Room)

    • Producing Archive-Ready Datasets: Compliance, Incentives, and Motivation
      Margaret Hedstrom, University of Michigan

    • Two Documents, Three Legs, and Five Stages: Developing an Organizational Response to Digital Preservation Requirements
      Nancy McGovern, Cornell University

    • The LEADS Database at ICPSR: Identifying Important Social Science Studies for Archiving
      Amy Pienta, ICPSR, University of Michigan

    • What Goes Around, Comes Around: We Must All be Data Curators Now
      Peter Burnhill, University of Edinburgh

  • F3: Moving Beyond Data to Networked Knowledge (Anderson Room ABC)

    • Alternative Ways of Presenting Historical Census Data
      Luuk Schreven and Anouk de Rijk, Netherlands Institute for Scientific Information Services

    • Database Developments to Establish Internet Content Services
      Zoltan Lux, 1956 Institute, Budapest

    • Delivering Government Data to Lawyers and Journalists
      Susan Long and Linda Roberge, Syracuse University

    • Disseminating Survey Information in the Networked World: A UK Resource
      Julie Lamb, University of Surrey

  • F4: The Big Picture: GIS Data Challenges and Solutions (Art Lounge)

    • State and Local Government Challenges for Geospatial Data Management and Distribution
      Robert R. Downs and Robert S. Chen, Center for International Earth Science Information Network (CIESIN)

    • Consideration for Security Issues of Geospatial Information Services in Local Governments
      Makoto Hanashima, Institute for Areal Studies, Foundation / Institute of Information Security

    • Organizing Data With Temporal and Spatial References
      Michal Paneth-Peleg, Israel Social Sciences Data Center, The Hebrew University

    • Integration of GIS With 2000 China Population Census Data
      Shuming Bao, China Data Center, University of Michigan

12:30pm-1:45pmLunch

What the Hell Were We Thinking?!?! UM's Deal with Google
John P. Wilkin, University of Michigan Library

1:45pm-3:45pmConcurrent G
  • G1: DDI for the Next Decade: Toward Version 3.0 (Part 2) (Pendleton Room)

    • DDI 3
      Arofan Gregory, Aeon LLC

    • DDI 3
      Jostein Ryssevik, presented by Chris Nelson, Open Data Foundation

    • Three Out of Two People Want to Know: The Issues Behind Conversion to DDI 3
      Ken Miller, UK Data Archive

  • G2: New Standards in Statistics and Data Citations (Kuenzel Room)

    • Basic Forms of Citation for Statistics and Data: Towards an Accepted Standard
      Gaetan Drolet, Statistics Canada

    • A Proposed Standard for the Scholarly Citation of Quantitative Data
      Micah Altman and Gary King, Harvard University

    • Tracking and Managing Citations: Data Centers and Best Practices
      W. Christopher Lenhardt, Center for International Earth Science Information Network (CIESIN)

    • Challenges and Opportunities in the Implementation of Citation Standards
      Jeri Schneider, ICPSR

  • G3: Supporting Data Users in a Networked World (Anderson Room ABC)

    • From Primitive Numbers to Knowledge: How Technology Has Enhanced the Dissemination of Social Science Data
      Chiu-chuang (Lu) Chou, University of Wisconsin, Madison

    • Networking in the University Environment: Building Bridges From the Bottom Up
      Jennifer Darragh, The Pennsylvania State University Population Research Institute, and Stephen Woods, The Pennsylvania State University Libraries

    • Developing a Social Science and GIS Data Service in a Predominantly Undergraduate Library: Past, Present, and Future
      Suzette Giles, Ryerson University

    • Data Services Awareness and Use Survey: What We Learned About Promoting Data Services
      Eleanor Read, University of Tennessee

3:45pm-4:00pmBreak
4:00pm-4:45pmConference Wrap-up
5:30pm-Detroit Tigers' Baseball Game*

Saturday, May 27

9:00am-5:00pmDDI Alliance Meeting
8:00am-3:00pmHenry Ford Excursion and Ford Rouge Factory Tour*
10:00am-4:00pmBus Tour of Detroit*

* As the event is off-campus, a bus will shuttle you to the event. The starting time for the event is the time when the bus loads/departs. If you are not there 10-15 minutes prior to that time, you may miss your ride.

Abstracts

Workshops

W1: A Gentle Introduction to DDI: What's in it for Me? (G150B Perry Bldg.)

Session Chair: Jim Jacobs, University of California, San Diego, and Wendy Thomas, University of Minnesota

This workshop is Part 1 of a two-part workshop, with the second part offered in the afternoon as Workshop 5 (Hands-On DDI 3.0 - Concept, Structure, and Tools).

DDI Version 3.0 is currently under review by the DDI Expert Committee and expected to move into public review following the DDI meeting during IASSIST. This long-anticipated move toward a modular approach based on the data life cycle brings increased coverage of comparative data, an instrumentation/questionnaire module, and data management provisions. The new version also raises questions as to what it means for current users of DDI 1 or 2 and what it means for data archivists and programmers. This two-part workshop will cover the broad questions of version differences, new 3.0 features, and the future of the DDI and data documentation (Part 1, Workshop 1, classroom format), and then address the practical aspects of migration to 3.0, metadata creation, and available tools (Part 2, Workshop 5, lab format). Attendees can register for the full-day workshop or either half-day session, depending on their needs and interests.

Topics to be covered:

  • Conceptual differences between DDI and traditional documentation (including codebooks and DDI 1 and 2)
  • Utility, functionality, and uses of DDI (parse-ability, re-usability, flexibility, use over the life cycle of data, human-usable documentation, software-usable documentation, metadata as data, etc.), with many examples
  • Key features of DDI 3
  • DDI Lite
  • The future of DDI and data documentation

Intended Audience: Anyone interested in DDI; no prior knowledge of DDI or XML is required. The morning workshop will present basic concepts of DDI.

W2: Introduction to GIS (4059 Shapiro Library)

Session Chair: Karl Longstreth, University of Michigan

This hands-on workshop will involve familiarizing participants with GIS software and working with an actual dataset in one of the packages. More detailed information about the workshop will be forthcoming.

Intended Audience: Individuals with little or no previous experience using GIS software.

W3: Introduction to Data Librarianship (4041 Shapiro Library; DIAD)

Session Chair: Paul Bern, Syracuse University

This workshop will serve as an introduction to the processes and challenges of being a Data Librarian. Using real questions from real users, this hands-on workshop will go through the process of:

  1. Acquisition -- finding and obtaining data
  2. Archiving -- preserving and cataloging data for daily and long-term use
  3. Access -- providing the means for users to get the data
  4. Assistance -- helping users make sense of data and metadata in order to get the file or files in a format they can use

Participants are encouraged to bring their own experiences to share and explore with one another. No prior experience will be necessary to attend.

Intended Audience: Individuals providing data services.

W4: Building an SDA Archive (G150A Perry Bldg.)

Session Chair: Tom Piazza and Charlie Thomas, University of California, Berkeley

This workshop will provide instruction to participants interested in building an SDA archive for online analysis. Many IASSIST members use SDA but find it difficult to set up an SDA archive and then to add new datasets to the archive.

The workshop will cover the various steps that one needs to negotiate in implementing an SDA archive. It will also show archivists how to use some newly developed procedures that facilitate the addition of datasets to an SDA archive.

Participants are encouraged to bring a data file (ASCII, fixed columns) and a matching DDI file to the workshop (on a diskette or a CD). They will be able to install that dataset on an SDA site for online analysis. Various other test files will be provided.

The Web site with materials for the workshop can be found at: http://sda.berkeley.edu/workshop/iassist06

Intended Audience: Individuals interested in creating an SDA archive.

W5: Hands-On DDI 3.0 - Concept, Structure, and Tools (G150A Perry Bldg.)

Session Chair: Jim Jacobs, University of California, San Diego, and Wendy Thomas, University of Minnesota

This workshop is Part 2 of a two-part workshop, with the first part offered in the morning as Workshop 1 (A Gentle Introduction to DDI - What's in it for Me?).

DDI Version 3.0 is currently under review by the DDI Expert Committee and expected to move into public review following the DDI meeting during IASSIST. This long-anticipated move toward a modular approach based on the data life cycle brings increased coverage of comparative data, an instrumentation/questionnaire module, and data management provisions. The new version also raises questions as to what it means for current users of DDI 1 or 2 and what it means for data archivists and programmers. This two-part workshop will cover the broad questions of version differences, new 3.0 features, and the future of the DDI and data documentation (Part 1, Workshop 1, classroom format), and then address the practical aspects of migration to 3.0, metadata creation, and available tools (Part 2, Workshop 5, lab format). Attendees can register for the full-day workshop or either half-day session, depending on their needs and interests.

This will be a hands-on workshop in a computer lab. Attendees are encouraged to bring their own DDI files and documentation for lab work, but samples will be provided.

Topics to be covered:

  • Quick overview of conceptual changes from DDI 1/2 to 3
  • Specifics of DDI 3: reusable classes, time, geography, comparative, instrument
  • Logical vs. physical structure of data
  • Migrating from DDI 2 to 3
  • Existing and planned tools and their functionality
  • Examples of use

Intended Audience: Attendees of the morning session and those already familiar with DDI 1 or 2. The afternoon workshop will present more specific information about technical details of DDI 3.

W6: Statistical Literacy and Learning Objects (4041 Shapiro Library; DIAD)

Session Chair: Milo Schield and Cynthia Schield, W.M. Keck Statistical Literacy Project

The 2002 Statistical Literacy Survey found that students, data analysts, and college instructors need help in forming ordinary English descriptions and comparisons of the rates and percentages presented in tables and graphs. The W.M. Keck Statistical Literacy Project developed a Web-based drill program that decodes students' descriptions and comparisons and gives users feedback on their errors. Students for whom English is not their native language may find this program very helpful. This statistical literacy learning object may be useful to students in the social sciences who need to be able to communicate statistical summaries involving rates and percentages. The goal of this workshop is to introduce users to the online program as a learning object. Those who complete this workshop should have the material they need to duplicate this workshop at their home institution.

Intended Audience: Individuals interested in statistical literacy.

W7: Using ATLAS.ti to Explore Archived Qualitative Data (4059 Shapiro Library)

Session Chair: Libby Bishop and Louise Corti, UK Data Archive, University of Essex

This workshop will present an overview of the uses and range of computer-assisted qualitative data analysis software (CAQDAS) packages. Through hands-on sessions and exercises focusing on the software ATLAS.ti, participants will be introduced to the particular applications and key functions of the software. Archived qualitative data from ESDS Qualidata will be used as the data sources. The session is intended to be practical and intensive and aims to get participants started with the software by familiarizing them with the following:

  • Initial usage tools
  • Data preparation considerations
  • Importing data into software
  • "Coding" of data (attaching thematic labels to segments of data)
  • Search and retrieval of coded data
  • Use of annotation and memoing tools
  • Exporting data

Intended Audience: Individuals interested in learning about qualitative data analysis software. The workshop assumes little or no experience with Atlas-ti or other qualitative software packages.

Plenaries

Plenary 1: Cyberinfrastructure and the Social Sciences (Pendleton Room)

Speaker: Daniel Atkins, School of Information, University of Michigan; Discussant: Bjorn Henrichsen, Norwegian Social Science Data Services; Moderator: Myron Gutmann, ICPSR, University of Michigan

Daniel E. Atkins, a professor in the School of Information and in the Department of Electrical and Computer Engineering at the University of Michigan, is the newly appointed Director of the Office of Cyberinfrastructure for the National Science Foundation (NSF). Dr. Atkins served as Chair of NSF's Blue-Ribbon Advisory Panel on Cyberinfrastructure.

Myron P. Gutmann is Professor of History and Director of the Inter-university Consortium for Political and Social Research at the University of Michigan. As Director of ICPSR, he is a leader in the archiving and dissemination of electronic research materials related to society, population, and health.

Bjorn Henrichsen, Director of the Norwegian Social Science Data Service, recently served as Chair of a Working Group contributing to the European Commission's Report "Towards New Research Infrastructures for Europe: The ESFRI List of Opportunities." He is active in the Council of European Social Science Data Archives (CESSDA).

Plenary 2: Disclosure Risk Limitation in Social Science Data: A Plenary in Honor of Pat Doyle (Pendleton Room)

Speaker: Julia Lane, National Opinion Research Center; Discussants: Robert Groves, University of Michigan, and Cynthia Cook, Statistics Canada; Moderator: Judith Rowe, Princeton University (retired)

This IASSIST plenary is undertaken to honor the life and contributions of Pat Doyle, a former Survey Improvement Coordinator for the U.S. Census Bureau's Demographic Surveys Division and active IASSIST and DDI member and contributor before her death in 2004. A staunch advocate for data sharing and access, Pat wrote extensively on confidentiality and disclosure issues in microdata. We remember not only Pat's many professional contributions but also her vibrant personality and spirit.

The main presentation in this plenary will focus on "Optimizing the Use of Microdata: An Overview of the Issues." Julia Lane will discuss the two substantial challenges that face collectors and producers of economic data: The first is how can the information derived from vast streams of data on human beings be used while protecting confidentiality? The second is the essence of good science: how can society best provide and promote access to rich and sensitive data so that empirical results can be generalized and replicated?

Julia Lane is a Senior Vice President and Director of Economics, Labor and Population Studies at the National Opinion Research Center at the University of Chicago and a Senior Research Fellow at the U.S. Bureau of the Census. She was previously an Economics Program Director at the National Science Foundation. She is the author of Confidentiality, Disclosure and Data Access: Theory and Practical Applications for Statistical Agencies (North Holland, 2001), coedited with Pat Doyle, Laura Zayatz and Jules Theeuwes.

Robert Groves is Professor of Sociology and Director of the Survey Research Center in the Institute for Social Research at the University of Michigan. Dr. Groves studies how alternative research designs affect the utility of data collected. He is an investigator on a Survey Research Center-ICPSR project on "Human Subject Protection and Disclosure Risk Analysis" funded by NICHD.

Cynthia Cook is Southwestern Ontario Regional Supervisor of the Statistics Canada Research Data Centre Program. The Research Data Centres (RDC) program is part of an initiative by Statistics Canada, the Social Sciences and Humanities Research Council (SSHRC) and university consortia to help strengthen Canada's social research capacity and to support the policy research community.

Judith Rowe retired after more than 30 years managing data and statistical services at Princeton University. During that period she served as President of both IASSIST and APDU and as chair of COPAFS as well as a member of the Depository Library Council and of numerous ALA committees, including the Subcommittee that wrote the initial rules for the cataloging of computer files, the Census Advisory Committee, and the advisory committee that recommended including computer materials as part of the depository library program.

Plenary 3: Toward a System of Trusted Digital Repositories (Pendleton Room)

Speaker: Robin Dale, Research Libraries Group; Discussant: David McMillen, National Archives and Records Administration; Moderator: John P. Wilkin, University of Michigan Library

Robin Dale, digital preservation expert for the Research Libraries Group, is co-chair of a joint digital repository certification task force between RLG and the National Archives and Records Administration (NARA). The RLG-NARA task force recently published a draft checklist for certifying digital repositories. As project manager of the Center for Research Libraries Mellon-funded Auditing and Certification of Digital Repositories project, she is leveraging the RLG-NARA checklist to conduct test audits of archives as well as formulate and model the processes and activities required to audit and certify digital archives.

David McMillen holds the recently created position of External Affairs Liaison at the US National Archives and Records Administration (NARA) where he manages the planning and execution of a continuous program of liaison and partnering with allied professional, scientific, and technical organizations. He comes to the National Archives from the professional staff of the House of Representatives Committee on Government Reform where he served from 1995 to the present and the corresponding Senate Committee from 1991 to 1995.

John P. Wilkin is the Associate University Librarian for Library Information Technology (LIT) and for Technical and Access Services (TAS) at the University of Michigan Library. He has coordinated all phases of Michigan's large-scale digitization effort with Google. Previously, he served as Head of the Digital Library Production Service from its inception in 1996 and founded the Humanities Text Initiative in 1994, also at the University of Michigan.

Concurrent Sessions

A1: Leading Users to Knowledge: Data Librarians to the Rescue (Pendleton Room)

Session Chair: Stuart Macdonald, University of Edinburgh

Keeping Current in Social Science Data (Without Paddling Upstream)
Joanne Juhnke, University of Wisconsin-Madison

Keeping current with the field of social science data, in a world of networked knowledge, is no trivial undertaking. On the one hand, the proliferation of information can challenge even seasoned professionals, while presenting a daunting array of possibilities to a newcomer in the field. On the other hand, the latest news and information about data can sometimes be quite well-hidden. This presentation will examine sources and strategies for balancing the flow of data-news while keeping abreast of news and developments in the field. A companion Web site will be available at the Data & Program Library Service (DPLS), UW-Madison.

Education on the Fly for the Accidental Library Data Professional: Design Your Professional Publication
Michele Hayslett, North Carolina State University Libraries

Interest has been expressed in a professional publication addressing the needs of data professionals working in libraries. Within the context of libraries, issues common to many types of data professionals take on special significance -- collection management, metadata developments, tools targeting specific academic user groups, preservation, and so on. Is there sufficient interest among data professionals in libraries to support a stand-alone publication or development of regular features in a pre-existing one? What topics are potential readers interested in? What formats are preferred? This session will be a facilitated discussion, beginning with presentation of initial survey results and alternative publication models, to exchange information about the needs of the group and devise a publication with the broadest utility.

Social Science Data Librarianship: A University Curriculum
Fredric Gey, University of California at Berkeley, and Frank Olken, Lawrence Berkeley National Laboratory

We describe a comprehensive curriculum for Social Science Data Librarianship to be incorporated into the graduate programs at major universities to offer a Master's Degree with specialization in social science data librarianship and to define a PhD degree concentrating on the research issues which affect the creation, storage, retrieval, indexing, and use of quantitative social science data. Courses in social science datasets, statistical database management, metadata and data semantics, data library operation, statistical disclosure analysis, and networking are described in detail. A strawman course requirements outline leading to the Master's Degree is also described. Possible institutional homes within the university setting are described, such as Library Schools, Information Systems Schools, Computer and Information Science Departments, Social Science Divisions, and Public Health Schools.

Blending Traditional and Data Librarianship
Jennifer Darragh, Pennsylvania State University; Paula Lackie, Carleton College; and Tiffani Conner, University of Connecticut

During this presentation we will address two areas of data librarianship that extend the traditional LIS educational curriculum, the reference interview and instruction. We will discuss the similarities of both elements of librarianship with special attention on extensions to basic training received in library and information science course work. This presentation is a precursor to our interactive poster session entitled Building Outreach and Dialog-Data Librarianship: The Continuing LIS Education.

A2: The Essential Role of Metadata in Resource Discovery (Kuenzel Room)

Session Chair: Tess Trost, Texas Tech University (retired)

Everything but the Kitchen Sink: Building a Metadata Repository for Time Series Data at the Federal Reserve Board
San Cannon and Meredith Krug, Federal Reserve Board

The research divisions at the Federal Reserve Board use a variety of time series data for both research and forecasting in support of its duty to conduct monetary policy for the United States. The collection, maintenance, and upkeep of more than 50,000 time series from more than 60 sources in a central location are daunting tasks; the documenting of the metadata for the compilation and use of these data are even more so. We are currently building a comprehensive metadata repository that links three kinds of metadata about our time series: structural metadata describing the series themselves; reference metadata describing the collection and construction of the aggregate time series by the issuing agency; and operational metadata documenting our procedures for retrieving, processing, and maintaining the data. Many of the pieces to the puzzle currently exist in a disparate array of formats: attributes in a proprietary database, HTML pages on a Web site, Word documents buried on a file server, etc. We are bringing these pieces of information together in a relational database setting to allow users to search for and see all the relevant metadata for a particular series or economic concept. In addition, we have the challenge of making the entries time-sensitive to accommodate the library of vintage or "real time" data we are building for future research.

Research-Based Metadata Requirements for a BLS Reports Archive
Scott C. Berridge, John J. Bosley, and Daniel W. Gillman, U.S. Bureau of Labor Statistics

The U.S. Bureau of Labor Statistics' (BLS) Office of Publications staff is building an archive of economic reports dating from the late 1800s. The archived material will be available online through the BLS Web site (http://www.bls.gov) as PDF files. Appropriate metadata need to be integrated with the archive material to help users find and identify relevant content. Candidate metadata elements were selected from the DDI. User studies will be performed to verify that the selected metadata elements help users search successfully. Initial studies will elicit descriptions of metadata that users want to see associated with archival material, compare those choices with the candidate DDI elements, and revise the set if appropriate. Then users will test the revised metadata in realistic scripted searches of the archive. The talk will describe the project, the selection process for the metadata elements, and the methods and results of early user studies.

The Madiera Portal: Unified Access to European Data Resources
Alette Gilhus Mykkeltvedt, Norwegian Social Science Data Services

The Madiera portal is a Web-based infrastructure populated with a variety of data and resources from a selection of providers. The portal can be seen as a European virtual library giving unified access to European social science data archives. The building blocks of the portal are a common metadata standard (a cross-national standardised implementation of DDI), a technological platform based on Nesstar software, and a multilingual thesaurus breaking the language barriers. The portal enables you to search for data, browse documentation, analyse datasets online, and download. The Madiera portal is a result of the Madiera project (Multilingual Access to Data Infrastructures of the European Research Area), funded by the European Commission under the Fifth Framework programme. The portal is available at www.madiera.net.

Enabling Discovery, Integration, and Understanding of Criminal Justice Statistical Information: Developing a Metadata Application Profile
Carol A. Hert, University of Washington, Tacoma, and Sheila O. Denn, University of North Carolina

This project, funded by the Bureau of Justice Statistics (BJS), has the goal of developing and testing a metadata schema to support end-user discovery of criminal justice statistical information. Project partners are BJS, the National Archive of Criminal Justice Data, the Federal Criminal Justice Resource Center, Sourcebook of Criminal Justice Statistics, the FBI Uniform Crime Reports, and the U.S. Office of Juvenile Justice and Delinquency Prevention. The schema draws on other metadata standardization efforts including DDI, ISO1179, SDMX, and NIEM. In addition to schema development, we are undertaking user studies to better understand how the schema can best facilitate end-user discovery activities. By the time of the IASSIST meeting, we will have completed schema development and testing, and be engaged in user studies. We will report on the development activities with a focus on explicating the connections to other schemas and associated development issues. In addition, we will present an overview of the user studies and present findings to date.

A3: Innovations in Data Dissemination (Art Lounge)

Session Chair: Dan Tsang, University of California at Irvine

What's New With SDA?
Tom Piazza, University of California at Berkeley

This presentation will describe two new features of SDA:

  1. A new user interface which eliminates the need to enter the names of variables into the option screens for the analysis programs.
  2. Procedures to facilitate the addition of datasets to an SDA archive. (These procedures will be described in more detail in Workshop #4 on "Building an SDA Archive.")

Sociometrics
Josefina J. Card, Sociometrics Corporation

This paper will present the current state (scientific content, formats, platforms, distribution partners) of the Sociometrics Data Archives. It will then peer into the future by describing areas of topical expansion, new target audiences, and new resources to be built around the Sociometrics data archives.

University Information System RUSSIA: Database and Value-Added Service for Investigations of Life Quality and Economic Welfare of Households and Individuals in Russia
Tatyana Yudina, Moscow State University

Described will be a new database under the University Information System RUSSIA (UIS RUSSIA, www.cir.ru) project. There is no practice of regular household surveys at the government level in Russia. The first household-based survey, National Survey of Households Well-being and Participation in Social Programs, took place in 2003 and covered 45,000 households in 46 Russian regions. It includes 227 variables aggregated to 13 parts. These survey results are the initial data holdings downloaded into the database. Our database provides for almost the full range of services under the Harvard-MIT Data Center's Virtual Data Center and SDA Archive. In the second stage of our project other resources developed in Russia and in international research centers will be included. The most famous project is the Russian Longitudinal Monitoring Survey, which covers 13 nationally representative surveys beginning in 1992, conducted by the Carolina Population Center at the University of North Carolina at Chapel Hill in collaboration with the Russian Federal Statistics Agency and several Russian institutes. As a next stage of our project knowledge products will be integrated. An ontology to provide for content-based indexing and search is under construction.

University Information System RUSSIA: Database and Value-Added Service for Investigations of Life Quality and Economic Welfare of Households and Individuals in Russia
Anna Bogomolova, Moscow State University

University Information System RUSSIA (UIS RUSSIA, www.cir.ru) has been designed and is maintained as a digital thematic library for research and education in economics and the social sciences, in operation since January 2000. The most requested module is RF state statistics. Value-added services for economic and social statistics is a main direction of the UIS RUSSIA development. In 2005 we began to create a database for investigations of life quality and economic welfare of households and individuals. The primary survey that is downloaded to the database is the National Survey of Households Well-being and Participation in Social Programs. The survey took place in 2003 and covered 45,000 households in 46 Russian regions. It includes 227 variables aggregated to 13 parts. While working on the database we investigated the experience of Harvard-MIT Data Center's Virtual Data Center and SDA Archive.

B1: Institutional Repositories and Social Science Data: Supporting the Data Life Cycle (Pendleton Room)

Session Chair: Ann Green, Digital Life Cycle Research and Consulting

This panel discussion will explore how various models of digital repositories provide the processes and services required to support the social science data life cycle, and how these repositories and archives might fit into a common landscape. Participants will discuss the intersecting missions, roles, and relationships among institution-based digital repositories, topical archives, and discipline, or domain, specific data archives.

Chuck Humphrey, University of Alberta

Robin Rice, University of Edinburgh, EDINA

Larry McGill, Princeton University

Ron Jantz, Rutgers University Libraries

Jim Ottaviani, University of Michigan Library

B2: Managing Metadata: Archival Processing (Kuenzel Room)

Session Chair: Wendy Thomas, University of Minnesota

Efficient Ingest of Datasets in a Two-Stage Archival Process: The First Phase - Easy-Store
Marion Wittenberg and Rutger Kramer, DANS - Data Archiving and Networked Services, The Hague, The Netherlands

DANS - Data Archiving and Networked Services - is the organization responsible for storing and providing permanent access to research data from the humanities and social sciences in the Netherlands. As such, it is expected that DANS will have to ingest and manage a very large number of datasets. The traditional process of data archiving, i.e., having archivists enter extensive metadata for each incoming dataset, is likely to put a strain on personnel entering the metadata. DANS is setting up a two-stage archival process which will be able to cope with large amounts of submissions. In the first phase, nicknamed Easy-Store, datasets will be archived in a simple yet robust archival system. The second stage, called Deep-Store, will go into the details of a dataset and will only be executed for particular datasets. The paper will focus on the specifics of the Easy-Store system. We will give an overview of the concepts, the requirements, the architecture, temporary results, possibilities and future work on the system, and we'll have a first evaluation of the place the system takes in the two-stage archival process.

Metadata Management: The Forgotten World of the Back Office
Anne Etheridge, Economic and Social Data Service (ESDS)

An overlooked section of the digital data life cycle is that of metadata entry. Too often, records are populated by copying and pasting from one form to another; and frequently updating Web pages are given to non-Web editors to revise. Rational and efficient management and development of the 'back office', which feeds the information end users see, is often not a high organisational priority. This paper highlights the strategy ESDS has used to make cataloguing and Web page updates straightforward. There is an almost seamless transition from data deposit forms to catalogue records, lessening the possibility of errors. Databases are used for updating of events, online booking, news, and staff pages with information input through a simple interface appearing on Web pages instantly. All of these make repetitive tasks obsolete and leave time for staff to concentrate on more interesting matters.

Smart Qualitative Data: Methods and Community Tools for Data Mark-Up (SQUAD)
Louise Corti and Libby Bishop, UK Data Archive

This paper will provide an overview of the SQUAD project. SQUAD is a demonstrator project funded under the ESRC Qualitative Data Archiving and Dissemination Scheme (QUADS). The project is exploring methodological and technical solutions for exposing digital qualitative data to make them fully shareable and exploitable. First, the project deals with specifying and testing non-proprietary means of storing and marking up data using universal (XML) standards and technologies, and proposes an XML community standard (schema) that will be applicable to most qualitative data. The second strand investigates optimal requirements for contextualising research data (e.g., interview setting or interviewer characteristics), aiming to develop standards for data documentation and ways of capturing this information. The third strand aims to use natural language processing technology to develop and implement user-friendly tools for semi-automating processes to prepare qualitative data (format and mark-up using TEI) for both digital archiving and linking with other kinds of Web-enabled data and information sources. We will demonstrate some early versions of graphic user interfaces to natural language processing tools, including the data anonymising tool.

Building Infrastructure and Alliances to Meet Common Goals: "The Creation of a Canadian Public Opinion Data Index"
Laine Ruus, University of Toronto, and Lois Timms-Ferrara, Roper Center for Public Opinion Research, University of Connecticut

The University of Connecticut's Canadian Studies, the Roper Center, and the University of Toronto's Robarts Library were awarded a small grant to develop a set of finding aids to unlock the Canadian opinion archives for the purpose of strengthening Canadian Studies Programs in both Canada and the United States. The Canadian Embassy funded the pilot project, the result of which is Canadian iPOLL (CPOLL). This paper discusses the decision-making and collaboration in creating this resource. The design of CPOLL is built upon experiences garnered from two other Roper Center databases: iPOLL (U.S. opinion data) and JPOLL (Japanese opinion data). Collaboration with Canadian data experts assured understanding of cultural nuances, political processes, and the nature of data collection in Canada. The paper addresses efforts to assure consistency in the development of metadata, including coding of topics of coverage, and the decisions involving the selection process, from time period for inclusion to assuring a broad set of sources. Finally, this paper explores the lessons learned from this endeavor and implications for further facilitating cross-national opinion research and creating multiple country databases in the future.

B3: Compare and Contrast: Using Cross-National Data (Art Lounge)

Session Chair: Jane Weintrop, Columbia University

International Comparative Data: Advice to Neophytes
Susan Hook Czarnocki, McGill University

This presentation is aimed at those who are starting up the learning curve on all the international socioeconomic data sources out there. Comparisons of coverage, ease of use, advantages and disadvantages will be presented for services such as WDI, IFS, EIU WorldDATA, UN Data bases, etc. A secondary focus will evaluate what else is worth exploring besides the big, well-known data providers just mentioned.

Evaluating the Quantity and Quality of Publicly Available Cross-National Crime Data
Janet P. Stamatel, University at Albany

Recent world events have raised questions about the nature and distribution of violence and other types of crime across different countries and regions of the world. What data are publicly available to researchers and policymakers in order to better understand cross-national differences in crime rates? This paper describes the sources of cross-national crime and justice data and discusses several important issues regarding data access and utility, such as the format of the available data, geographic coverage, temporal coverage, and comparability of indicators. It examines how data availability and accessibility have changed with improvements in information technology and globalization and it assesses the impact of these changes on cross-national crime research. It ends with suggestions for improving access and usage of these data and gives examples of important research and policy questions that could be answered with better cross-national crime and justice data.

Let's Qualify What is Quantified: The Language of Change - Teachers and Their Expressions of Change in Six Countries
Nora Arato, University of Michigan School of Nursing

The research results explore teachers' language(s) about change that has impacted their work lives in six countries (Australia, Hungary, Israel, Netherlands, South Africa, USA). This study succeeds and is built on a large-scale cross-cultural and comparative study of teachers' perceptions of educational change (New Realities of Secondary Teachers' Work Lives, eds. Poppleton, P. & Williamson, J, 2004, Oxford, UK: Symposium Books). We further analyze and compare the responses teachers gave to a semi-structured interview including an open-ended questionnaire about political, economic, administrative, and curricular changes in their work-lives. Guidelines developed by cognitive linguists are utilized in order to compare (1) how teachers describe educational change in six of the nine countries, (2) what teachers' language teaches us about the meaning of educational change in different countries, (3) what similarities and differences are used for concrete concepts describing teachers' work-lives, and (4) how the qualitative data complement and underscore the quantitative data.

Association of Religion Data Archives
Brian J. Grim, Pennsylvania State University Survey Research Center and the ARDA (Association of Religion Data Archives)

Religion's prominence in national and international affairs makes the availability of empirical measures on religion a pressing concern for researchers, policymakers, and data archivists. Unfortunately, good international religious data are scarce. This paper describes the expanded mission of the Association of Religion Data Archives ( www.TheARDA.com) to archive and develop data on religion worldwide. The ARDA archives data on 238 different countries and territories including ARDA-coded measures from the US State Department's annual International Religious Freedom Reports. The data also include social scientific surveys such as the International Social Survey Programme (ISSP). Country-specific data will also be archived, e.g., ABC poll data from Afghanistan. Finally, this paper describes the way the ARDA "democratizes" accesses to these freely downloadable data by making them available with online analysis options. (The ARDA was formerly the American Religion Data Archive and continues to support an extensive American collection with numerous mapping and report features.)

C1: Data Issues in the Sciences: An Environmental Scan (Pendleton Room)

Session Chair: Gretchen Gano, Yale University

This session offers an "environmental scan" of data issues and initiatives in the science, technology, and engineering communities. It will provide a high-level view of the data management landscape, covering issues related to data access, documentation, preservation, and use for research, teaching, and policymaking. Representatives from the Committee on Data for Science and Technology (CODATA), the Science Commons, and other key S&T data organizations will describe current initiatives of their organizations as these relate to core themes in data management. This overview of issues in the wider data community will provide a platform for discussion and open avenues for collaboration between the communities on issues of common interest. Topics will include a review of the intellectual property and policy regime relating to scientific and technical data activities, changes in the requirements for data archiving by government agencies such as NIH and NSF, governmental policies related to data and public information, legal protection of databases, open access initiatives, e-science models, and standards.

Data Access and Preservation across the Sciences: New Ideas and Initiatives
Bob Chen, Center for International Earth Science Information Network (CIESIN)

Recent editorials in Science and Nature (Iwata and Chen, 2005; Nature, 2005) have called for expanded efforts to make scientific data and information more accessible, especially across the so-called "digital divide". Open access can not only benefit scientific research, but also facilitate the application of scientific results to pressing problems of environment and development and support the evolution of an equitable and open information society. CODATA, the Committee on Data for Science and Technology of the International Council for Science, launched a Global Information Commons for Science initiative at the November 2005 World Summit on the Information Society. The objective of the initiative is to coordinate and promote a range of national and international open access efforts, and in particular to provide leadership on key international data policy issues. This paper will highlight a range of open access activities and also address other pressing science data management issues facing the scientific community such as long-term preservation.

The Science Commons Data Project
John Wilbanks, Science Commons

Science Commons, a project of the non-profit corporation, Creative Commons, has recently launched an initiative to explore ways to assure broad access to scientific data. There is a distinct set of problems emerging around the issues posed by scientific data online. First, current expansions in intellectual property law could generate an entirely new set of obstacles to sharing data among scientists or with the public. Second, the congruence of Web-enabled database access with the widespread availability of rapid, low-cost gene sequencing and abstract, engineerable biological parts has had an unforeseen effect: there is growing uncertainty of how to store, distribute, license, and provide functional information to specify genetic function under the law. Third, there is a wasteful data economy evolving in which raw data are not made accessible; scientists are either leery of the risks of losing control over their data or subject to institutional requirements that mandate a closed approach.

The Scientific Data Commons and Non-conventional Sources
Harlan Onsrud, University of Maine

Most scientific data efforts focus on collection and maintenance of data "by scientists for scientists." Yet across the globe individuals and organizations are gathering detailed local-level data that could be of immense value to social, physical, and biological scientists that is for all practical purposes hidden from their view. These locally collected detailed data are typically unobservable through remote sensors and are being accumulated through on-the-ground direct observations or interpretation. This presentation focuses on incentives for sharing and on technological and legal mechanisms to support incentives for sharing. It outlines a conceptual model and the accompanying research challenges for providing easy legal and technological mechanisms by which any creator might affirmatively and permanently mark and make accessible a location-referenced dataset such that the world knows where the dataset came from and that the data are available for use without the law assuming that the user must first acquire permission.

C2: Effective Design for Data-Rich Web Sites (Kuenzel Room)

Session Chair: Mary Vardigan, ICPSR, University of Michigan

Evaluation of Web Sites: What Works and What Doesn't
Sue Ellen Hansen, Survey Research Operations, University of Michigan, and Matthew Richardson, ICPSR, University of Michigan

This presentation will focus on an assessment of Web sites that disseminate social science data, noting common organizational schemes and characteristics that most user-friendly sites share. The presentation will also delve into specific usability and accessibility issues that arise when developing interfaces for the purpose of data dissemination.

Building Data-Rich Web Sites: The Integration Projects of the Minnesota Population Center
Bill Block, University of Minnesota Population Studies Center

The Minnesota Population Center (MPC) is a leading developer and disseminator of demographic data over the Internet. This presentation will showcase two flagship MPC projects, IPUMS-USA and IPUMS-International, that together generate thousands of data extracts per month for researchers around the world. While successful, these and other MPC Web sites are constantly being asked to present ever-growing amounts of increasingly complex data, yet maintain the simplicity and ease-of-use for which MPC sites are known. The second part of this presentation will describe the challenges created by our ever-growing mountain of data, as well as ways in which we are working to offer large amounts of complex data easily over the Web.

Best Practices for Designing and Building Highly Interactive and Data-Aware Web Sites
Mark Gregor, Velir Studios

This presentation will provide demonstrations of four Web sites that present complex data in visually compelling ways. Presentation methods for creating on-the-fly line graphs, bar charts, tables, and GIS-based maps will be discussed. This presentation will also address ways to provide a high level of user control over data without sacrificing usability and simplicity.

C3: Effective Strategies for Metadata Management (Art Lounge)

Session Chair: San Cannon, Federal Reserve Board

International Household Survey Network: Microdata Management Toolkit
Pascal Heus, International Household Survey Network

The International Household Survey Network (http://www.surveynetwork.org), with the support of the World Bank, has completed the development of the Microdata Management Toolkit, a set of DDI and Dublin Core based tools to facilitate the archiving and dissemination of survey data and metadata. The use of the Toolkit by developing countries and international organizations will greatly support the global adoption of metadata standards and facilitate the creation of national digital survey repositories. More than 50 countries are targeted for training and deployment in 2006. This presentation reports on the status and progress of the project.

Implementing a National Data Archive in Ethiopia: Challenges and Experience
Yakob Mudesir Seid, Ethiopia Central Statistical Agency

The priority for the Central Statistical Agency (CSA) of Ethiopia is to aggressively improve its data collection, management, and dissemination framework through an effective use of Information Communication Technology (ICT). In July 2004, CSA created a new Information Communication Technology Development Department (ICT Department) to support and make such vision a reality. The action plan aims at the improvement of the ICT capacity to support the development of a Central Databank, the establishment of a socioeconomic database, and the implementation of a user-friendly dissemination system. To ensure compliance with international practices, we have adopted the World Bank Microdata Management Toolkit as a standard tool and therefore use the Data Documentation Initiative (DDI) specification as the basis for the compilation of the metadata and micro-level data. This presentation outlines the status and progress of the project and shares our experience in meeting the challenges of implementing a national data archive in Ethiopia.

Microdata Information System MISSY
Andrea Janssen, Jeanette Bohr, and Joachim Wackerow, GESIS/ZUMA (Centre for Survey Research and Methodology)

The MISSY provides online information for the German Microcensus in a structured design. The Microcensus is a multipurpose annual sample which covers 1 percent of the German households. It is produced by the Federal Statistical Office. Though the Microcensus was originally not designed for research, it is accessible as scientific use files. Because it is of great value for the scientific community, there is a need for knowledge transfer from the federal office to the scientific community. MISSY offers the metadata both in a broad and differentiated way. MISSY takes different aspects of data documentation into account: The central part of the sample and data description are based on DDI (Data Documentation Initiative). Furthermore, additional information concerning methodical and scientific subjects are integrated to improve the usability of the Microcensus. Another aspect is related to the organization of information: related metadata in MISSY is linked in multiple ways, guided by different views on the subject. In addition, different possibilities for access facilitate the search of information and consider different needs and skills of scientists. The presentation will introduce the structure of MISSY.

Roadmap for DDI Tools Development: "Birds of a Feather" Meeting

Moderator: Ron Nakao, Stanford University
Sponsored by: IASSIST Data Documentation and Citation Interest Group and the DDI Alliance

The goal of this session is to create a vision for a collection of open source and other programs for the DDI community. Ideally, the tools would provide reusable components from which other tools could benefit. This session will be of interest to those who:

  • Have a need for DDI tools
  • Have tools or ideas for tools that could be helpful to DDI users
  • Are interested in making a contribution

Pizza and soft drinks will be provided.

D1: Data Life Cycle Management and the Digital Repository: FEDORA-Based Initiatives (Pendleton Room)

Session Chair: Robin Rice, EDINA and Edinburgh University Data Library

This session discusses data life cycle management in the context of a digital repository, focusing on the use of the open source repository software, FEDORA. Representatives from several FEDORA-based projects will discuss ingest, expression, and indexing of metadata (including DDI), Web-service development, and preservation features as these topics pertain to numeric data collections. FEDORA, short for Flexible Extensible Digital Object Repository Architecture, is an open-source software that features a flexible service-oriented architecture for managing and delivering digital content. At its core is a powerful digital object model that supports multiple views of each digital object and the relationships among digital objects. Digital objects can encapsulate locally managed content or make reference to remote content. Dynamic views are possible by associating Web services with objects.

A FEDORA-Based Institutional Repository to Support Multidisciplinary Collections
Ron Jantz, Rutgers University Libraries

Institutional repositories must support both multidisciplinary collections and the preservation of those collections that are intended to be persistent. These goals are challenging from many perspectives including specifically the technological infrastructure and the emerging concept of becoming a "trusted" repository. The FEDORA framework provides a flexible and extensible environment for meeting the challenge of institutional repositories. This presentation will discuss the approach that Rutgers University Libraries has used to develop a FEDORA-based institutional repository with specific emphasis on the information architecture and services to support collections and digital preservation. Examples from data and cultural heritage collections will be used to illustrate the relevant concepts.

Exploring FEDORA's Possibilities to Create a Research Space for the Sciences
Donna J. Tolson, University of Virginia

After using FEDORA to develop a digital library repository model for text and images, resources especially central to scholarship in the Humanities, the University of Virginia has begun to explore the digital resource needs of the Sciences. Preliminary work has focused on the challenges of building an integrated information architecture that consolidates workspace, content, and tools vital to scientific research and specifically quantitative data. Using FEDORA architecture, a proof-of-concept project involving demographic, climate, and traffic data was developed to determine the challenges of ingesting datasets with very different characteristics, allow variable-level extraction, and provide standardized access to descriptive metadata at the variable level. Examples from the project will be included.

Migrating Numeric Data Collections into FEDORA
Gretchen Gano, Yale University

This presentation will outline the workflow associated with migrating social science data collections into FEDORA, focusing on the ingest process and the creation of preservation metadata appropriate for numeric data. It will enumerate components the make up a submission package: accounting for multiple types of descriptive metadata including DDI, as well as technical/preservation metadata appropriate for social science datasets. Issues of normalizing data files in proprietary formats for the purposes of long-term preservation will also be explored. Examples from the ongoing project to migrate the Yale Social Science Data Archive from an SQL database into FEDORA will be provided.

D2: Metadata Models: Mining and Retrieval (Kuenzel Room)

Session Chair: Melanie Wright, UK Data Archive and UK Economic and Social Data Service

Mine Your Data: Contrasting Data Mining Approaches to Numeric and Textual Data Sources
Louise Corti, UK Data Archive, and Karsten Boye Rasmussen, University of Southern Denmark

Data mining can be defined as exploration and analysis of large quantities of data in order to discover meaningful patterns and rules. For numeric data the process of this discovery is either directed or non-directed based upon whether there is a fact that we wish to explain through models of explanatory variables, or there is a search for patterns that can prove useful. Text mining is a variation on the field of data mining. It is the discovery by computer of new, previously unknown information, by automatically extracting and linking information from different textual sources. In both methods, automated processes help put together information to uncover new meanings or suggest new hypotheses to be explored further, typically by more conventional means of research. But what are the pros and cons of these methods and how does traditional social science data fit in? This joint paper will elaborate on the typologies of data and text mining, and provide examples and typical models that are relevant to social science and business data. The applicability for Data and Computational Grid applications (e-science) will also be highlighted.

Metadata by Design and Fielded Metadata: The Poles of a Space in Which Data Processing Takes Place
Reto Hadorn, Swiss Information and Data Archive Service for the Social Sciences

The approach usually taken when conceptualizing metadata is typically one of 'documentation', which supposes a reference object, e.g., data, and something to be told about it. That documentation is static; it describes data in a specific state, usually as ready for publication. The life-cycle idea introduces a new perspective. Sure, one can still reduce metadata to a report about what happened across the life cycle of the data. But there is also an opportunity to model metadata in a way to support work done to obtain data, process, edit and publish them. The following example will be developed. Because we don't use the integrated tools for handling metadata and data all over the life cycle, consistency between the two levels of information may be broken. We need a metadata model, which supports the comparison of metadata drawn from more than one source, e.g., the questionnaire, treated as 'metadata by design', and information extracted from a semi-documented data file, an SPSS data file for example, which stands as the 'fielded metadata'.

The Nature of Data
Frank Farance, Farance Inc., and Daniel W. Gillman, U.S. Bureau of Labor Statistics

Traditionally, the term "data" is defined by what data does, not what it is. What is data? Often, books and documents are called information. Are objects information? Do they contain information? Data are often defined in terms of information, vice versa, or in terms of some other undefined concept, such as knowledge. All this leads to much confusion. This paper is an attempt to shed light on these and related issues. Terminology theory is the study of concepts and their representations in special languages. It focuses on the essential characteristics of concepts, and therefore on what a concept is. Applying the theory, we define data in a new way, by defining what data is, by investigating its essential characteristics. This, in turn, provides a way to distinguish between data and information usefully. The role of metadata is clearly defined. Then, a definition of data element is derived. Implications for the Semantic Web are discussed.

D3: Enabling Access to Data: Promising Approaches (Anderson Room ABC)

Session Chair: Michelle Edwards, University of Guelph

The Special Licence Model for Access to More Detailed Microdata
Karen Dennison, UK Data Archive

Access to most microdata supplied by the UK Data Archive (UKDA) only requires user registration. Such data are fully anonymised and certain variables may be suppressed or aggregated to minimise disclosure risk. However, there is a research need for more detailed data, such as more precise geographic and occupation codes. To increase the range of data available for research, whilst continuing to safeguard the confidentiality pledge made to survey respondents, the Office for National Statistics (ONS), UKDA, and ESDS Government have developed a Special Licence (SL) and an associated guide to good practice. A range of more detailed ONS data are now available via this new access initiative. This paper describes the data available, discusses the SL model, and outlines the conditions for access to the more detailed data.

UK 2001 Census Microdata: Providing Access to Data Subject to Confidentiality Constraints
Jo Wathan, University of Manchester

Disclosure control issues are particularly salient to census microdata release. Data of this type do not benefit from protection arising from small samples. They are derived from the same source as tabular outputs. Additionally, they are subject to statutory requirements above and beyond that of standard data protection. This paper will describe the range of approaches used to ensure that research quality data from the 2001 Census were made available to the research community, following increased concern about confidentiality at the UK census offices. These solutions involved a mixture of broad banding, perturbation, access controls, and differing levels of licensing. A range of microdata are now available. Very detailed files are held in a safe setting; users travel to a secure site and leave outputs to be checked before release. Less detailed files are disseminated to licensed users. Hierarchical household data are subject to a special license.

Open Access Movement and Data
Jeffrey S. Bullington, University of Kansas Libraries

Given the theme 'Data in a Networked World of Knowledge', what importance does the Open Access Movement have for data? Calls have been made for systems ensuring that publicly funded research results remain in the public domain, adhering to Open Access values of no economic or use-restrictive barriers between knowledge and those who wish access to the same. These include the National Institutes of Health and Wellcome Trust efforts for biomedical research, the declaration from the Organization for Economic Cooperation and Development for open access to publicly funded research, and the United Nations World Summit on the Information Society discussing the same. And advocacy organizations such as the Alliance for Taxpayer Access have emerged, working to ensure that these values are realized. This paper examines and discusses some of these actors in the Open Access movement and how they may be seen to touch on 'data'.

E1: DDI for the Next Decade: Toward Version 3.0 (Part 1) (Pendleton Room)

Session Chair: Ron Nakao, Stanford University

Locating the Geographic Center of DDI 3.0
Wendy Thomas, University of Minnesota

For years the DDI has struggled with improving its ability to cover geographic information. Each of the past three revisions included new elements and attributes to address geography. The structural changes taking place in DDI 3.0 provided an opportunity to make major improvements in geographic information. A working group of the Expert Committee was formed to address the following needs: (1) Provide a means of describing geographic coverage that allows for more detail and better alignment with other description standards; (2) Describe geographic hierarchies and the relationship of those levels; and (3) Expand the ability to reference external maps and geographic data files (shape/boundary). The changes introduced in DDI 3.0 allow for improved description, searching, manipulating, and linking data based on geography. This presentation reviews what was changed, why it was done, and how it improves your ability to work with data.

Problems of Comparability in the German Microcensus Over Time and the New DDI Version 3.0
Jeanette Bohr, Andrea Janssen, and Joachim Wackerow, GESIS/ZUMA (Centre for Survey Research and Methodology)

The improvements of the new DDI version 3.0 (Data Documentation Initiative) will make it possible to document the coherences and variations of different census years on the basis of a standardized structure. This concept is realized in DDI 3.0 by the grouping model. The application of the new model will be illustrated by a selected documentation example of the German Microcensus. The Microcensus is a representative annual population sample containing structural population data of 1 percent of all households in Germany. A synoptical table including all variables for selected years shows which variables are comparable over time. This approach facilitates the work with Microcensuses of multiple years. To represent variable inconsistency in DDI, the grouping model offers the possibility to define information as a standard on a top level and to capture variations or additions on a lower level. The presentation will highlight the realization of the grouping model concerning the comparability of variables over time. Opportunities and limitations of documentation with DDI 3.0 will be pointed out and appropriate technical designs will be presented.

DDI Version 3 and Instrument Documentation
Karl Dinkelmann, Institute for Social Research, University of Michigan

This presentation will cover an overview of the work that the Data Documentation Initiative (DDI) Instrument Documentation working group has completed, leading up to the proposal of the Instrument Documentation (ID) Module to the DDI Structural Reform Group. We will delve 'lightly' into aspects of the new DDI-ID module, including, but not withstanding, new and exciting additions that allow more versatility in documenting survey instruments. We will present issues that have appeared as the IDWG reviewed the schema for the DDI-ID modules.

E2: Archival Partners: Handling "Born Digital" Materials (Kuenzel Room)

Session Chair: Peggy Adams, National Archives and Records Administration

"Born digital" records and data are of major importance to both traditional government archives and academic data archives. How can these two types of archives collaborate to assure proper preservation of such materials and their continued access to researchers? This panel will discuss examples of such cooperation and offer generic recommendations about sharing archival expertise in the areas of records appraisal, digital preservation, descriptive standards, and user services.

Peter Granda, ICPSR, University of Michigan

David Horrocks, Gerald R. Ford Presidential Library

Marc Maynard, The Roper Center for Public Opinion Research

Michael Carlson, National Archives and Records Administration

E3: Applications for Managing and Distributing Geospatial Data (Anderson Room ABC)

Session Chair: Michal Paneth-Peleg, The Hebrew University

An Update from Statistics Canada
Bernie Gloyn, Statistics Canada

In 1971, Statistics Canada became one of the first agencies to utilise a Geographic Information System (GIS) in support of the Canadian Census. Today, GIS is a integral part of a number of statistical programs at the Agency useful for internal operations, analysis and dissemination. Bernie Gloyn, formerly Assistant Director of the Geography Division, will review how the agency is making use of GIS in its statistical program and new developments to expect with the 2006 Census. This presentation will touch on the geography products/tools available from the 2001 Census, a historical perspective on Census data by some unique geographies, the available 2005 road network files before the Census, improvements with the postal code file, and what is coming for 2006.

Leveraging Resources through Partnerships: A Case Study of a Distributed Web Mapping Service
Michele Hayslett, North Carolina State University Libraries

North Carolina State University Libraries began a project in Fall 2005 focusing on deployment of a census data map service via the Open Geospatial Consortium (OGC) Web Map Service (WMS) protocol. The map service will be exposed for use within the NC OneMap system, which draws on map services made available from state, local, and federal agencies, and which serves as a component of the National Map. Through this partnership with the state GIS agency, the NC Center for Geographic Information and Analysis (CGIA), a gap in availability of demographic data within NC OneMap will be filled. The session will include discussion of the decision-making process regarding variable selection; a brief description of the technical setup and partnership arrangements with CGIA; and analysis of implementation issues.

Got Data? Google Map It!
Paul Bern, Syracuse University

Google Maps is the latest in Web delivery of GIS and data. Several sites have used Google's free Web interface to their mapping capability to show crime rates, apartment listings, and more. With some knowledge of Javascript, Perl and/or PHP, and a good database, anyone can deploy Web-based interactive maps. This presentation will discuss some of the applications already in use as well as explain some of the steps and details in creating a Google Map application for obtaining census information for the city of Syracuse, NY.

F1: We All Count: Quantitative Literacy Efforts and Approaches (Pendleton Room)

Session Chair: Libbie Stephenson, University of California at Los Angeles

Developing a Framework for Quantitative Literacy: Counting on IASSIST
Wendy Watkins, Carleton University

If there were a real question regarding the need for progress in Quantitative Literacy (QL), the 2003 International Adult Literacy Skills Survey's results on numeracy are illustrative of the answer. Of the seven participating countries, only Norway and Switzerland have a majority of their total populations able to function at a minimum level for success in everyday numeric situations. A problem in developing a QL program at the tertiary level is that it lacks a disciplinary home. While there is general agreement within the academy that it is an essential element of an overall education, no department appears willing to make QL a part of its curriculum. In contrast, standards in Information Literacy have been long-established and have gained wide acceptance. This paper will examine the processes by which these programs have become mainstream, and recommend approaches to develop a QL framework based on best practices.

Creating a Repository of Training Materials: The Canadian Experience
Jane Fry, Carleton University

Over the past nine years, many presentations, demonstrations, and workshops have been given at the four annual training sessions for the Data Liberation Initiative (DLI) across Canada. These sessions are rich in content and remain useful long after the initial presentation. However, if one were looking for a certain item, there was often a difficulty finding it because the material was stored in an ad hoc fashion and not archived in a central location. This became increasingly problematic for the trainers as the number of sessions grew. The Education Committee of the DLI was examining this issue and the idea of a Training Repository (TR) was born. The enthusiastic responses given at the latest training sessions, which introduced the TR, reaffirmed the need for it. And everyone was pleased to see the ease of retrieving a session. Currently there are over 150 presentations in the TR. This presentation examines the history of the Training Repository, the criteria used to choose the program that houses it, and the processes used to populate it.

Statistical Literacy Survey Results
Milo Schield, W.M. Keck Statistical Literacy Project, Augsburg College

In 2002, an international survey on reading tables and graphs of rates and percentages was conducted by the W. M. Keck Statistical Literacy Project. Respondents included US college students, college teachers worldwide, and professional data analysts in the US and in South Africa. The survey focused on reading informal statistics rates and percentages in tables and graphs. Some high error rates were encountered. In reading a 100 percent row table, 44 percent of students (28% of professionals) misread a description of a single percentage. In reading a pie chart, 68 percent of students (53 percent of professionals) misread a comparison of two slices. In reading an X-Y plot, 81 percent of college teachers misread a "times more than" comparison. Educators should accept responsibility for establishing the grammatical rules for writing ordinary English descriptions and comparisons of rates and percentages and for teaching students to read and write such statements correctly.

European Social Survey Education Net: Research-Like Learning in the Social Sciences
Atle Jastad, Norwegian Social Science Data Services

European Social Survey Education Net (ESS EduNet) is an online analysis-training programme that makes it easier and more efficient for lecturers to use ESS data in their teaching. ESS EduNet is a resource that unites different elements of social science in pursuit of a common goal -- the achievement of more penetrating and better-founded analysis of attitudinal survey data than hitherto. The intention is to create an environment for learning that challenges the students on theoretical, methodological, and practical issues simultaneously. Our hope is to improve the students' knowledge of a range of different approaches to social scientific analysis, stimulate independent thinking, and offer them the technical means of investigating empirical data and interpreting results. ESS EduNet is funded by the European Commission as a part of Round Two of the European Social Survey, and developed by the Norwegian Social Science Data Services. ESS EduNet is freely available at: http://essedunet.nsd.uib.no

F2: Catch and Release: Best Practice Across the Data Life Cycle (Kuenzel Room)

Session Chair: Chuck Humphrey, University of Alberta

Producing Archive-Ready Datasets: Compliance, Incentives, and Motivation
Margaret Hedstrom, University of Michigan

Digital archiving assumes some degree of cooperation between data producers and data archives. Experience shows that current incentives are insufficient to overcome the obstacles that data producers report to providing complete and accurate documentation with their data. A multidisciplinary team of experts in digital archiving, social science research, and experimental economics at the School of Information and ICPSR are investigating ways to increase cooperation between producers and archives. With their government partner, the National Institute of Justice, researchers use multiple methods (surveys and experiments) to identify barriers to compliance, revise guidelines and responsibilities, and develop and test alternative incentive mechanisms. This presentation will report on initial findings from a survey about the obstacles that data producers face when they deposit data in an archive.

Two Documents, Three Legs, and Five Stages: Developing an Organizational Response to Digital Preservation Requirements
Nancy McGovern, Cornell University

Recent developments in digital preservation provide organizations with a framework, useful perspectives, and some tools for responding to the challenges of preserving digital content over time. To build an effective digital preservation program, an institution requires a three-legged stool consisting of an organizational infrastructure, a technological infrastructure, and a resources framework. Based on the "Digital Preservation Management: Implementing Short-term Strategies to Long-term Problems" workshop and tutorial developed by Cornell University Library, this paper reviews core components of a digital preservation program, highlights key standards and documents (focusing on Trusted Digital Repositories: Attributes and Responsibilities and Open Archival Information System standard), describes a five-stage maturity model for the incremental development of a digital preservation program, and incorporates the results from institutional readiness surveys completed by workshop participants.

The LEADS Database at ICPSR: Identifying Important Social Science Studies for Archiving
Amy Pienta, ICPSR, University of Michigan

The National Science Foundation (NSF) and National Institutes of Health (NIH) have funded a large number of social science data collections over the last several decades. ICPSR, as part of the Data Preservation Alliance for the Social Sciences (Data-PASS) project, has undertaken a systematic review of grant awards made by NSF and NIH with a major goal of determining the extent to which important social science data have been collected, but not preserved or archived. We have found that the majority of data collections produced by NIH and NSF awards have not been archived. Our preliminary results from this project suggest that there are many reasons that data are not archived. The benefits of developing and implementing a data archiving plan at early parts of the data life cycle will also be discussed.

What Goes Around, Comes Around: We Must All be Data Curators Now
Peter Burnhill, University of Edinburgh

Data archives and data libraries emerged in order to deal with the born-digital, having a mix of mission with respect to re-use, re-purposing, and the historic record. Focus in the social and policy sciences has been on the stewardship of datasets that were generated as part of the research process, whether in academic, government, or commercial domains.

The last decade or so has seen emergence of digitisation programmes for 'born-again' digital surrogates, data-sharing in the life and physical sciences, and corporate concerns with digital asset value and legal compliance. There is now a confluence of institutional repositories and self-publishing, with attempt to manage this within the context of the evolution of digital library provision. These generate challenges in terms of what constitutes best practice for those within IASSIST who provide data for others to thresh. Key to this is value-added activity, both in the curation of datasets for which there is stewardship and in the delivery of services, re-working the mixed mission of re-use, re-purposing, and historic record.

Examples will be drawn from the operation and forward planning for Edinburgh University Data Library, EDINA National Data Centre, and the Digital Curation Centre.

F3: Moving Beyond Data to Networked Knowledge (Anderson Room ABC)

Session Chair: Cor van der Meer, Fryske Akademy

Alternative Ways of Presenting Historical Census Data
Luuk Schreven and Anouk de Rijk, Netherlands Institute for Scientific Information Services

In 1997, the Netherlands Institute for Scientific Information Services in cooperation with other research institutes initiated a digitalization of Dutch censuses held between 1795 and 1970. Among other things, the project resulted in a Web site with all the tables and the additional information. Furthermore several hundreds of the tables were scanned, OCR'd, and subsequently transformed into Excel tables. Recently we have conducted a preliminary investigation into alternative ways to disseminate the data, i.e., via Nesstar. This application offers the possibility to present geographical data in a map and conduct analyses and calculations online. But whereas the initial project's primary objective was to be as historically accurate as possible, data need to meet other requirements to be suitable for Nesstar. The presentation will cover the considerations that play a part in the decision about how to present the census data, the options that are available, and the problems that we encountered.

Database Developments to Establish Internet Content Services
Zoltan Lux, 1956 Institute, Budapest

The Institute, during the 50th anniversary year of the 1956 Hungarian Revolution, is receiving an exceptionally large number of requests for professional assistance with various educational, scholarly, cultural, and official state projects. Among the ways we would like to help satisfy the professional demands made on us during the anniversary is by creating new thematic 'mini-sites'. To prepare these thematic mini-sites, we have developed our contemporary-history databases further to enable archiving of historical documents found in archives and description and archiving of historical studies, and data linkages of existing database elements. The first mini-site, presenting the armed groups of Budapest in 1956, is being prepared in the spring of 2006. Plans are for a thematic historical narrative to provide the framework for the content development, complemented by several hundred pages of digitalized textual documents, memoirs, photo documents, bibliography, and sound documents. Each element or document in the development will concurrently form a separate document in the contemporary-history database, also searchable and usable outside this mini-site framework.

Delivering Government Data to Lawyers and Journalists
Susan Long and Linda Roberge, Syracuse University

The Transactional Records Access Clearinghouse at Syracuse University has built and maintains a data warehouse that stores data, obtained from federal agencies using the Freedom of Information Act, covering the government's enforcement, staffing, and spending activities. Whenever possible, we ask for transactional data rather than aggregated statistics. Maintaining access to the data, including regular updates, in the face of massive government reorganization, changing data systems, and a changing political environment has proved to be a challenge. Over the years, we have found it necessary to establish a series of validation and verification procedures because the quality of the underlying data systems varies. We merge in geographic, population, and other contextual information that helps to provide a basis for interpretation. This paper will cover some of the problems along with the solutions we've developed in delivering information to lawyers and journalists who often have little or no statistical background.

Disseminating Survey Information in the Networked World: A UK Resource
Julie Lamb, University of Surrey

Researchers are increasingly turning to the WWW in an effort to find information for the data collection stage of their projects as well as the more traditional searching for literature and reports. This paper will discuss the development and use of the Question bank, an innovative WWW resource which is used to teach students and researchers about UK social surveys produced by survey agencies such as the Office for National Statistics and the National Centre for Social Research. The Question bank contains the full questionnaires for over 50 social surveys and is continuously expanding. These questionnaires enable researchers to take questions that have been used in large scale surveys for use in their own research work, thus ensuring that they do not spend time re-inventing the wheel. The Qb also contains information on social measurement in 21 substantive topic areas, and has numerous resources relating to survey data collection methods. The resource is free to all.

F4: The Big Picture: GIS Data Challenges and Solutions (Art Lounge)

Session Chair: Marilyn Andrews, University of Regina

State and Local Government Challenges for Geospatial Data Management and Distribution
Robert R. Downs and Robert S. Chen, Center for International Earth Science Information Network (CIESIN)

As part of a project investigating requirements for managing and preserving geospatial data and related electronic records, interviews were conducted with 31 professionals responsible for managing geospatial data for their organizations. The interviews revealed a range of concerns regarding the management and distribution of geospatial data. Key issues include establishing and maintaining formal agreements, managing intellectual property rights and restrictions associated with the data, protecting sensitive information and the confidentiality of locations revealed by the data, and shielding the organization from potential liabilities resulting from data distribution and use. Many organizations have found innovative ways to address specific issues, but none of those surveyed has fully addressed all of these challenges. Issues identified by the interviews have contributed to the development of a guide for practitioners and a data model identifying information elements to be recorded and maintained when managing geospatial data and related electronic records.

Consideration for Security Issues of Geospatial Information Services in Local Governments
Makoto Hanashima, Institute for Areal Studies, Foundation / Institute of Information Security

Emerging technologies in the field of Web Service interoperability are accelerating development of "Web-based GIS" these days. In Japan, the Ministry of Internal Affairs and Communications (MIC) launched the "GIS Action Program" to encourage the introduction of "Integrated GIS" into local governments. It is easy to imagine that Web Service technologies should be applied to Integrated GIS in the near future. However, there is no standard or guideline for information security regarding Geospatial Information Service in Japan. Also, only a few studies have been done on these issues. Therefore, studies from viewpoints of information security are required in order to construct secure geospatial information services. In the beginning of this paper, I clarify issues of information security for geospatial information services on the Internet, then discuss information security requirements for "Web-based" geospatial information services in local government. Those issues are based on the current situation in Japan; however, they will be common with most of "e-Government" around the world.

Organizing Data With Temporal and Spatial References
Michal Paneth-Peleg, Israel Social Sciences Data Center, The Hebrew University

The information needs of regional R&D increase not only in their scope but also in their dimensions. Understanding urban and regional processes such as labour markets, internal migration, and housing prices requires the temporal dimension of data on top of its spatial reference. Running an "ordinary" time series database is complex enough. It obviously needs further logistics when handling temporal data for a multi-national universe like IFS and WDI. However, organizing a database with a temporal dimension for a multi-layer geographic universe is highly ambitious. The presentation will discuss issues of spatial statistics and present a few tradeoffs among major factors: time series continuity, changing boundaries, harmonization of different classifications, and hierarchy of geographic divisions across time. Examples from Israel Geobase will illustrate the discussed tradeoffs.

Integration of GIS With 2000 China Population Census Data
Shuming Bao, China Data Center, University of Michigan

This presentation will demonstrate some China data projects on the integration of GIS with 2000 China population Census data at China Data Center of the University of Michigan, which include China GIS Maps with Population Census Data at province, county and township levels. We'll also demonstrate how to derive the township boundary map and how to project the population Census data to 1km2 Grid maps, which will be very helpful for comparative studies on China in time and space. Other issues will include the internationally collaborative data development, copyright and data license, data service models, and the integration of the data center functions with teaching and research.

G1: DDI for the Next Decade: Toward Version 3.0 (Part 2) (Pendleton Room)

Session Chair: Pascal Heus, International Household Survey Network

DDI 3
Arofan Gregory, Aeon LLC

These two presentations will cover the implications of the major shift in focus in DDI 3.0 to encompass the entire statistical life cycle. We will review the life cycle model, the resulting data model, and implications for how applications are built and function. The role of centralized registries to support the use of metadata throughout the life cycle will be addressed, covering potential use for question banks as well as persistent sources of metadata in other applications from data collection and processing through archiving.

DDI 3
Jostein Ryssevik, presented by Chris Nelson, Open Data Foundation

These two presentations will cover the implications of the major shift in focus in DDI 3.0 to encompass the entire statistical life cycle. We will review the life cycle model, the resulting data model, and implications for how applications are built and function. The role of centralized registries to support the use of metadata throughout the life cycle will be addressed, covering potential use for question banks as well as persistent sources of metadata in other applications from data collection and processing through archiving.

Three Out of Two People Want to Know: The Issues Behind Conversion to DDI 3
Ken Miller, UK Data Archive

The Data Documentation Initiative Structural Reform Group has been working on changes to the existing standard which will result in a more modular and extensible model that covers the whole life cycle of social science data, from conception, through collection, production, distribution, and discovery to analyses and repurposing. This will mean that existing instances of marked-up DDI will not validate against this new Version 3 of the standard. This paper will discuss the issues behind converting existing DDI instances and the tools that will be available to both convert and create marked-up Version 3 DDI records.

G2: New Standards in Statistics and Data Citations (Kuenzel Room)

Session Chair: Diane Geraci, Harvard College Library

Basic Forms of Citation for Statistics and Data: Towards an Accepted Standard
Gaetan Drolet, Statistics Canada

We present the basic forms of citation (formats and elements) developed for statistics, data, and maps products at Statistics Canada. From these models 80 examples have been created to become the citation standards of the organization. We also discuss the relationship between these standards and the ISO 690, 690-2 revision to include examples of statistics, data, and maps citation in the new ISO bibliographic standard, and the opportunities for IASSIST and the data community to be part of this process.

A Proposed Standard for the Scholarly Citation of Quantitative Data
Micah Altman and Gary King, Harvard University

A critical component of the scholarly and library community is the common language of and the universal standards for scholarly citation, credit attribution, and the location and retrieval of articles and books. We present a proposal for a similar universal standard for citing quantitative data that retains the advantages of print citations, adds other components made possible by, and needed due to, the digital form and systematic nature of quantitative datasets, and is consistent with most existing subfield-specific approaches. Although the digital library field includes numerous creative ideas, we limit ourselves to only those elements that appear ready for easy practical use by scientists, journal editors, publishers, librarians, and archivists.

Tracking and Managing Citations: Data Centers and Best Practices
W. Christopher Lenhardt, Center for International Earth Science Information Network (CIESIN)

Documenting data quality and attribution, as well as facilitating appropriate use of digital data, is made more complex by the ethereal nature of the bits and bytes. Encouraging proper citation of digital data is one way to help to address these challenges. Work on technical issues such as citation standardization and knowledge capture is essential. However, there is much more that can be done to encourage progress in proper data citation. Data centers can play a primary role in developing and promoting best practices related to these areas. CIESIN has developed a number of procedures and resources related to citations of online data and information products. This paper will outline these practices and resources, as well as discussing their potential for wider applicability. These best practices connect the data provider, data center, and users and are a necessary complement for technical developments related to citation standardization.

Challenges and Opportunities in the Implementation of Citation Standards
Jeri Schneider, ICPSR

The research community faces challenges and opportunities when implementing or changing citation standards. Here we discuss some of the necessary steps to implement new data citation standards. For example, what parties will be impacted by a new standard and how can we gain their support? We also discuss the opportunities these standards present for ultimately creating a data-aware "Web of Knowledge" allowing for the exploration and visualization of associations among data collections and publications.

G3: Supporting Data Users in a Networked World (Anderson Room ABC)

Session Chair: Tiffani Conner, University of Connecticut

From Primitive Numbers to Knowledge: How Technology Has Enhanced the Dissemination of Social Science Data
Chiu-chuang (Lu) Chou, University of Wisconsin, Madison

Technology has changed the way that people seek information. We can access an unimaginable amount of information with our fingertips. Much social science data can be easily obtained on the Web. What are the processes and mechanisms behind those tabular social science data on the Web? What are the caveats associated with those well-packaged data? When users depend on search engines to find information, a data librarian needs to guide them to locate pertinent data in the information haystacks. Many data producers are using Web technology to disseminate their data. How are these changes affecting social science data libraries and their staffs? This paper examines the service shift in a social science data library over the last five years and presents its plan for the future.

Networking in the University Environment: Building Bridges From the Bottom Up
Jennifer Darragh, The Pennsylvania State University Population Research Institute, and Stephen Woods, The Pennsylvania State University Libraries

Bridges are rarely built in a day and often their foundations are hidden below the waterline where few are able to see. Building networks for data collections and services in a university community takes time and requires individuals who are willing to take it upon themselves to draft the schematics and collect the raw materials before bridges can be built between major organizations within the community. This paper focuses on the collaborative efforts of the University Libraries and the Population Research Institute at The Pennsylvania State University. We will discuss past and current initiatives that have been successful in laying the foundation for future initiatives including: resource authentication, collection building, and promotional activities. We will conclude with a discussion of ideas for future collaboration focusing on distributive reference services, team teaching and other potential partners. It is imperative that we present coherent and cohesive projects that are comprehensible to our organizations' administrators; therefore a considerable amount of thought and experimentation through informal collaboration is necessary beforehand. By developing a history of collaboration through doable and successful projects, visible bridges can be built between seemingly independent organizations within the university community.

Developing a Social Science and GIS Data Service in a Predominantly Undergraduate Library: Past, Present, and Future
Suzette Giles, Ryerson University

At Ryerson University (in Toronto, Canada) social science data collection and service began in 1997. The data librarian was also the map librarian so geospatial /GIS data became her responsibility as well. Data (including geospatial data) services to faculty and students have developed: FROM the past (1997-2003) when they were Library centred; low profile, minimum resources for staff, equipment or computers; TO the present (2003-2006) where they are university centred with a Geospatial, Map and Data Centre, full time technician, server space and Web delivery of some data; TO the future (200?) - provincially centred, with the possibility of centrally archived and networked delivery of social science and geospatial data to Ontario universities. Techniques used at Ryerson to give data services sufficient profile to attract funding and future scenarios being considered by the Ontario universities Data and Map librarians' groups for province wide delivery will be examined.

Data Services Awareness and Use Survey: What We Learned About Promoting Data Services
Eleanor Read, University of Tennessee

In fall 2003, the University of Tennessee Libraries conducted a survey to assess awareness of its data services among faculty and graduate students. The need for additional promotion of the service was clear from the responses and comments. This session will discuss how the results of the survey led to new promotion and outreach initiatives and what the outcome has been so far. It will also encourage feedback from the audience regarding successful promotion and outreach activities at other institutions.

Poster Session

Access to Archival Databases
Thomas Brown, U.S. National Archives and Records Administration

This session will demonstrate the online search and retrieval utility known as the Access to Archival Databases (AAD) resource of the [U.S.] National Archives and Records Administration (NARA). AAD offers public online search and retrieval access to specific records from a selection of NARA's archival databases. NARA launched a redesign of AAD in December 2005 that allows for free-text searching of the values within the data files. Currently access is to approximately 86 million Federal data records in nearly 480 data files from 47 archival series.

Association of Religion Data Archives
Brian J. Grim, Pennsylvania State University SRC and the ARDA (Association of Religion Data Archives)

This poster session will describe the way the Association of Religion Data Archives archives and makes its extensive data on religion in America and its new collection of data on international religion freely accessible to researchers, policymakers, and data archivists. It will demonstrate the various online comparison features provided by the ARDA as well as provide information on how the ARDA structures its data collection and its Web site. (The ARDA was formerly the American Religion Data Archive and continues to support an extensive American collection with numerous mapping and report features.)

Building Outreach and Dialogue--Data Librarianship: The Continuing LIS Education
Tiffani Conner, University of Connecticut; Jennifer Darragh, Pennsylvania State University; and Paula Lackie, Carleton College

This poster session was submitted for inclusion at the annual meeting of the American Library Association as a means to showcase "data librarianship" as a viable specialization. It demonstrates the typical and atypical aspects of the specialization, and how it relates to traditional librarianship. We wish to use this poster at IASSIST to generate discussion about how to encourage and educate librarians about the existence of secondary data, illustrate how library users benefit from access to data, and suggest how services can be designed to support patron data use.

Specific areas we hope to discuss include:

  1. Scenarios for working with varied user populations (students, faculty, professional researchers, and the general public)
  2. How your library degree and undergraduate and/or graduate degree can benefit your career as a data librarian
  3. Designing a data service from scratch and using it to market your library

China Data Explorer
Shuming Bao, University of Michigan China Data Center; Dave Cason, iScience

China Data Explorer products blend the data-rich collections of China provincial and county variables and maps with powerful a data-viewing engine and tools to aid analysts, researchers, and instructors. The China Data Explorer family products include (1) China National Statistics Explorer, (2) China Provincial Census 2000 Explorer, (3) China County Census 2000 Explorer, and (4) China Historical Census Explorer.

China Data Explorer is easy to use with a powerful data-viewing engine integrated with tables, maps, and plots. It has the following features:

  • Dynamic maps with hundreds of variables for country comparisons
  • Thematic map library with a collection of raster-based thematic maps and vector shape files featuring topography, population, plants and animals, climate, and land cover
  • Country profiles with detailed profiles outlining the infrastructure of over 250 nations
  • Gazetteer for cities, states, provinces, countries, bodies of water , and weather stations
  • Flexibility with additional data packages

Connecting Users to Numeric and Spatial Resources: How Are Libraries Faring?
Shawn W. Nicholson, Michigan State University, and Terrence B. Bennett, The College of New Jersey

The burgeoning use of numeric data across all academic disciplines raises significant questions about the library's role in providing data services and promoting quantitative literacy. In this poster session, we will present the results of our analysis of the Web pages of a random sample of members of the Association of Research Libraries. The purpose of this analysis was to identify (through a combination of browsing and searching -- intended to replicate the information-seeking behavior of a typical user) the presence of research data resources at the library; the extent to which library users are made aware of such resources; and the library's role in supporting the use of numeric and spatial data in scholarly research. In addition to presenting a graphical representation of the results of our analysis, we will also explore the diverse range of practices among institutions within our sample in providing and promoting data sources and services.

Creating a Repository of Training Materials: The Canadian Experience
Jane Fry, Carleton University

Over the past nine years, many presentations, demonstrations and workshops have been given at the four annual training sessions for the Data Liberation Initiative (DLI) across Canada. These sessions are rich in content and remain useful long after the initial presentation. However, if one were looking for a certain item, there was often a difficulty finding it because the material was stored in an ad hoc fashion and not archived in a central location. This became increasingly problematic for the trainers as the number of sessions grew. The Education Committee of the DLI was examining this issue and the idea of a Training Repository (TR) was born. The enthusiastic responses given at the latest training sessions, which introduced the TR, reaffirmed the need for it. And everyone was pleased to see the ease of retrieving a session. Currently there are over 150 presentations in the TR.

DAIS KM Suite
Prem Khosla, Health Canada

Health Accountability and Performance Reporting Division, Health Policy Branch, Health Canada will present a poster session displaying the DAIS KM Suite which consists of:

  1. DAIS KM: a full-featured workbench for policy advisors, researchers, data analysts, information specialists and others who work to bring evidence-based knowledge to bear on government policy and program decision making. It enables users whose primary focus is on knowledge to have easy and fast access to microdata, fully integrated with key research reports, publications, policy documents and polls, including published aggregates in DDI, B2020, Excel and other popular formats.
  2. DAIS DDIms Publisher: for capturing and managing DDI metadata to be used in DAIS KM.
  3. System Administration (SYSADM) tool: for describing and entering information and knowledge products (using Dublin Core extensions to the DDI), entering DAIS DDIms Publisher Metadata, managing users and access permission, and tracking DAIS KM Resources usage.

Data Sharing for Demographic Research Program at ICPSR
Felicia LeClere, ICPSR, University of Michigan

One of the primary challenges of data archiving is encouraging deposits. This is particularly problematic for active research projects in which data are being collected currently. The Data Sharing for Demographic Research program at ICPSR offers researchers the option of a virtual archive in which they use ICPSR archiving services yet retain ownership of the physical data. This poster will describe the archiving technique and its strengths and limitations.

The Data Warehouse on Trends in Health and Aging: An Online Data Resource for Health Professionals
Harold Lentzner, U.S. National Center for Health Statistics

The Data Warehouse on Trends in Health and Aging archives and makes available on the Internet macro-level trend data on aging Americans, particularly their health and well-being. The Warehouse has been developed and maintained by the National Center for Health Statistics, Centers for Disease Control with support from the National Institute on Aging. The Warehouse contains over 90 dynamic tables from a variety of national and state data systems including Vital Statistics, the National Health Interview Survey, the Behavioral Risk Factor Surveillance System, and the Medicare Current Beneficiary Survey. The demonstration will show how academics, information specialists, researchers, and policy and program analysts can use this Web tool to quickly answer questions about behaviors, health status, disease prevalence, use of health services, and cost of care.

Davidson Data Center and Network: Extending the Reach
Patricia Loh, William Davidson Institute, University of Michigan

The Davidson Data Center and Network (DDCN) is an online data portal on transition and emerging market economies. DDCN archives and provides free access to socioeconomic micro- and macrodata on these economies. The data portal is comprised of:

  1. A locator that points users to primarily Web-based data resources held by other institutions
  2. An archive that collects and disseminates data -- datasets that can be made available are freely downloadable from the DDCN Web site
  3. An exchange that facilitates the negotiation of data deposition and usage between researchers and between institutions

In 2006, DDCN will focus on 1) expanding the scope of its geographic regions, 2) extending the myriad of issues and topics examined, and 3) reaching a larger user base. To date, DDCN has 599 datasets or data links and more than 860 registered users.

DDI2StatsProgs: Data Definition Command Generator on the Basis of DDI Documents
Joachim Wackerow, GESIS/ZUMA (Centre for Survey Research and Methodology)

DDI2StatsProgs is a flexible tool to generate data definition commands for statistical programs and databases on the basis of DDI documents. In the current state command setups can be created for SAS, SPSS, STATA, R Language, and ANSI-SQL.

DDI2StatsProgs provides a generic approach to produce program-specific definitions of variable-level metadata (DDI); it is template driven. The specific syntax of a statistical program is separated from the variable definitions fed by a DDI document. This approach enables users to change the desired data definition commands ( i.e., SPSS) easily without further technical knowledge (XSLT). By this simplified way it is possible to define rapidly new templates for additional statistical programs. It will be available to the public under the terms of the GNU General Public License.

DDI2StatsProgs is part of a DDI tool family including generation of DDI documents (data description part) from SPSS system files, and generation of multi-channel output (i.e., HTML, Microsoft Help, and PDF) on the basis of DDI content.

Design and Development of Data-Intensive Web Sites
Mark Gregor and Teresa Bailey, Velir Studios

Velir will demo its award-winning, user-friendly data sites. Learn how to use them for researching or how to create an interactive site for your data.

Evaluating Suitability for User Purpose in a Commons of Geographic Data
James Campbell, Harlan Onsrud, Marilyn Lutz, Kenton Williams, and David McCurry, Fogler Library and the Department of Spatial Information Science and Engineering, University of Maine

A Commons of Geographic Data (CGD) is an environment in which those who control "invisible" spatially-related digital data which is not currently accessible to others may choose to contribute that data to a commons environment so that others may have access to it. Such data may be generated by, for example, high school or college science classes, non-profit organizations, individual researchers, and even local businesses. To make such use of the CGD attractive to a wide range of potential users, it is necessary to create an infrastructure in which users would be able to (1) efficiently find data using a variety of extraction strategies, and (2) evaluate the quality of the extracted data and its suitability for the user's purposes. We suggest an infrastructure which includes a "post-publication" evaluation/recommender system, and multiple search options which include access through keywords/tagging as well as use of standards-based metadata and faceted thesauri.

Geospatial Repository for Academic Deposit and Extraction (GRADE)
Anne Robertson, EDINA

Current technologies have led to a dramatic growth in the wealth of created geospatial data assets. Despite this wealth, sharing of geospatial data is not easy with problems surrounding the willingness to share, concerns over IPR, locating data and sharing and access mechanisms. Given these problems, the Scoping of a Geospatial Repository for Academic Deposit and Extraction (GRADE) seems both necessary and timely. As part of JISC's Digital Repositories Programme, the GRADE project will investigate and report on the technical and cultural issues around the reuse of geospatial data within the JISC Information Environment in the context of discipline-centric informal and institutional repositories. The aim of GRADE is to lay the foundations for a sustainable infrastructure (both cultural and technical) that underwrites the communities' substantial and ongoing investment in the utilisation of geospatial resources within the learning, teaching and research environments of UK academia.

National Infrastructure for Community Statistics
Rebecca Blash, The Brookings Institute

The National Infrastructure for Community Statistics (NICS) is a Web-based marketplace that will provide live distributed access to thousands of community-level administrative and survey datasets from local, state, federal, and commercial sources, and the software tools with which to integrate and analyze these data. Sponsored by the Brookings Institution, NICS is intended to support and enhance the ability of data intermediaries and research organizations to better serve their customers. NICS is currently under development, guided by the NICS Community of Practice (CoP), a group of 150+ individuals from federal, nonprofit, industry, and other organizations. Key activities at present include creating approaches to encourage the preparation of metadata, exploring the development of cutting-edge data analytic tools, the formation of topic-specific special interest groups, and the preparation of a NICS business plan.

QNEDS Data Warehouse
John Rago, NORC at the University of Chicago

This poster is a presentation of the QNEDS Data Warehouse built by NORC to assist in the education reform for the state of Qatar. The QNEDS Data Warehouse is a user-accessible repository of data related to the education reform effort in the state of Qatar. The Warehouse and Portal was constructed using the Oracle technology stack (RDBMS, Application Server, Portal, BI tool, ETL tool). Data sources for the Warehouse include the following: the database of an 'Enumeration' application which is used to count students, classes, teachers, and schools; data from the Qatari Ministry of Education; survey data collected from various instruments conducted in the field; and an assessment data file which contains students' scores on math, science, English, and Arabic tests. Once the data is loaded into the Warehouse, it is transformed and made available to users via the Data Access Portal as a SAS or SPSS downloadable dataset and through the Oracle Discoverer ad-hoc query tool. Using Oracle Discoverer, users can query data and create reports and graphs. A user can also download the dataset from the portal and analyze the file on his or her local machine.

Rarely Used Nesstar Functionalities
Irena Vipavc Brvar, Social Science Data Archives, Slovenia

There are many tools for viewing, browsing, analyzing, and downloading survey materials and data. Once an organization decides which tool it will use for its purposes, employees need to learn how to use it. Slovene Social Science Data Archive (ADP) uses Nesstar tools for preparing and presenting survey descriptions and data to users. Nesstar tools are widely used throughout the archiving community, but there are possibilities and functionalities that are hardly exploited. This is partly because some procedures are not so user-friendly and partly because cooperation with other institutions is needed. The ADP catalogue records are viewed through Nesstar Webview by pupils in secondary schools and researchers whose English is not fluent. That's why we are in the process of translating text fields of Nesstar Webview into the Slovene language. We also are testing possibilities of defining specific access conditions and adding map functionality to tables. Archives need to acquire maps and gain permission to use them from responsible institutions (The Surveying and Mapping Authority of the Republic of Slovenia) and need to have suitable data files that contain geographical variables. We will present procedures, advantages, and disadvantages of the above-mentioned features.

Social Survey Data Archiving in the Philippines
Jeanette M. Ureta, Social Weather Stations

The survey archive of Social Weather Stations is the Philippines' pioneer public-use facility providing raw data from statistically-representative surveys on subjects relevant to social science research. The archive has more than 282 datasets from 292,400 interviews of the general public and other populations. Researchers can access surveys on elections, governance, poverty, hunger, corruption, and other issues important to Filipino life. The collection includes comparable Philippine data with other countries on topics such as religion, social inequality, the environment, family and gender roles, national identity, work orientations, and citizenship, based on SWS membership in the International Social Survey Programme, the World Values Survey and other cross-country survey networks. SWS developed its own archiving and computer-assisted retrieval procedures with help through formal ties to the Inter-University Consortium for Political and Social Research at the University of Michigan, the University of Connecticut's Roper Center for Public Opinion Research, and the University of Cologne's Central Archive.

SourceOECD
Joachim K. Doll, OECD

The Organization for Economic Co-operation and Development (OECD), a 30-member country intra-governmental organization, established SourceOECD, its Online Portal enabling institutional data dissemination, in 2000. OECD partnered with Ingenta and Beyond 20/20 to guarantee a full-featured online experience. The site provides full-text content to all OECD's publications including a centralized interface for OECD's 40 statistical databases. The Online Portal has been undergoing continuous improvements since its debut -- among them, advanced full-text searching capabilities, a highly customizable e-mail alerts service, advanced cross-linking to allow access to underlying data found in publications, and the inclusion of sought-after OECD working papers in a centrally organized sub-site. The presentation will provide a live overview of SourceOECD with a demonstration of data extraction methods via the Statistics interface and the new StatLinks feature.

Turning Program Management Data Into a Research Dataset: Working With the Department of Commerce to Study Technology Projects in Local Communities
Kate Williams, Dominican University Graduate School of Library and Information Science; Mark Bard, University of Michigan, School of Information; and Andrea Wiggins, University of Michigan, School of Information

In partnership with the federal agency that created and collected the data, we are building the first shared dataset for the emerging field of community informatics. The data was collected during 1994-2005 in the process of the agency's work, which was funding community-based organizations to apply new technologies to local problems. 600-plus demonstration projects later, the research team now has paper, video and audio recordings, and images as well as quantitative and qualitative data collected electronically from the projects on a quarterly basis. Our poster focuses on the process of transforming and repurposing the electronic data for project management into data for researchers: teaming with the agency, conducting a FOIA request, extracting the data from a proprietary format, constructing a data dictionary, and extracting portions of the data to answer research questions.

The UK Economic and Social Data Service: A One-Stop-Shop for Social Science Data Access and Support
Louis Corti, UK Data Archive

The Economic and Social Data Service (ESDS) is a national data service that provides access and support for an extensive range of key economic and social data, both quantitative and qualitative, spanning many disciplines and themes. A key feature of ESDS is to provide more seamless and easier access to data and to significantly enhance user support activities. The ESDS is a distributed service, based on a collaboration among four key centres of expertise: UK Data Archive (UKDA), University of Essex; Institute for Social and Economic Research (ISER), University of Essex; Manchester Information and Associated Services (MIMAS), University of Manchester and Cathie Marsh Centre for Census and Survey Research (CCSR), University of Manchester. These centres work collaboratively to provide preservation, dissemination, user support and training for the secondary use of data across the research, learning and teaching communities.

This poster will illustrate: products and services offered by the ESDS; how to access data and documentation over the Web; the ESDS Online data systems including NESSTAR, Qualidata Online and ESDS International Beyond 20/20 data access; ESDS role in supporting data creators and depositors; ESDS outreach, user support and training activities; and other UKDA projects on metadata standards.