Reimagining Catalogs and Repositories to Enhance the FAIRness of Research Data
Research data archives typically comprise catalogs and repositories that are tightly woven together, in which research data are both discovered and accessed through the same platform. While this archive design enables the catalog to reflect the repository in substantial detail, it can limit discoverability and interoperability. Furthermore, studies can contain diverse types of data, such as surveys, biomarkers, and neuroimages that may be available through different specialized repositories. These different repositories often have different metadata schemas. We propose to loosen the tight coupling of a catalog and to a single repository by catalogs incorporating study metadata from many different repositories. Catalogs will enable data discovery while repositories will provide data access. Repositories host and deliver access to the research data, while catalogs provide searchable listings of the contents of repositories. As a result, the FAIRness of research data will enhance.
Our proposed archive design requires an API (application programming interface) for each repository, so that catalogs can communicate with repositories to produce searchable listings. Catalogs typically incorporate metadata from their own repositories. In our reformulation, catalogs would pull metadata from multiple repositories.
The DSDR presentation slides are available at https://myumi.ch/bVrVy
The DSDR poster is available at https://zenodo.org/records/15683694
Visit us at DSDR.icpsr.umich.edu