SOMAR Virtual Data Enclave (VDE)

The SOMAR Virtual Data Enclave (VDE) is a secure, remote desktop research environment operated by the Social Media Archive at ICPSR (SOMAR). The VDE enables researchers to work with sensitive or restricted social media datasets while maintaining the highest standards of data privacy, security, and compliance.

What is the VDE?

The VDE serves as a secure virtual environment for working with restricted data. Researchers using restricted data will complete their analysis in the VDE using software available in the VDE. The VDE is available through a remote desktop connection and does not allow external internet access.

SOMAR VDE Features & Software

  • Supported software:
    • Programming languages: R and Python
      • Researchers can install R packages from CRAN and Python packages from PyPI
    • Analysis environments: RStudio, JupyterLab, Jupyter Notebooks
    • Statistical software: Stata (by request)
    • Qualitative analysis software: MAXQDA (by request)
  • Large Language Models (LLMs), Machine Learning, and AI tools:
    • The SOMAR VDE can allow a variety of tools in accordance with the ICPSR LLM policy
    • Custom and pretrained models may be available by request, following a security review by ICPSR staff
    • SOMAR staff will review and upload custom models for use in the VDE, as long as they can be used in an isolated computing environment (eg, does not require internet access to run the model) and they do not retain user-provided data
    • We support several models from HuggingFace, including:
  • High Performance Computing:
    • The SOMAR VDE offers multiple configurations of CPU and GPU compute resources to support your data analysis needs
  • Code and Data Upload
    • Researchers may request that their own code and supplemental data be uploaded to their SOMAR VDE
    • All files uploaded to the VDE undergo a security review by ICPSR staff. Typical turnaround time is 5-10 business days
    • Some datasets may not allow researchers to combine the restricted data with any external data source
    • Email somar-help@umich.edu to request an upload review

Data Security

Research teams using restricted data in the VDE are required to submit a Restricted Data Use Agreement (RDUA) with their application, signed by the Principal Investigator and an Institutional Signatory from the PI’s institution. The RDUA details data security requirements for data access in the VDE. The VDE has several features designed to ensure data security, including:

  • No external internet access
  • No copying and pasting from outside the VDE environment
  • Researchers cannot upload their own external files or export files from the VDE environment
  • No screenshots may be taken and no screensharing may occur from the VDE environment

SOMAR VDE Costs

Starting in early 2026, the SOMAR VDE will implement a new cost structure for VDE use. Each research team using the SOMAR VDE will be responsible for:

  • $371 per month for the duration of the research project
  • One-time setup fee of $1,000 before VDE usage begins (applicable only to teams onboarded after January 1, 2026)

These fees help ensure the continued sustainability and quality of SOMAR services. The resources required to support the VDE – including computing infrastructure, storage, security, and staff time – represent significant ongoing costs. These charges allow SOMAR to maintain and improve the level of support and reliability you expect from our archive.

How do I know if I need to use the VDE?

When you find a SOMAR dataset to use for your research, the “Additional Notes” field will explain any access restrictions on the data, including if the data are public or restricted to the SOMAR VDE.

VDE Setup and Onboarding Process

It can take several weeks to gain access to restricted data in the SOMAR VDE. The primary steps are as follows:

  1. Research team submits an application for the restricted data of interest. All applications for data in the VDE require a Restricted Data Use Agreement (RDUA) signed by the Principal Investigator and an Institutional Signatory. Many applications for data in the VDE also require IRB or Ethics Committee approval.
  2. All research team members will complete VDE security training. This training is included in the application form.
  3. After your application is approved, SOMAR staff will create your login credentials. This typically takes 1-5 business days.
  4. SOMAR staff begin building your team’s enclave and verify the required data and functionality. This typically takes 5-10 business days.
  5. Once setup is complete, SOMAR will email all team members the instructions for accessing your team’s enclave.

Support

For technical issues or questions about VDE setup, onboarding, or operations, email SOMAR support at somar-help@umich.edu. Our staff are here to ensure a smooth, secure, and productive research experience.

For more details: