Features

An overview of Dataverse features can be found at https://dataverse.org/software-features. This is a more comprehensive list.

Highlights

AI

AI Tools

A number of AI tools integrate with Dataverse. More information.

Model Context Protocol (MCP)

Model Context Protocol (MCP) is a standard for AI Agents to communicate with tools and services. More information.

Access and download

File previews

A preview is available for text, tabular, image, audio, video, and geospatial files. More information.

Preview URL

Create a URL for reviewers to view an unpublished (and optionally anonymized) dataset. More information.

Guestbook

Optionally collect data about who is downloading the files from your datasets. More information.

File download in open tabular formats

Proprietary tabular formats are converted into TSV and RData for download. More information.

Administration

User management

Dashboard for common user-related tasks. More information.

Quotas

For number of files, amount of storage, etc. More information.

Usage statistics and metrics

Download counters, support for Make Data Count. More information.

Configurable notifications

In-app and email notifications for access requests, requests for review, etc. can be muted. More information.

Authentication

Login via Shibboleth

Single Sign On (SSO) using your institution’s credentials. More information.

Login via ORCID, Google, GitHub, or Microsoft

Log in using popular OAuth2 providers. More information.

Login via OpenID Connect (OIDC)

Log in using your institution’s identity provider or a third party. More information.

Customization

Branding

Your installation can be branded with a custom homepage, header, footer, CSS, etc. More information.

Internationalization

The Dataverse software has been translated into multiple languages. More information.

Customization of collections

Each personal or organizational collection can be customized and branded. More information.

Widgets

Embed listings of data in external websites. More information.

FAIR data publication

Support for FAIR Data Principles

Findable, Accessible, Interoperable, Reusable. More information.

Versioning

History of changes to datasets and files are preserved. More information.

Prepublication Review Support

Datasets start as drafts and can be submitted for review before publication. More information.

TK Labels

Integrate with the Local Contexts platform, enabling the use of Traditional Knowledge and Biocultural Labels, and Notices. More information.

File management

File hierarchy

Users are able to control dataset file hierarchy and directory structure. More information.

Restricted files

Control who can download files and choose whether or not to enable a “Request Access” button. More information.

Embargo

Make files inaccessible until an embargo end date. More information.

Retention Periods

Make files inaccessible once the retention period set has passed. More information.

Configurable storage

Choose between filesystem or object storage, configurable per collection and per dataset. More information.

Direct upload and download for S3

After a permission check, files can pass freely and directly between a client computer and S3. More information.

Fixity checks for files

MD5, SHA-1, SHA-256, SHA-512, UNF. More information.

Auxiliary files for data files

Each data file can have any number of auxiliary files for documentation or other purposes (experimental). More information.

Geospatial

Geospatial Metadata Fields

There is a dedicated geospatial metadata block. More information.

Geospatial File Preview

GeoJSON, GeoTIFF, and Shapefiles can be previewed as a map. More information.

Metadata Extraction from Geospatial Files

Populate the bounding box from NetCDF and HDF5 files. More information.

Geospatial Search API

Pass geo_point and geo_radius to find datasets based on their bounding box. More information.

Integrations

Dataverse integrates with a wide variety of third party systems, some of which are highlighted below. For a full list, see Integrations.

DataCite Integration

DOIs are reserved, and when datasets are published, their metadata is published to DataCite. More information.

External tools

Enable additional features not built in to the Dataverse software. More information.

Galaxy Integration

Import files directly from Dataverse into Galaxy as well as publish datasets containing artifacts (Histories, datasets, etc.) from Galaxy to Dataverse. More information.

Handles

Handles are a Persistent ID (PID) that are an alternative to DOIs. More information.

Globus

Upload from and download to Dataverse using Globus endpoints. More information.

iRODS

Pull data from an iRODS instance to a Dataverse dataset. More information.

RSpace

Exchange data and metadata with RSpace. For example, a Data Management Plan (DMP) can be uploaded to RSpace and updated with the DOI of a Dataverse dataset. More information.

Dropbox integration

Upload files stored on Dropbox. More information.

GitHub integration

A GitHub Action is available to upload files from GitHub to a dataset. More information.

Integration with Jupyter notebooks

Datasets can be opened in Binder to run code in Jupyter notebooks, RStudio, and other computation environments. They can also be previewed in Dataverse itself. More information.

Interoperability

Signposting

Enable easier machine access to datasets by adding linkset in a Dataverse header. More information.

Harvest from DataCite

Harvest metadata directly from DataCite to Dataverse using OAI-PMH. More information.

Croissant

Export metadata as linked data following the Croissant ontology. More information.

RO-Crate

Export dataset metadata as an ro-crate.json. More information.

OAI-PMH (Harvesting)

Gather and expose metadata from and to other systems using standardized metadata formats: Dublin Core, Data Document Initiative (DDI), OpenAIRE, etc. More information.

APIs for interoperability and custom integrations

Search API, Data Deposit (SWORD) API, Data Access API, Metrics API, Migration API, etc. More information.

API client libraries

Interact with Dataverse APIs from Python, R, Javascript, Java, and Ruby More information.

Schema.org JSON-LD

Used by Google Dataset Search and other services for discoverability. More information.

External vocabulary

Let users pick from external vocabularies (provided via API/SKOSMOS) when filling in metadata. More information.

Export data in BagIt format

For preservation, bags can be sent to the local filesystem, Duraclound, and Google Cloud. More information.

Reusability

Data citation for datasets and files

EndNote XML, RIS, BibTeX, or 1000+ CSL formats at the dataset or file level. More information.

Multiple licenses

CC0 by default but add as many standard licenses as you like or create your own. More information.

Custom terms of use

Custom terms of use can be used in place of a license or disabled by an administrator. More information.

Post-publication automation (workflows)

Allow publication of a dataset to kick off external processes and integrations. More information.

Provenance

Upload standard W3C provenance files or enter free text instead. More information.

Misc

Preview and analysis of tabular files

Data Explorer allows for searching, charting and cross tabulation analysis More information.

Curation

Curation status labels

Let curators mark datasets with a status label customized to your needs. More information.

Pull header metadata from Astronomy (FITS) files

Dataset metadata prepopulated from FITS file metadata. More information.