Features
An overview of Dataverse features can be found at https://dataverse.org/software-features. This is a more comprehensive list.
Highlights
…
…
AI
AI Tools
A number of AI tools integrate with Dataverse. More information.
Model Context Protocol (MCP)
Model Context Protocol (MCP) is a standard for AI Agents to communicate with tools and services. More information.
Access and download
Faceted search
Facets are data driven and customizable per collection. More information.
File previews
A preview is available for text, tabular, image, audio, video, and geospatial files. More information.
Preview URL
Create a URL for reviewers to view an unpublished (and optionally anonymized) dataset. More information.
Guestbook
Optionally collect data about who is downloading the files from your datasets. More information.
File download in open tabular formats
Proprietary tabular formats are converted into TSV and RData for download. More information.
Administration
User management
Dashboard for common user-related tasks. More information.
Quotas
For number of files, amount of storage, etc. More information.
Usage statistics and metrics
Download counters, support for Make Data Count. More information.
Configurable notifications
In-app and email notifications for access requests, requests for review, etc. can be muted. More information.
Authentication
Login via Shibboleth
Single Sign On (SSO) using your institution’s credentials. More information.
Login via ORCID, Google, GitHub, or Microsoft
Log in using popular OAuth2 providers. More information.
Login via OpenID Connect (OIDC)
Log in using your institution’s identity provider or a third party. More information.
Customization
Branding
Your installation can be branded with a custom homepage, header, footer, CSS, etc. More information.
Internationalization
The Dataverse software has been translated into multiple languages. More information.
Customization of collections
Each personal or organizational collection can be customized and branded. More information.
Widgets
Embed listings of data in external websites. More information.
FAIR data publication
Support for FAIR Data Principles
Findable, Accessible, Interoperable, Reusable. More information.
Versioning
History of changes to datasets and files are preserved. More information.
Prepublication Review Support
Datasets start as drafts and can be submitted for review before publication. More information.
TK Labels
Integrate with the Local Contexts platform, enabling the use of Traditional Knowledge and Biocultural Labels, and Notices. More information.
File management
File hierarchy
Users are able to control dataset file hierarchy and directory structure. More information.
Restricted files
Control who can download files and choose whether or not to enable a “Request Access” button. More information.
Embargo
Make files inaccessible until an embargo end date. More information.
Retention Periods
Make files inaccessible once the retention period set has passed. More information.
Configurable storage
Choose between filesystem or object storage, configurable per collection and per dataset. More information.
Direct upload and download for S3
After a permission check, files can pass freely and directly between a client computer and S3. More information.
Fixity checks for files
MD5, SHA-1, SHA-256, SHA-512, UNF. More information.
Auxiliary files for data files
Each data file can have any number of auxiliary files for documentation or other purposes (experimental). More information.
Geospatial
Geospatial Metadata Fields
There is a dedicated geospatial metadata block. More information.
Geospatial File Preview
GeoJSON, GeoTIFF, and Shapefiles can be previewed as a map. More information.
Metadata Extraction from Geospatial Files
Populate the bounding box from NetCDF and HDF5 files. More information.
Geospatial Search API
Pass geo_point and geo_radius to find datasets based on their bounding box.
More information.
Integrations
Dataverse integrates with a wide variety of third party systems, some of which are highlighted below. For a full list, see Integrations.
DataCite Integration
DOIs are reserved, and when datasets are published, their metadata is published to DataCite. More information.
External tools
Enable additional features not built in to the Dataverse software. More information.
Galaxy Integration
Import files directly from Dataverse into Galaxy as well as publish datasets containing artifacts (Histories, datasets, etc.) from Galaxy to Dataverse. More information.
Handles
Handles are a Persistent ID (PID) that are an alternative to DOIs. More information.
Globus
Upload from and download to Dataverse using Globus endpoints. More information.
iRODS
Pull data from an iRODS instance to a Dataverse dataset. More information.
RSpace
Exchange data and metadata with RSpace. For example, a Data Management Plan (DMP) can be uploaded to RSpace and updated with the DOI of a Dataverse dataset. More information.
Dropbox integration
Upload files stored on Dropbox. More information.
GitHub integration
A GitHub Action is available to upload files from GitHub to a dataset. More information.
Integration with Jupyter notebooks
Datasets can be opened in Binder to run code in Jupyter notebooks, RStudio, and other computation environments. They can also be previewed in Dataverse itself. More information.
Interoperability
Signposting
Enable easier machine access to datasets by adding linkset in a Dataverse header. More information.
Harvest from DataCite
Harvest metadata directly from DataCite to Dataverse using OAI-PMH. More information.
Croissant
Export metadata as linked data following the Croissant ontology. More information.
RO-Crate
Export dataset metadata as an ro-crate.json. More information.
OAI-PMH (Harvesting)
Gather and expose metadata from and to other systems using standardized metadata formats: Dublin Core, Data Document Initiative (DDI), OpenAIRE, etc. More information.
APIs for interoperability and custom integrations
Search API, Data Deposit (SWORD) API, Data Access API, Metrics API, Migration API, etc. More information.
API client libraries
Interact with Dataverse APIs from Python, R, Javascript, Java, and Ruby More information.
Schema.org JSON-LD
Used by Google Dataset Search and other services for discoverability. More information.
External vocabulary
Let users pick from external vocabularies (provided via API/SKOSMOS) when filling in metadata. More information.
Export data in BagIt format
For preservation, bags can be sent to the local filesystem, Duraclound, and Google Cloud. More information.
Reusability
Data citation for datasets and files
EndNote XML, RIS, BibTeX, or 1000+ CSL formats at the dataset or file level. More information.
Multiple licenses
CC0 by default but add as many standard licenses as you like or create your own. More information.
Custom terms of use
Custom terms of use can be used in place of a license or disabled by an administrator. More information.
Post-publication automation (workflows)
Allow publication of a dataset to kick off external processes and integrations. More information.
Provenance
Upload standard W3C provenance files or enter free text instead. More information.
Misc
Preview and analysis of tabular files
Data Explorer allows for searching, charting and cross tabulation analysis More information.
Curation
Curation status labels
Let curators mark datasets with a status label customized to your needs. More information.
Pull header metadata from Astronomy (FITS) files
Dataset metadata prepopulated from FITS file metadata. More information.