Need help managing your data?
Contact the library: ask@uml.libanswers.com
This guide was developed in Fall 2024 by Bari Pender (Ph.D., M.L.S. expected Spring 2025) and Veronica Chea (B.S. Public Health, expected Spring 2026), with inspiration and content from:
You are invited to re-use any content from this guide without needing to contact us, but please credit the authors and UMass Lowell Library when re-using.
Sharing your datasets in a public data repository is often required by both funding agencies and journals. Submitting your data to an appropriate repository has many benefits as well. Data repositories:
Where you share your data depends first on any grant- and/or journal-specific repository requirements. The term "repository" usually refers to an online system that has robust storage and backups and is relatively permanent.
Generally speaking, as a best practice:
See the discipline-specific repositories and general repositories in the next sections for more guidance.
The table below highlights important characteristics to look for in a repository (adapted from the NIH list of Desirable Characteristics for All Data Repositories):
| Characteristic | Description |
| --- | --- |
| Persistent Unique Identifiers | Assigns datasets citable PIDs for discovery and reporting. |
| Long-term Sustainability | Plans for stable technical infrastructure, funding, and contingency for data longevity. |
| Metadata | Ensures sufficient metadata for discovery, reuse, and citation. |
| Curation & Quality Assurance | Expertise to maintain accuracy and integrity of datasets and metadata. |
| Free and Easy Access | Maximizes open access, respecting legal and ethical constraints. |
| Broad and Measured Reuse | Broad terms of reuse with measures for attribution, citation, and data usage. |
| Clear Use Guidance | Provides documentation on dataset access and usage terms. |
| Secure | Prevents unauthorized data access or modification with appropriate security. |
| Confidentiality | Safeguards to meet confidentiality and risk management standards. |
| Common Format | Data and metadata are in widely used, non-proprietary formats. |
| Provenance | Tracks origin, custody, and modifications to datasets. |
| Retention Policy | Documents policies for data retention in the repository. |
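
When comparing candidate repositories, the characteristics in the table can be treated as a simple checklist. The Python sketch below is one possible way to do that and is only an illustration: the function name `missing_characteristics` is hypothetical, and nothing here is part of the NIH guidance itself. It assumes you have already reviewed a repository's documentation and noted which characteristics you could confirm.

```python
# Characteristic names copied from the table above, in table order.
CHARACTERISTICS = [
    "Persistent Unique Identifiers",
    "Long-term Sustainability",
    "Metadata",
    "Curation & Quality Assurance",
    "Free and Easy Access",
    "Broad and Measured Reuse",
    "Clear Use Guidance",
    "Secure",
    "Confidentiality",
    "Common Format",
    "Provenance",
    "Retention Policy",
]


def missing_characteristics(confirmed):
    """Return the characteristics not yet confirmed for a repository.

    `confirmed` is any iterable of characteristic names (hypothetical input:
    whatever you were able to verify from the repository's documentation).
    The result keeps the order used in the table.
    """
    confirmed_set = set(confirmed)
    # Reject misspelled or unknown names so gaps are not silently hidden.
    unknown = confirmed_set - set(CHARACTERISTICS)
    if unknown:
        raise ValueError(f"Not in the checklist: {sorted(unknown)}")
    return [name for name in CHARACTERISTICS if name not in confirmed_set]
```

For example, calling `missing_characteristics(["Metadata", "Secure"])` returns the ten remaining characteristics that still need to be checked before relying on that repository.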