Policies for Research Data and Datasets
eScholarship@UMMS Data Deposit Policy Lamar Soutter Library, Research and Scholarly Communication Services
Download the Data Deposit Policy (PDF)
Table of Contents
Data We Collect (Data Collection Policy)
Requesting Removal of Data from eScholarship@UMMS
How People Can Use Your Material (End-User Access Policy)
What You Need to Agree to in Order for Us to Host and Share Your Work (Deposit License)
Limited Review of Data by the Repository Administrator
Language for Use in the Grant-Writing Process
Registering Your Data in eScholarship@UMMS
Policies Used in the Creation of this Document
All research data housed within eScholarship@UMMS is subject to the following data-specific policies. Note: Policies and Terms are subject to change without notice. Your use of the Site, or any of its Content licensed hereunder, after any such changes have been made will constitute your agreement on a prospective basis to the modified eScholarship@UMMS Data Deposit Policy and all changes. Accordingly, you should read this policy upon each deposit.
eScholarship@UMMS is managed by the Lamar Soutter Library, and accepts digital research data, digital research artifacts, and digital research materials (henceforth referred to as "data") at no cost that meet the following collection criteria:
- Data must be created by at least one University of Massachusetts Medical School (UMMS) researcher with an active UMMS email at time of deposit. Depositor is responsible for notifying us, firstname.lastname@example.org, if contact information changes.
- Data should be original and unique, and not deposited elsewhere. If you have deposited your data in another repository we are happy to create a metadata record in eScholarship@UMMS to link to the location.
- Administrative records and documents (e.g. meeting minutes, internal memos) will not be accepted unless used for the purposes of research.
- Data must be unencrypted and unclassified data, meaning data without private, confidential, or other legally protected information (e.g., personal identifiable information).
- Please contact us for assistance locating a repository for protected research data or confidential data.
- For more examples of identifiers that may be considered a breach of confidentiality, please see the guidance here: https://datadryad.org/docs/HumanSubjectsData.pdf
- For examples of personally identifiable information, see here: https://www.umassmed.edu/it/security/compliance/what-is-pii/
- Investigators should review prior to submission any applicable consent forms, protocols, contracts, or similar documents that might restrict data sharing.
- Data must be deposited for open access. Visitors to eScholarship@UMMS may download and reuse your data. Data that is not suitable for reuse should not be shared publicly.
- A release embargo of no more than 2 years may be placed on the data. Please contact us, email@example.com, if you need this embargo extended.
- Open, non-proprietary data formats are preferred such as csv and pdf. Repository staff reserve the right to create a copy of the data in an open, non-proprietary format for preservation purposes.
- File upload and download speeds are dependent upon your network connection. We suggest limiting files to 1 GB for upload, however larger files can be deposited. Please contact us, firstname.lastname@example.org, to discuss data sets larger than 1 GB or that you are having trouble uploading.
- Word files are automatically converted to pdf files and have a limit of 800 MB. Please contact us, email@example.com, to discuss larger Word files.
- Data must include a README file describing the nature of the data at an appropriate level for purposes of reuse and discovery. All data depositors must complete a README file as part of acceptance into the repository. The README must include:
- Creators: Include contact information for the primary creator and principal investigator and/or advisor (if not the same), specifically email address, phone number, and institution. Contact information for all creators is preferred.
- Directory of Files in Dataset: List and define the different files included in the dataset, which serves as its table of contents. This information can be included in the README file, in a separate file, or as part of the data file. If it is in a separate file or in the data file, explain where this information is located and ensure that it is accessible without specialized software. We recommend using plain text files or tabular plain text CSV files exported from spreadsheet software.
- File Naming Convention: Define your File Naming Convention (FNC), the framework used for naming your files systematically to describe what they contain, which could be combined with the Directory of Files. For more information and examples, see the Research Data Management Resource Guide.
- Data Description: A data description, dictionary, or codebook defines the variables and abbreviations used in a dataset. This information can be included in the README file, in a separate file, or as part of the data file. If it is in a separate file or in the data file, explain where this information is located and ensure that it is accessible without specialized software. We recommend using plain text files or tabular plain text CSV files exported from spreadsheet software.
- If applicable, additional required information for README file:
- Software: If specialized software(s) generated your data or are necessary to interpret it, please provide for each (if applicable): software name, version, system requirements, and developer.
- Equipment: If specialized equipment generated your data, please provide for each (if applicable): equipment name, manufacturer, model, and calibration information. Be sure to include specialized file format information in the data dictionary.
- Dates of Data Collection: List the dates and/or times of data collection.
- A README file template, more information on README files and example README files are available at https://libraryguides.umassmed.edu/research_data_management_resources/documentation.
- The data should be in a final, reusable state. We suggest cleaning your data, including deduplication, correction, and normalization. Tools such as OpenRefine can be used to assist with this process. Please contact us, firstname.lastname@example.org, for more information and suggestions.
- It is the responsibility of the individuals who are uploading the data to make sure that they have the necessary rights, permissions, or lawful authorization (such as fair use) to share any materials they did not generate and clearly and appropriately identify and acknowledge any third-party material in the content of the deposit.
Please contact us, email@example.com, if you have questions or specific inquiries.
In accordance with the Digital Collections Policy, we will consider requests to remove content on a case-by-case basis for reasons such as IRB infringement, plagiarism, or at the request of the creator(s). Once the data is removed, a tombstone metadata record will be left as a landing page for anyone who has the link. The tombstone record will not be crawled by search engines.
The submission process consists of the following steps:
- Read and accept the Submission Agreement
- Provide information about yourself and any co-authors
- Provide basic information about the data you are uploading
- Select the appropriate Creative Commons license. We suggest using either a Creative Commons Zero (CC0) or a Creative Commons Attribution 4.0 (CC-BY 4.0) license for broadest re-use of your data. Read more about open data licensing. If you have questions about licensing your data, please contact us, firstname.lastname@example.org.
- Upload your primary data file and supplemental files or link out to a file on another website
Before you begin the online submission, please be sure you have the following items ready:
- The submission's title
- Name, email address, and affiliation of the data creators
- Description or abstract of the data
- Completed README file
- A list of keywords
- An electronic version of your primary data file and any supplemental files, or hyperlink to your file (maximum size of 15 GB, with 800 MB limit for Microsoft Word files)
Submission Agreement for Research Data and Datasets
By checking the box below, I grant the Lamar Soutter Library on behalf of the University of Massachusetts Medical School the non-exclusive right to distribute my submission ("the Work") over the Internet and make it part of eScholarship@UMMS.
I warrant as follows:
- that I have the full power and authority to make this agreement;
- that the Work does not infringe any copyright, nor violate any proprietary rights, nor contain any libelous matter, nor invade the privacy of any person or third party; and
- that no right in the Work has in any way been sold, mortgaged, or otherwise disposed of, and that the Work is free from all liens and claims.
I understand that once a Work is deposited in the repository, it may not be removed except for reasons such as unethical/unsound science, plagiarism, copyright infringement, IRB non-compliance and/or violation of the Licensing Agreement.
Before data is made available through eScholarship@UMMS, the repository administrator will complete a limited review of the data to:
- Check for completeness of the README (data dictionary, etc.)
- Perform a cursory check for personal protected or health information but will not check for cross-variable identification
- Not checking for errors or omissions
- Repository administrator is not liable
Please contact us, email@example.com, prior to including eScholarship@UMMS in your data management plan or grant proposal.
Feel free to use the language below when filling out grants that require data sharing and/or data management plans. Please note that eScholarship@UMMS can only accept data that is openly available - we are unable to provide restricted access to data.
The data will be deposited into eScholarship@UMMS, https://escholarship.umassmed.edu/datasets/. The Lamar Soutter Libraryís hosted institutional repository is an open access platform for dissemination and long-term storage of university research data. In addition, eScholarship@UMMS maintains persistent DOIs for data sets, facilitating data citations. In accordance with eScholarship@UMMS policies, the (deidentified, if applicable) data will be accompanied by the appropriate documentation, metadata, and code to facilitate reuse and provide the potential for interoperability with similar data sets.
If you need further assistance with language for a grant, please see Florida International Universityís boilerplate library at http://research.fiu.edu/proposal-preparation/templates/#data. For further inquiries or assistance, contact us, firstname.lastname@example.org.
There may be situations where you do not want to deposit your data in eScholarship@UMMS. For example, you may intend to deposit your data in a repository specific to your discipline instead. If this is the case, you can register your data in eScholarship@UMMS. By registering your data in eScholarship@UMMS, you increase the potential for your work to be discovered and reused by other researchers. To register your data, you will provide information (metadata) describing the data, along with the location of the data. Please contact us, email@example.com, if you have questions, or would like more information about data registration.
eScholarship@UMMS Team: firstname.lastname@example.org
|General Statement||ScholarWorks@UMassAmherst, eScholarship@UMMS|
|Data Collection Policy||ScholarWorks@UMassAmherst, KiltHub|
|End-User Access Policy||ScholarWorks@UMassAmherst, eScholarship@UMMS|
|Deposit License||ScholarWorks@UMassAmherst, eScholarship@UMMS|
|Submission Agreement||ScholarWorks@UMassAmherst, eScholarship@UMMS|
|Boilerplate Language for Grants||ScholarWorks@UMassAmherst|
Policies Related to DEPOSIT of Materials in eScholarship@UMMS
Policies Related to USE of Materials Available in eScholarship@UMMS
This policy was last updated on December 13, 2019.
This policy was approved by the eScholarship@UMMS Management Team on December 17, 2019.