
In January the NIH implemented new policies requiring that research data be managed, archived, and shared using a data management and sharing plan that must be submitted as part of any new grant application. These policies encourage data re-use and reproducibility, increase transparency, and enable researchers to build on previous work.

Himmelfarb Library’s NIH Data Management and Sharing Plan (DMSP) Research Guide brings data management and sharing services and resources together for easy reference and instruction. The guide can step you through the process of determining what data needs to be shared and archived, putting together a data management plan, finding a data storage solution, and choosing an open data repository for sharing.

Templates for data management plans are helpful development tools. DMPTool provides a variety of templates, including the NIH_GEN DMSP (2023) template specifically for NIH funding. You can find it and other sources for templates on the DMSP guide Getting Started tab. NIH recently released 13 additional sample templates on its website, including templates for genomic and survey data. The Survey and Interview Data sample plan (Sample Plan M) includes language for data that can't be shared.

Finding an appropriate open data repository for storage and sharing can be a challenge. The NIH-supported Scientific Data Repositories site is useful for finding specialized repositories. For more generalist data, NIH began the Generalist Repository Ecosystem Initiative (GREI) and has partnered with seven organizations that offer open repositories, including figshare, Mendeley Data, OSF, and Dryad. More information about these repositories, including recorded webinars, is on NIH’s GREI website.

The June R01 deadline has just passed, meaning that the next submission date is in October. If you’re planning to apply for NIH funding, don’t put off work on a data management and sharing plan! Start now and reach out to data specialists at GW with your questions. Sara Hoover, Metadata and Scholarly Publishing Librarian, is the contact at Himmelfarb Library. You can reach Sara at shoover@gwu.edu. Additionally, Gelman Library offers data management consultation services. Librarians can answer your questions or refer them to other University research services for assistance, including the OVPR, the Office of Sponsored Projects, the Office of Research Integrity, and the Office of Clinical Research.

Graphic image of computer screen with connections to gears, light bulb and bulls eye from pxfuel
Photo from https://www.pxfuel.com/en/free-photo-jylug

This is the second in a series of articles on the changes to the NIH Data Management and Sharing policies that will come into effect for NIH grant applications starting January 2023. See our first article for a general overview.

If you’re preparing to apply for an NIH grant, having a plan to manage and share your data just turned up on your to-do list. Currently, only grants of $500,000 or more are required to have a data management plan. Effective January 25, 2023, ALL grant applications or renewals that generate scientific data must include a detailed plan for managing and storing data through the duration of the funded period, including plans for data dissemination. Last week, NIH released a list of activity codes for grants that will be subject to the new policy. Where do you start? What should be included in this plan? We’ll provide some answers and resources to guide you here.

All data management plans should incorporate the FAIR (Findable, Accessible, Interoperable, Reusable) principles to ensure optimal research data stewardship. Beyond following FAIR guidelines, what are the specific elements that must be included in a data management plan?  Here’s an outline of things to include and think through:

Who will be responsible for the data?

  • Usually, data is owned by the institution awarded the grant and the principal investigator is responsible for data collection and management.
  • If there are others responsible, this should be documented in the plan.

What types of data will be generated and where will they come from? Create a descriptive list of all the data that will be collected during the research process, as well as an estimate of how much data will be generated. Further things to consider include:

  • Why is it desirable to share this data and how could it be re-used? All data that is required to replicate results should be shared.
  • Are there any risks to disclosing this data? If any data cannot be shared due to legal, ethical, or technical reasons, exceptions for sharing can be written into the plan. However, all data must be managed.
  • At what point in the research process should data be shared? Will it be in a usable format at that time?
  • If you’re using data from other sources, include the source and any conditions for using it, as well as its relationship to the original data generated during the research.
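
The descriptive list and size estimate described above could be kept as a small inventory script. This is only a sketch: the `PlannedDataset` fields, the example datasets, and the file counts and sizes are hypothetical placeholders, not NIH requirements.

```python
from dataclasses import dataclass


@dataclass
class PlannedDataset:
    """One row in a data inventory (all field names are illustrative)."""
    description: str    # what the data is
    source: str         # where the data comes from
    file_format: str    # planned file format
    file_count: int     # expected number of files
    avg_file_mb: float  # expected average file size, in megabytes
    shareable: bool     # False if legal, ethical, or technical limits apply

    @property
    def total_mb(self) -> float:
        # Expected size of this dataset in megabytes.
        return self.file_count * self.avg_file_mb


def estimate_total_gb(datasets: list[PlannedDataset]) -> float:
    """Sum the expected size of all datasets, in gigabytes (1 GB = 1000 MB)."""
    return sum(d.total_mb for d in datasets) / 1000


def not_shareable(datasets: list[PlannedDataset]) -> list[str]:
    """List datasets that need a sharing exception written into the plan."""
    return [d.description for d in datasets if not d.shareable]


# Hypothetical example inventory.
inventory = [
    PlannedDataset("Survey responses", "Study participants", "csv", 200, 0.5, True),
    PlannedDataset("Interview recordings", "Study participants", "wav", 40, 150.0, False),
]

if __name__ == "__main__":
    print(f"Estimated total: {estimate_total_gb(inventory):.1f} GB")
    print("Needs a sharing exception:", not_shareable(inventory))
```

Even data that appears in the exceptions list still counts toward the storage estimate, since all data must be managed whether or not it is shared.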

What formats and standards will be used for your data?

  • Non-proprietary file formats (for example, .csv, .txt, XML, or PDF) are preferred. These formats are more likely to remain readable in the future, which is important for preservation.
  • Consider using a directory structure with a formalized naming convention and version control to better organize your data. Learn more about file management naming conventions from Cornell.
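
A formalized naming convention is easier to follow when file names are generated and checked by a script. The sketch below assumes one possible pattern, `project_description_YYYYMMDD_vNN.ext`; the pattern and the function names are illustrative assumptions, not a convention prescribed by NIH or Cornell.

```python
import re
from datetime import date

# Assumed convention: project_description_YYYYMMDD_vNN.ext (lowercase, no spaces).
FILENAME_PATTERN = re.compile(r"^[a-z0-9-]+_[a-z0-9-]+_\d{8}_v\d{2}\.[a-z0-9]+$")


def slug(text: str) -> str:
    """Lowercase the text and replace runs of other characters with a hyphen."""
    return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")


def build_filename(project: str, description: str, collected: date,
                   version: int, extension: str) -> str:
    """Build a file name that follows the assumed convention."""
    ext = extension.lstrip(".").lower()
    return f"{slug(project)}_{slug(description)}_{collected:%Y%m%d}_v{version:02d}.{ext}"


def follows_convention(filename: str) -> bool:
    """Check whether an existing file name matches the assumed convention."""
    return FILENAME_PATTERN.match(filename) is not None


if __name__ == "__main__":
    name = build_filename("Sleep Study", "Survey Responses", date(2023, 1, 25), 2, ".CSV")
    print(name, follows_convention(name))
    print("Final data (2).xlsx", follows_convention("Final data (2).xlsx"))
```

Putting the date in year-month-day order and zero-padding the version number keeps files in chronological and version order when sorted by name.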

What formats and standards will be used for your metadata? Metadata describes your data and makes it findable. 
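
As a rough illustration of what descriptive metadata can look like, the sketch below builds a simple record and serializes it to JSON so it can be stored alongside the data files. The field names are illustrative assumptions and are not drawn from any particular metadata standard; a repository or discipline will usually specify its own required fields.

```python
import json
from datetime import date


def make_metadata_record(title, creators, description, keywords,
                         license_name, files, created=None):
    """Return a dictionary describing a dataset (illustrative field names)."""
    if not title or not creators:
        raise ValueError("A title and at least one creator are needed to make data findable.")
    return {
        "title": title,
        "creators": list(creators),
        "description": description,
        "keywords": sorted(set(keywords)),  # de-duplicated and sorted
        "license": license_name,
        "date_created": (created or date.today()).isoformat(),
        "files": list(files),
    }


def to_json(record) -> str:
    """Serialize the record as human-readable JSON text."""
    return json.dumps(record, indent=2, ensure_ascii=False)


if __name__ == "__main__":
    # Hypothetical example values.
    record = make_metadata_record(
        title="Sleep study survey responses",
        creators=["Example Researcher"],
        description="De-identified survey responses collected for a sleep study.",
        keywords=["sleep", "survey", "sleep"],
        license_name="CC0",
        files=["sleep-study_survey-responses_20230125_v02.csv"],
        created=date(2023, 1, 25),
    )
    print(to_json(record))
```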

What will be the methods for archiving and sharing the data?

  • Where will the data be stored during the research process and how will it be backed up and secured (is encryption required)? Find tips on our data storage and security page.
  • How will the data be made accessible after the research is complete? Find some options on our data repositories page. Cornell has considerations for selecting a repository site on their Sharing and archiving data page.
  • Determine the rights for sharing. A CC0 or CC-BY license is recommended when possible, but there may be commercial or intellectual property limitations for your research. Learn more about data licensing and protection in this guide from Cornell and about GW’s policies for sharing data.
  • Will any tools and software be needed to work with the data and metadata? How will those be provided?
  • How long should the data be preserved and made available? It may not be necessary or practical to preserve all the data in perpetuity. Deciding how long it should be available is important when selecting a repository site.

Additional Resources:

If you have questions about creating data management plans or need further resources or information for guidance, contact Sara Hoover, Metadata and Scholarly Publishing Librarian at shoover@gwu.edu.