Research Data Management and Sharing
- Home
- Research Data Lifecycle
- Get Started with Your DMSP Toggle Dropdown
-
Sharing & Storage
- Storage and sharing through the Research Lifecycle
- Storing and sharing data while working on a project (Create and Explore Phases)
- Publishing and archiving data (Share, Preserve and Re-Use Phases)
- ASU Research Data Repository
- Disciplinary Research Data Repositories
- General Research Data Repositories
- Licensing and Data
- Using CCO for datasets
- Citing Data Sets
- Tutorials
- Preservation
Storage and sharing through the Research Lifecycle
Understanding your data sharing and storage options depends on where you are in your project. The active research phase versus the final publication phase of a project has different needs and platforms. The following sections clarify the distinction between repositories and storage services.
Active Research: Create and Explore Phases
- Sharing Data: During the active research phase, sharing data involves providing access to datasets, code, and other research materials with collaborators, colleagues, or project team members. This sharing facilitates collaboration, facilitates verification of findings, and enables data reuse within the research team.
- Example: Sharing raw experimental data with lab members facilitates collaborative analysis and interpretation of the data.
Final Publication: Share, Preserve and Re-Use Phases
- Sharing Data: In the final publication phase, sharing data involves making research outputs, such as datasets, code, and supplementary materials, publicly accessible to the broader scientific community. This sharing promotes transparency, reproducibility, and the potential for further research and discovery beyond the original study.
- Example: Depositing curated datasets in a disciplinary repository along with a published research article to enable independent verification and reuse by other researchers.
Storage Needs:
- Active Research (Create and Explore Phases): During the active research, storage needs typically focus on securely managing and organizing data for ongoing analysis, collaboration, and experimentation within the research team. These needs may involve using local servers, cloud storage platforms, or institutional data repositories.
- Final Publication (Share, Preserve and Re-Use Phases): When you publish, storage needs shift towards ensuring the long-term preservation and accessibility of research data for future use and reference. This action requires depositing data in trusted repositories or archives equipped with preservation infrastructure and public access policies.
Repositories vs. Storage Services:
- Repositories: Research data repositories serve as publication and preservation systems designed to enable public access, discovery, and reuse of research data. They provide infrastructure and services for long-term storage, metadata documentation, versioning, and access control, often adhering to community- or discipline-specific standards.
- Storage Services: In contrast, basic storage services such as Dropbox or Google Drive primarily offer file storage and sharing capabilities for general-purpose use. While they may provide convenient storage solutions for active research data management, they lack the specialized features and sustainability measures necessary for long-term preservation and public access.
Research Data Publishing Analogy:
Similar to article publishing, research data publishing is a process that involves publishing scholarly articles, but requires additional work on the researcher's part to prepare the data for public use. Just as journal articles are disseminated through publication platforms to achieve broader visibility and accessibility, research data is published in repositories to facilitate discovery, verification, and reuse by the scientific community.
Storing and sharing data while working on a project (Create and Explore Phases)
Research Data Storage Best Practice
Regardless of where you choose to share and store your research data, you and your team should back up your data and maintain three copies:
- The original files
- External, local copies (e.g., external hard drive, department server)
- External, remote copies via remote cloud storage supported by ASU, such as
- Google Drive
- Faculty, Staff, Student Workers, and Retired Employees 20GB, Students 5GB
- Colleges, departments, or units can purchase an additional 10TB of Google Drive storage for $3,600 a year. Contact: googledriveincrease@asu.edu.
- Dropbox 1TB limit for individuals, 25TB for teams
- OneDrive account: 1TB for individuals, 5TB for teams
The sensitivity and classification of your data may dictate what resources you can use. See ASU Research Data Management Services for guidance on planning your data management strategy and to learn more about ASU research data storage options. Contact the Research Data Management Office for assistance.
Additional best practice information: UK Data Archive's guide on Managing and Sharing Data (PDF)
ASU Research Computing
The ASU Research Computing team provides state-of-the-art computing clusters to support research activities that require high-performance computing power. Research Computing also offers workshops on various skills and holds office hours. Visit Research Computing Dates and Times to request a consultation during their office hours and view the scheduled workshops.
Electronic Research Notebooks
An Electronic Research Notebook (ERN), often referred to as an Electronic Laboratory Notebook (ELN), is a multifunctional data manager designed to help consolidate research information in one place During your project, an ERN can quickly and accurately import protocols, notes, observations, and other data from a computer or mobile device, as well as static objects such as photographs of gels ERNs integrate into several types of lab instrumentation and store the generated data.
ASU now has an enterprise license for LabArchives. This commercial-grade research management platform supports the documentation of methods, uploading images, and connecting to project storage. LabArchives also provides Intellectual property protection, audit functions, and meets the data management expectations of federal agencies. You can find more information on the product at Knowledge Enterprise's Research Data Management Electronic Notebooks.
Publishing and archiving data (Share, Preserve and Re-Use Phases)
Where should I publish my data?
Institutions and funding agencies expect researchers to share their data in research data repositories upon completion of a project and publication of their research articles. They are more than just storage. Repositories provide rich metadata for discovery and context, documentation on how to use your data, persistent identifiers for long-term access, and discoverability of your research data.
Many repositories serve the research community, and it is worth checking with your funding source to see if they prefer where the data are published and archived. The following sections provide an overview of the types of research data repositories you can choose from and help you make an informed decision on how and where you share your research data.
Finding a repository to publish your data
Data publication and preservation support vary and might leave you asking which is suitable for your data. A few tools are being developed that might help you decide.
-
A searchable database of data repositories for identifying a suitable disciplinary repository
-
FAIRsharing.orgA curated, informative and educational resource on data and metadata standards, inter-related to databases and data policies. Identify and cite your discipline's standards, databases, or repositories when creating a data management plan, releasing data, or submitting a manuscript to a journal.
-
DataCite Repository FinderA pilot project of the Enabling FAIR Data Project led by the American Geophysical Union (AGU) in partnership with DataCite can help you find an appropriate repository to deposit your research data. DataCite hosts the tool and queries the re3data registry of research data repositories. The interface provides filtering for repositories that best support findable, accessibl
-
NNLM Data Repository FinderA tool to locate NIH-supported repositories for sharing research data. Answering questions to narrow the number of repositories to compare. Provided by the Network of the National Library of Medicine
-
DataSeerAn open source project supported by the Sloan Foundation that seeks to fill the need for a low-cost, scalable solution to help researchers comply with stakeholder data sharing policies. It aims to show researchers what they need to do with the datasets to comply with funder and journal mandates by analyzing their manuscripts. Upload some research text, and DataSeer works on recommendations and can send an accessibility report to funders and journals. DataSeer provides examples to demonstrate which datasets to share from a particular article, the proper format, and which research data repository is most suitable.
ASU Research Data Repository
The ASU Research Data Repository for university-affiliated researchers
The ASU Research Data Repository allows ASU-affiliated researchers to share, store, preserve, cite, explore, and make research data accessible and discoverable. Our librarians provide expertise in curating and sharing cross-disciplinary research data produced at Arizona State University.
Working on a grant proposal? Check out Writing the ASU Research Data Repository in your data management and sharing plan in the repository depositor user guide.
We also recommend exploring the repository to see how others share their data and to gain insight into how datasets are described and linked to relevant articles, presentations, and reports. You can also find data here and from other repositories in the ASU Library One Search.
Not all research data can be shared in ASU's institutional repository, so the ASU Library also works with you to identify alternative solutions if the repository is not the appropriate place to archive and share your data.
We accept research from ASU faculty and staff projects. For student-created projects, we recommend using the OSF or Zenodo, as described in the General Research Data Repositories section below.
Disciplinary Research Data Repositories
A variety of domain-based repositories are natural homes for your data. You can also increase the exposure of your data and collaboration opportunities for your research by depositing it in a disciplinary repository. Some repositories have fees for storing your files, so include those costs in your project proposal.
ASU in-house disciplinary options:
You may not need to look too far for a disciplinary solution. ASU is home to two well-known and established disciplinary research data repositories.
-
An international archaelogical digital archive and repository by the Center for Digital Antiquity
-
The Central Arizona–Phoenix Long-Term Ecological Research (CAP LTER) project, one of 28 sites in the LTER network funded by the National Science Foundation, studies the ecology of the greater Phoenix metropolitan area and surrounding Sonoran desert region. All data sets from the project are published and shared in the CAP LTER Data repository unless limited by either privacy or license restrictions.See Managing Your Project Data and Metadata for information about submitting your data for publication: https://sustainability-innovation.asu.edu/caplter/managing-your-data/
-
DataOneA community driven program providing access to data across multiple member repositories, supporting enhanced search and discovery of Earth and environmental data. DataONE also promotes best practices in data management through responsive educational resources and materials. You can browse and search the DataOne database.
-
ICPSR: Inter-University Consortium for Political and Social ResearchICPSR is an international consortium of academic institutions and research organizations, which provides data access, curation, and methods of analysis for the social science research community. ICPSR web portal provides access to a data archive of thousands of files of research in the social sciences. Searchable content includes specialized collections of data in education, aging, criminal justice, substance abuse, terrorism, and other fields.
-
ICPSR NIH DataICPSR also supports data archiving for NIH-funded data collections. They provide cost estimates, assist with planning NIH grants, and offer letters of support to bolster grant applications.
-
Adding dataset metadata to the NSF-PAR repositoryThe National Science Foundation (NSF) Public Access Repository (NSF-PAR) for project reporting is a metadata index that does not receive dataset file submissions. You must have previously deposited your dataset into a repository that associates a DOI with your dataset.
-
Data Repository Selection Decision Tree for Researchers in the Earth, Space, and Environmental SciencesA decision tree to help researchers identify the criteria they should consider when selecting an appropriate repository for their data.
The ASU Library assists with creating accounts, ICPSR services and policies, and information about data deposits. Request a consultation and ask for ICPSR support.
General Research Data Repositories
-
A repository that hosts research data underlying scientific and medical publications. Historically, the repository focused on the life sciences which has evolved into an interdisciplinary repository.
-
Manage your research and share your supplementary materials and data with the OSF, a scholarly web tool that enhances transparency, fosters collaboration, and increases the visibility of research at the institutional level.
The OSF limits the capacity of private projects and components utilizing OSF Storage to 5 GB and public projects and components to 50 GG. If your project exceeds these limits, see Calculating OSF Storage Costs to determine the appropriate amount of OSF storage and calculate the costs you will need to provide for your project. You can view ASU affiliated published projects at osf.asu.edu.
Sign in with ASU authentication and start today or contact ASU Library Researcher Support to learn more about OSF.
-
General-purpose data repository that accepts and preserves research output from data files to presentation files. Data is stored in the CERN Data Center, and open to researchers globally.
-
Generalist Repository Comparison ChartDesigned to assist researchers in finding a generalist repository should no domain
repository be available to preserve their research data. -
Open Access Directory's Data Repositories WikiA compilation of repositories and databases for open data, organized by subject, including both general and disciplinary repositories.
-
The repository selection flow chart is a product of the Generalist Repository Ecosystem Initiative (GREI) and is designed to guide users through a series of considerations for selecting the right repository for sharing data. (PDF)
Licensing and Data
Copyright and Data
The ownership and copyright of data can be complex. When considering data, it may be more useful to think about rights and responsibilities, which can be more contextual and granular this simple copyright ownership. Data sharing, access, use, and preservation all intersect with copyright in different ways, so it is beneficial to start by considering what is being done with the data, and who is responsible for it. Copyright is not straightforward with data. There are complicated questions of what is and is not protected by copyright if other intellectual property laws apply, and who owns the data, especially with regards to institutional IP policies. The resources below are a starting place to help you in navigating this tricky terrain.
-
Data LicensesFrom the Turing Way Guide for Reproducible Research, provides an overview of Creative Commons Licenses, Open Data Commons Licenses, and other considerations.
-
Open Data CommonsA set of 3 standardized licenses specifically meant to apply to data/databases
-
Copyrightability of Tables, Charts, and Graphs (PDF)An article explaining how copyright does and does not apply to tables, charts, and graphs.
-
Sharing Research Data and Intellectual Property Law: A PrimerAn article by Dr. Michael Carroll which explains how to work through the general intellectual property and contractual issues for all research data.
Citation: Carroll MW (2015) Sharing Research Data and Intellectual Property Law: A Primer. PLoS Biol 13(8): e1002235. https://doi.org/10.1371/journal.pbio.1002235 -
How to License Research Data - Digital Curation CentreThis guide will help you decide how to apply a license to your research data, and which license would be most suitable, as well as why licensing data is important, the impact licenses have on future research, and the potential pitfalls to avoid.
-
Who “owns” your data?From the University of California's Office of Scholarly Communication, this post by Katie Fortney walks through the complications around copyright/intellectual property ownership and data.
-
CC-BY and Data: Not Always a Good FitFrom the University of California's Office of Scholarly Communication blog: an article by Katie Fortney which explains Creative Commons (CC) licenses and public domain tools and how they work with data.
-
Sharing & Licensing Research OutputsThis instructional session will help you understand how research outputs like data, software, and code can be shared and licensed to maximize reuse and the contribution of knowledge. It explores: how copyright plays out for data, software, and code; other contracts and policies that affect what's "ours" and how we can use and share content; and license selection best practices.
Using CCO for datasets
We recommend using the CC0 license, also known as "Public Domain Dedication," along with a suggested citation. In many cases, data (such as collections of facts) may not be eligible for copyright protection. By assigning a license like CC-BY, you may create unnecessary barriers to reuse and confusion. You can ensure proper attribution by providing a suggested citation, as scholarly norms, rather than licenses, govern how data should be cited to demonstrate the credibility and validity of the research.
The University of California’s Office of Scholarly Communication provides an overview of this topic in their article "CC BY and Data: Not Always a Good Fit," which recommends using the CC0 license for many types of data. Additionally, CC0 serves as the default dedication when you publish your datasets in the ASU Research Data Repository.
Alternatively, the Open Database License (ODC-By) also includes an attribution variant.
For more information, please refer to the section on Licensing and Data below. If you have further questions, feel free to contact the ASU Library Researcher Support team for assistance with copyright licensing for authors and researchers.