Research data is increasingly seen as part of the corpus of scholarly publications. Publishers, funders and governments support researchers to publish their data outputs by various policies, guidelines and mandates.
Data publication options
Researchers have many options when publishing data and each has different impacts on reuse, attribution, reach and discoverability of research. The Data publishing continuum was developed by TERN (Terrestrial Ecosystem Research Network) to represent different approaches to data publishing and their impacts.
Data can be published via:
- sharing information about research datasets through rich metadata records in repositories, most often at institutions eg this record in the CSIRO Data Access Portal
- syndicating those metadata records to Research Data Australia or discipline-specific portals or repositories
- formally published through Data journals
- informal publication such as via personal or commercial repositories or websites
- citation and attribution of data: ANDS Guide - Data citation
Most research data is stored and/or described in a research data repository. These are enduring services (including human, policy and technology elements) that enable the ingesting, storing, archiving, preserving, and publishing of research data and associated research outputs.
Considerations when publishing data
- Data publishing principles - six principles of data publishing as defined by the Nature Publishing Group
- Copyright, data and licensing Guide
- Connect publications through persistent identifiers
- Digital Object Identifiers (DOI) System for research data - ANDS Guide
Data is more than just datasets: consider publishing associated grey literature, software, algorithms, workflows. eg Data visualisations move past two dimensional flatland lists, and encourage the exploration of data complexity in an engaging, interesting ways through wrapping images in stories.
Data publication consensus and controversies (journal article): Kratz J and Strasser C. Data publication consensus and controversies [version 1; referees: 1 approved with reservations]. F1000Research 2014, 3:94 (doi: 10.12688/f1000research.3979.1)
Data sharing insights
Publishing data includes sharing information about research datasets with Research Data Australia or discipline-specific portals or repositories, as well as making collections available to other researchers, either on an individual basis or through a formal publication process.
A research data repository is an enduring service (including human, policy and technology elements) that enables the ingesting, storing, archiving, preserving, and publishing of research data.
Sharing information about research datasets
|Research Data Australia||Using Research Data Australia to find research datasets produced by or relevant to Australian researchers.|
|Explains how institutions can share information about datasets created by their researchers with Research Data Australia.|
|The data curation continuum||Explains how effective sharing and reuse of research datasets is built upon end-to-end data management processes.|
|The data publishing Continuum||Developed by TERN (Terrestrial Ecosystem Research Network) to represent different approaches to data publishing and their impacts.|
|Data publishing principles||The six principles of data publishing as defined by the Nature Publishing Group.|
|Provides an overview of this evolving publishing paradigm for research data.|
|Data visualisation||Visualising moves past two dimensional flatland lists, and encourages the exploration of data complexity in an engaging, interesting, and through wrapping images in stories in compelling ways. Stories help build communities and pictures help.|
|Data discovery and access||Data must be able to be discovered and accessed before it can be reused.|
|Overview to providing collection descriptions||Technical overview of the processes of generating collection descriptions and providing them to Research Data Australia.|
Sharing research datasets
|Copyright, data and licensing||Explains how copyright applies to research datasets and describes the use of open content licensing for research datasets.|
|Handle minting Service (Identify My Data)||Guide to obtaining unique identifiers for research datasets or other resources.|
|Persistent Identifiers||Introduction to persistent identifiers and how they work.|
|Persistent Identifiers (working level)||Detailed guide to using persistent identifiers with research datasets.|
|Persistent Identifiers (expert level)||A technical infrastructure perspective on persistent identifiers.|
|Digital Object Identifier (DOI) System for research data||Explains the advantages of using DOIs for research datasets.|
|Data Citation||Introduction to citing research datasets in publications.|
|Data in Education||Some resources to support growing capability in data literacy.|
|Data publication consensus and controversies||Presents an overview of data publication initiatives under way and the current conversation, highlighting points of consensus and issues still in contention.|