Use of DataCite¶
OpenAIRE has adopted the DataCite Metadata Schema as the basis for harvesting and importing metadata about datasets from data archives.
The core mission of DataCite is to build and maintain a sustainable framework that makes it possible to cite data through the use of persistent identifiers, DOIs.
OpenAIRE shares the goal of the DataCite Metadata Schema — to provide a domain agnostic metadata schema and provide interoperability through a small number of properties — making interoperability possible in the simplest manner possible and as a result keep the technical barriers for implementation as low as possible.
We would like to thank DataCite for their kind support and help making these OpenAIRE Guidelines for Data Archive Managers possible.
What’s different¶
In this document you will find the needed properties to make your data archive compatible with the OpenAIRE infrastructure. OpenAIRE has adopted the DataCite Metadata Schema v3.1 with some minor adjustments.
- OpenAIRE accepts other persistent identifier schemes and not only a DOI in 1.1 identifierType (M) .
- OpenAIRE recommends exporting links to related publications and datasets (i.e. 12. RelatedIdentifier (MA) property and attributes are mandatory when applicable).
- OpenAIRE recommends using the 7. Contributor (MA/O) property to relate a dataset to funding information.
- DataCite optional properties 8. Date (M) and 17. Description (MA) are mandatory in OpenAIRE.
- OpenAIRE enforces an encoding scheme on DataCite property 16. Rights (MA) .
Access rights and license information¶
OpenAIRE uses the access rights to enable a better user experience by declaring the access rights clear and explicit in the portal. Access rights are specified using the 16. Rights (MA) property. Please see example of encoding scheme in the section above.
1 2 3
rightsURI=”info:eu-repo/semantics/openAccess” />
OpenAIRE further recommends including license information if available:
1 2 3 4 5 6
rightsURI=”info:eu-repo/semantics/openAccess” /> rightsURI=”http://creativecommons.org/licenses/by/4.0/”> Creative Commons Attribution 4.0 International
Funding information¶
OpenAIRE have a specific use of the 7. Contributor (MA/O) property to identify links to funding steams.
One of OpenAIRE’s main goals is to link research output to (EC) research funding. The following application of the contributor property allows unique and persistent identification of the funder who has funded wholly or partly the dataset described.
An example for linking a research output to the OpenAIREplus FP7 project:
1 2 3 4 5 6
contributorType="Funder"> European Commission nameIdentifierScheme="info"> info:eu-repo/grantAgreement/EC/FP7/282896
Related publications and datasets information¶
OpenAIRE harvests all datasets from a data repository, but exposes only certain datasets in the OpenAIRE portal. See the Set content for specific details of which datasets are exposed.
For example, datasets related to publication will be exposed in the OpenAIRE portal. The link between the dataset and publication may be explicit defined, as described in this section, or automatically inferred by the OpenAIRE infrastructure.
If the link is explicit defined, the dataset will be exposed in the OpenAIRE portal within 1-2 days after harvesting (a repository is harvested once a week on average). If the link is automatically inferred by the OpenAIRE infrastructure it may take up to a month after harvesting before the dataset is exposed in the OpenAIRE portal. It is thus mandatory when applicable to provide links to related publications and datasets when these links are available in the repository, and thereby ensure faster exposure of the dataset in the OpenAIRE portal.
Related identifiers¶
DataCite Metadata Schema allows linking publications and datasets by use of persistent identifiers to uniquely identify the resource being described (A) typically a dataset but not limited to that, and the related resource (B) in the case of OpenAIRE typically a publication or a dataset.
1 2 3 4 5
relatedIdentifierType="DOI" relationType="IsCitedBy"> 10.1234/bar
Embargo date information¶
For OpenAIRE two main types of dates are relevant. When the data were made available, published or uploaded to a formal database, this is the date the data were ‘’Issued’’.
Sometimes data may be embargoed for a period; this information should be managed by the data provider and expressed by exporting an ‘’Available’’ date to indicate the end of an embargo period and an ‘’Accepted’’ date to indicate the start of an embargo period.
1 2 3
dateType="Issued">2005-04-05
An embargo example:
1 2 3 4
dateType="Accepted">2011-12-01 dateType="Available">2012-12-01
© Copyright 2015,2022, OpenAIRE. This work is licensed under Creative Commons Attribution 4.0 International. Revision bb2c3ed5 .
Open Positions
Metadata is data providing information about data that makes findable, trackable and (re)usable.
It can include information such as contact information, geographic locations, details about units of measure, abbreviations or codes used in the dataset, instrument and protocol information, survey tool details, provenance and version information and much more.
Metadata formats and standards
Metadata can take many different forms, from free text to standardized, structured, machine-readable, extensible content. It is recommended to use a standard metadata format used in your field.
- Dublin Core — domain agnostic, basic and widely used metadata standard
- DDI (Data Documentation Initiative) — common standard for social, behavioral and economic sciences, including survey data
- EML (Ecological Metadata Language) — specific for ecology disciplines
- ISO 19115 and FGDC-CSDGM (Federal Geographic Data Committee’s Content Standard for Digital Geospatial Metadata) — for describing geospatial information
- MINSEQE (MINimal information about high throughput SEQeuencing Experiments) — Genomics standard
- FITS (Flexible Image Transport System) — Astronomy digital file standard that includes structured, embedded metadata
- MIBBI — Minimum Information for Biological and Biomedical Investigations
Where no appropriate, formal metadata standard exists, for internal use, writing “readme” style metadata is an appropriate strategy.
- DCC Metadata Standards
- RDA Metadata Directory
- FAIRsharing
OpenAIRE Guidelines
The OpenAIRE Guidelines are a suite of application profiles designed to allow research institutions to make their scholarly outputs visible through the OpenAIRE infrastructure. The profiles are based on established standards and designed to be used in conjunction with the OAI-PMH metadata harvesting protocol:
- The OpenAIRE Guidelines for Literature Repositories are based on Dublin Core;
- The OpenAIRE Guidelines for Data Archives are based on the DataCite Metadata Schema;
- The OpenAIRE Guidelines for CRIS Managers is based on CERIF.
While the focus of each profile is different, they allow for interlinking and the contextualization of research artefacts.
Documentation
Responsible organizations
- Maintainer: OpenAIRE
- View website
Identifiers
Internal MSC ID msc:m92
Relationships to other metadata standards
- This scheme is a profile of CERIF (Common European Research Information Format).
- This scheme is a profile of DataCite Metadata Schema.
- This scheme is a profile of Dublin Core.
Tools
This service validates OAI-PMH metadata records against the OpenAIRE Guidelines for publication repositories, data archives and current research information systems.
Dataverse is an open source web application to share, preserve, cite, explore, and analyze research data. It facilitates making data available to others, and allows you to replicate others’ work more easily. Researchers, journals, data authors, publishers, data distributors, and affiliated institutions all receive academic credit and web visibility. Dataverse has grown considerably over time and is now a major international collaborative project. We encourage you to join us.
The OpenAIRE Validator Service allows data source managers to validate the OAI-PMH publisher of their sources against the OpenAIRE Guidelines (different versions of the guidelines). The tool returns a detailed evaluation of compliance together with feedback on what has to be fixed in the metadata to achieve compliance.
OpenAIRE Guidelines¶
Welcome to the OpenAIRE Guidelines. The intention of this is to provide a public space to share OpenAIREs work on interoperability and to engage with the community. The OpenAIRE Guidelines helps repository managers expose publications, datasets and CRIS metadata via the OAI-PMH protocol in order to integrate with OpenAIRE infrastructure.
OpenAIRE Guidelines have been released for publication repositories, data archives, CRIS systems, software repositories and repositories of other research products respectively:
- OpenAIRE Guidelines for Literature, institutional, and thematic Repositories
- OpenAIRE Guidelines for Data Archives
- OpenAIRE Guidelines for CRIS Managers
- Draft OpenAIRE Guidelines for Software Repository Managers
- Draft OpenAIRE Guidelines for Other Research Products
The guidelines specifically provide guidance on how to specify:
- Access right
- Funding information
- Related publications, datasets, software etc..
Participate¶
You are invited to participate by commenting or editing the content. See our guide for how to get started:
How to Contribute ¶
OpenAIRE Validator¶
The OpenAIRE Validator service is integrated in the Content Provider Dashboard and allows to test your repository’s compatibility with the OpenAIRE Guidelines.
If validation succeeds the data source can be registered for regular aggregation and indexing in OpenAIRE. OpenAIRE allows for registration of institutional and thematic repositories registered in OpenDOAR, research data repositories registered in re3data, individual e-Journals, CRIS, aggregators and publishers.
Horizon 2020 Open Access Requirements¶
The European Commission has published Guidelines on Open Access to Scientific Publications and Research Data (version 1.0, 11-Dec-2013). By following the OpenAIRE Guidelines for Literature, institutional, and thematic Repositories it is ensured that specific requirements on bibliographic information about Open Access publications are met. These requirements are summarized here:
© Copyright 2015,2022, OpenAIRE. This work is licensed under Creative Commons Attribution 4.0 International.