Find about the OpenAIRE Guidelines, and make your publication or data repository OpenAIRE compatible.
Find the steps you need to follow in order to register your repository in OpenAIRE.
- For researchers
- to indicate the relevance of a publication
- to indicate the impact level of particular authors
- For academic libraries
- to indicate the level of acceptance and usage of the repository hosting publications
- to support refinement and optimization of repository websites and metadata information by analysis of the use of devices, browsers, search engines and terms
- For funding organisation
- to indicate the impact of research results
- to provide insights on current trends in research
Aggregation requires standardization on the recording of usage events, exclusion of robot accesses and data exchange mechanisms. The OpenAIRE Guidelines for Usage Statistics v1.0, based on OAI-PMH, specify a common format of usage events and transfer protocol (OAI-PMH) for a straightforward adoption by data providers.
In alignment with the European Act of personal data protection (http://europa.eu/legislation_summaries/information_society/l14012_en.htm), the IP address, session-id and in some cases also the C-class Subnet must be pseudonymised before transferring the usage data to the aggregator service.
Repository log file conversion and transfer tools
Below is a list of tools that can help in conversion and transfer of collected usage data:
- SURFshare-sure (Statistics on the Usage of Repositories) provides a software for the conversion of Apache2 log files of repositories into OpenURL Context Object Files and for the OAI-PMH transfer to a log Aggregator, like OpenAIRE's Usage Data Aggregator Service.
- OA-Statistik Data Providers plugins for DSpace, WebDoc and OPUS repositories that convert repository usage data formats into the OpenURL Context format and expose them with OAI-PMH.
Check out regularly for an update of the list of tools.
OpenAIRE is an OA publications infrastructure, making visible funded research outcomes. We currently aggregate 20,217,132 publications and 518,052 datasets from 2,212 data providers, including many institutional repositories. We support a wide range of scholarly resources (see our content acquisition policy document to wee how we approach this issue). To this extent, OpenAIRE now additionally to literature repositories harvest CRIS systems and repositories for research data sets.Why register my repository/journal/CRIS in OpenAIRE?
When your content is harvested by OpenAIRE, you automatically comply with the European Commission’s policies on Open Access. All records (closed or open) with funding information attached are made more visible via OpenAIRE aggregated and viewed into different facets by a wide range of stakeholders.
1. Use the OpenAIRE Guidelines
OpenAIRE has defined a series of Interoperability Guidelines to assist data providers expose their content in an interoperable format.
- Literature repositories: use of vocabularies, and how to provide links to funding and other research entities (e.g., datasets), and access rights
- Data repositories: based on the latest DataCite XML Schema, extended with information on links to funding and access rights.
- CRIS systems: compatible with CERIF (EuroCRIS)
For literature repositories we have different status of compatibility depending on the guidelines version and the exposed content via OAI-Sets.
Please carefully analyse the proper version of the OpenAIRE guidelines, especially the metadata and vocabulary application profile, OAI-PMH usage instructions and available examples, and search for useful info about OpenAIRE compatibility improvements for your software repository platform.
For more comprehensive support, please access the recordings and slides from the last webinars on OpenAIRE compatibility for repository platforms: Proprietary platforms: slides & recordings; EPrints platforms: slides & recordings; DSpace platforms: slides & recordings (access also to the slides on the OR2014 conference workshop).Are the guidelines backwards-compatible?
Yes. Latest schema guarantees backwards-compatibility with previous versions. As the OpenAIRE OAI set has been renamed from ‘ec_fundedresources’ to ‘openaire’, the OpenAIRE infrastructure ensures the continuity of harvesting EC funded resources content based on the old set, and specifically those repositories are identified as OpenAIRE 2.0 compatible.
2. VALIDATE AND REGISTER YOUR DATA SOURCE
OpenAIRE provides a validator service that allows you to test and validate your data source/content for any of our guideline sets. Once you do this, you can proceed and register your data source and join OpenAIRE.
Before you register your repository with OpenAIRE we ask you to take an extra step: register with the global repository registries OpenDOAR or re3data.org. Why do we do this? Your data, as well as OpenAIRE, is part of a global infrastructure landscape and we need to use these registries as they are a key component of this landscape.
Next, you use our registrtation service and specify information about the OAI-BaseURl and OAI-Sets OpenAIRE should harvest from. When you register, an automatic validation will start and the OpenAIRE team will contact you.
You can always use the validation and registration service to update to newer OpenAIRE guidelines compatibility levels, or edit infromation about what to be harvested.Where can I find more information?
- Learn and discuss our guidelines: https://guidelines.openaire.eu
- Test and register the repository: http://validator.openaire.eu
- Access the OpenAIRE API documentation: http://api.openaire.eu
- Find useful info for Repository Managers: https://www.openaire.eu/rep-man-toolkit/repository-managers
- Browse/search for OpenAIRE data providers: https://www.openaire.eu/search/openaire-data-providers
- Check out OpenAIRE's Content Acquisition policy: https://www.openaire.eu/openaire-s-content-acquisition-policy/download
OpenAIRE, in alginment with recent national initiatives in the UK (JISC PIRUS2), the Netherlands (SURF SURE), Germany (OA-Statistik), and Knowledge Exchange Group on Usage statistics is harvesting usage data from repositories in an effort to produce metrics that will lead to an alternative way of assessing of FP7 OA publications.
Usage statistical data has enormous potential as
- it is available much earlier than citation analysis: recording is started immediately after publication - very rapid indicator of scholarly trends (no publication delays)
- it can be recorded for all types of scholarly communications: papers, journals, preprints, blogs, datasets, chemical structures, software etc. (and not just 10,000 journals)
- metrics for the item-level (i.e., publication) are more meaningful especially for documents on repositories
- it can be applied in an aggregated form so that more accurate and timely conclusions are derived within a specific discipline or region
- it will allow to compare usage data across repositories on a European level, when centrally aggregated
- Removal of all malformed usage events .
- Unique publication identifier generation (currently by cross referencing DRIVER's identifiers).
- Filtering out of all robot initiated requests(COUNTER and custom black robot identification lists).
- Multiple simultaneous streams consolidation.
- Type of request (download full text or view metadata) deduction for uknown types of events.
- Double click filtering according to the COUNTER rules.
- Unique requester detection, which uses multiple input event fields to enhance precision.
- Grouping of multiple events of each requester into sessions (inactivity rule of 30 mins).