Mida kasutusstatistikat pakub huvigruppide?

Prindi PDF
  • Teadlastele
    • näitab publikatsiooni asjakohasust
    • näitab ühe autori mõju ulatust
  • Teadusraamatukogudele
    • näitab repositooriumis hoiustatud materjalide vastuvõtmise ja kasutamise taset
    • kasutamisanalüüs seadmete, veebilehitsejate, otsingumootorid ja otsisõnade alusel toetab repositooriumi veebilehtede ja metaandmete täiustamist ning optimeerimist
  • Rahastamise korraldajatele
    • näitab mõju teadustöö tulemuste
    • annab teadmisi praeguste suundumuste kohta teaduses

Why are usage statistics important?

Prindi PDF

Usage statistical data has enormous potential as

  • it is available much earlier than citation analysis: recording is started immediately after publication - very rapid indicator of scholarly trends (no publication delays)
  • it can be recorded for all types of scholarly communications: papers, journals, preprints, blogs, datasets, chemical structures, software etc. (and not just 10,000 journals)
  • metrics for the item-level (i.e., publication) are more meaningful especially for documents on repositories
  • it can be applied in an aggregated form so that more accurate and timely conclusions are derived within a specific discipline or region
  • it will allow to compare usage data across repositories on a European level, when centrally aggregated

How can your repository participate?

Prindi PDF

Aggregation requires standardization on the recording of usage events, exclusion of robot accesses and data exchange mechanisms. The icon OpenAIRE Guidelines for Usage Statistics v1.0, based on OAI-PMH, specify a common format of usage events and transfer protocol (OAI-PMH) for a straightforward adoption by data providers.

Legal  issues - Privacy Policy

In alignment with the European Act of personal data protection (http://europa.eu/legislation_summaries/information_society/l14012_en.htm), the IP address, session-id and in some cases also the C-class Subnet must be pseudonymised before transferring the usage data to the aggregator service.

Repository log file conversion and transfer tools

Below is a list of tools that can help in conversion and transfer of collected usage data:

  • SURFshare-sure (Statistics on the Usage of Repositories) provides a software for the conversion of Apache2 log files of repositories into OpenURL Context Object Files and for the OAI-PMH transfer to a log Aggregator, like OpenAIRE's Usage Data Aggregator Service.
  • OA-Statistik Data Providers plugins for DSpace, WebDoc and OPUS repositories that convert repository usage data formats into the OpenURL Context format and expose them with OAI-PMH.

Check out regularly for an update of the list of tools.

How is aggregating performed?

Prindi PDF
Once the usage data is  harvested in OpenAIRE, it is cleaned and harmonized to procude more accurate statistics. Some of these transformation processes include:
  • Removal of all malformed usage events .
  • Unique publication identifier generation (currently by cross referencing DRIVER's identifiers).
  • Filtering out of all robot initiated requests(COUNTER and custom black robot identification lists).
  • Multiple simultaneous streams consolidation.
  • Type of request (download full text or view metadata) deduction for uknown types of events.
  • Double click filtering according to the COUNTER rules.
  • Unique requester detection, which uses multiple input event fields to enhance precision.
  • Grouping of multiple events of each requester into sessions (inactivity rule of 30 mins).
When sufficient usage data is available to OpenAIRE, additional types of analyses will be explored in search of usage impact metrics. Also, when combined with other types of data (e.g., author ids, citation, etc.) additional results may be deduced.