Guides for Researchers
How to identify and assess Research Data Management (RDM) costs
What will it cost to manage and share my data?
Infographic providing information for researchers on the costs of research data management, how these can be addressed in advance, and the community resources available.
This version also provides an editable version of the document which can be completed with costs, activities, stakeholders, or cost drivers more relevant to the user's context.
The cost of data management
Under H2020 an Open Research Data pilot was introduced encouraging good data management and aiming to make research data open. When your Horizon 2020 project is part of the pilot, and your data meets certain conditions, you must deposit your data in a research data repository where they will be findable and accessible for others.
Data management and sharing activities need to be costed into research, in terms of the time and resources needed. By planning early, costs can be significantly reduced. Costs associated with open access to research data, can be claimed as eligible costs of any Horizon 2020 grant during the duration of the project under the conditions defined in the H2020 Grant Agreement: they must already be budgeted and accepted in the grant proposal, and note the “during the duration of the project”.
Do you want to get a quick, visually understandable overview of this issue? Take a look at this infographic produced by DCC in the context of the OpenAIRE's RDM Task Force, dedicated to DMP resources.
Estimating Costs RDM tool
DMP PHASE | ACTIVITY | COMMENTS AND SUGGESTIONS | COSTS |
---|---|---|---|
Preparing | Make a Data Management Plan |
|
2 hrs to 2 days, depending on the complexity of your project |
1. Data Collection |
Acquiring External datasets
Do you plan to use existing data, and is the data available at a commercial partner? |
|
Example: A faculty licence on a database for macro-economic analysis: €18.000/y |
1. Data Collection |
Formatting and organising
|
|
Per project organize style, format, names can be done by a student assistant at level 1* salary or data manager at level 2* salary |
1. Data Collection | Transcription
|
|
Example: Time needed for transcription - four to eight hours per hour recording |
1. Data Collection |
Consent for data sharing
|
|
Student assistant at level 1* salary or data manager at level 2* salary |
1. Data Collection |
Data transfer Are special measures needed to transfer data from mobile devices, from fieldwork sites or from home equipment to a central work server? |
Is software or hardware needed for data transfer, for encryption of confidential data before transfer, or for synchronisation of data files across sites? | Free encryption or data transfer software (i.e. SurfFileSender) is available in most cases |
2. Data Documentation |
Data description and Metadata
|
|
Examples: 4 hrs per single experiment (120 measurements) filling in 60 required metadata fields, with assistance of a data manager at level 2* salary Two to three weeks are costed into an average two year research grant application to prepare and collate materials for deposit More information: http://www.data-archive.ac.uk/help/user-faq |
2. Data Documentation |
Documentation Do you have documentation for the data that describes the context and methodology of how data were gathered, created, processed and quality controlled? |
|
Researcher at level 2* salary. |
3. Data Storage & Back-up |
Data backup
|
Institutional backup – included in standard indirect cost/overheadsadditional backup needed – cost according to number of copies to be kept, frequency of backup and storage media needed | Examples: University drive €0.80 per GB/y Cloud: €0.30 per GB/y2 x Harddrive: €0.14 per GB (single purchase) |
3. Data Storage & Back-up |
Data storage
|
|
Example: Cloud Database as a service:€160/Month (storage 5GB transfer 30GB)Database architect at level 2* salary |
4. Data Access & Security |
Data Access Do external people require access to research data? |
Does remote access via VPN or secure FTP need to be arranged for external people? | Mostly researchers can make use of existing, free services |
4. Data Access & Security |
Data security
|
|
Example: TTP (trusted third party), dependent on pseudonymisation type, ca. €1.000- €30.000 Existing encryption services could be used at no costs |
5. Data Preservation & Archiving |
File format Do data need to be converted to a standard or open format with long- term validity for long-term preservation? |
|
Researcher at level 2* salary |
6. Data Sharing & Reuse |
Anonymisation
|
|
Free software is available. AMNESIA is a data anonymization tool, that allows to remove identifying information from data. Student assistant at level 1* salary |
6. Data Sharing & Reuse |
Copyright
|
|
Juridical advice at level 3* salary |
6. Data Sharing & Reuse |
Data sharing
|
|
Examples: Completing a data repository upload form (i.e. Zenodo a free-of-charge repository) may take 15 min to 4 hrs Dryad €110 once (max 20 GB) DataverseNL €3.60 dper GB/yearCloud Database as a service:€160 /month (storage 5 GB, transfer 30 GB) |
6. Data Sharing & Reuse |
Data cleaning
|
|
Example: Data cleaning service: €270 to well over €1800 More information: http://datascopic.net/cost-of-data-cleansing/data-cleansing/ Researcher/data manager at level 2* salary |
6. Data Sharing & Reuse |
Digitisation Do analogue or paper-based research data (maps. newspaper clippings, photographs, images, text) need to be digitised to increase their potential for sharing? |
|
Example: Digitisation €0.50 per page (few pages) OR €320-390 per 1000 pages (OCR included) |
Overall |
Roles and responsibilities Do you need to allocate roles and responsibilities for various data management activities? |
If multiple partner institutions, researchers or funders are involved in research – consider cost of data management planning meetings or discussions | Travel costs, lunch, time |
Overall |
Operationalising data management What measures are needed to implement and operationalise data management throughout the research lifecycle? |
|
Data manager at level 2* salary |
* Local salary scales differ per country. E.g.:
- Level 1 (i.e. student assistant) ~ 17 euro per hour.
- Level 2 (researcher, data manager) ~60 euro per hour
- Level 3 (external expert) ~160 euro per hour.)
- UK Data Service (2013). Data management costing tool. UK Data Archive, University of Essex.
- Alisa Westerhof (UU), Tessa Pronk (UU),Annemiek van der Kuil(3TU & TUD), Annemie Mordant (UM)(2015). Data Management Bij wetenschappelijk onderzoek méér dan alleen storage. Landelijk Coördinatiepunt Research Data Management, The Netherlands.
How to use this costing tool?
Step 1: Check the data management activities in the table and tick those that may apply to your proposed research.
Step 2: For each selected activity, estimate the additional time and/or other resources needed and cost this, e.g. people’s time or physical resources needed such as hardware or software. Find out which resources, e.g. for data storage and backup, are available to you from your institution. Consider whether you need a dedicated data manager.
Step 3: Add these data management costs to your research application. Coordinate resourcing and costing with your institution, research office and institutional IT services.
Step 4: Plan the data management activities in advance to avoid them competing with the need to focus on research excellence.
Remember that when your research project nears the end you do not want these additional data management activities to compete with delivery of your planned outputs, writing of publications and the timely delivery of your project. At this later stage the costs of preparing data for sharing may be significantly higher.
How to calculate costs?
But we can help you get started on costing your curation activities. In this guide you can find a tool listing, explaining and estimating the cost of possible expenses of data management. Estimates for quantifying amounts are only indicative of the order of magnitude.