OpenAIRE/LIBER workshop "Dealing with Data. What's the role for the library?"Ghent, May 28 2013
The purpose of this workshop was to explore where the library sits in the whole data lifecycle, and pinpoint some areas as to what its role might be in the future. The workshop also invited a range of scientists to present their line of research to the audience, with idea about where the library might be able to support them.
Below is a summary and highlights of event, with some pointers for libraries that can contribute to the ongoing discussion about the library’s role in dealing with research data.
What is open science? A researcher’s perspective’ by Erik Mannens (UGent, MMLAB)
‘Open should be the default, and libraries can be the catalyst to promote this idea' was his message to the audience.[Presentation]
- Know your Data – researchers and librarians have to realize what a commodity their data represents
- Provide the tooling – while so much software is available, the library should be at least aware of what software there is out there
- Push open access in your daily work
Research Data Overview by Sarah Callaghan (British Atmospheric Data Centre)
Sarah Callaghan guided the audience through the steps of the data life-cycle as well as her personal experiences as a researcher creating data.
Just like with hand-made knitwear, reconstruction is only possible if you know all the details (i.e. the metadata) about how it was originally made.
- make preservation and access to data a reality by promoting use of correct metadata
- make sure researchers who share their data are properly credited
"Meet the scientists" A Researcher’s perspective.
This session aimed at highlighting to the audience the daily workflows of researchers and their opinions on what libraries can do for them when it comes to Research Data Management.
Joris Van Zundert (humanities): Joris explained that his data is fairly unstructured, highly complex and heterogeneous. Researchers don’t come from a community of storing and sharing data. Getting credit for sharing and publishing data is key, this will be the biggest push, even more so than funder policies. Libraries could organize storage support as subject data repositories for humanities are limited. Many disciplines in SSH need more machine access to data, and small-scale projects to work this out would be valuable
Aaike de Wever (life sciences): While storage and data centres are fairly abundant for this discipline, encouraging colleagues to share data is still a barrier. The development of accreditation mechanisms still has a long way to go. Libraries should play a role in encouraging deposit, linking the research workflow within the life-sciences. Guiding researchers to repositories is also key, recommending appropriate storage for the right data.
Dirk Van den Poel (marketing): Most data is highly sensitive and he mentioned that even anonymising data isn’t always totally safe. He explained that more open software is being used and there is a push to share the end result data. However this sort of data management takes time and the library could play a role here with support, such as assist researchers to plan the workflow for processing the data, and assist in managing it, so that it can be reinterpreted by other researchers.
- Take into account that researchers’ needs differ per discipline!
- Fear of not being properly credited and misuse of rights is widely spread
- Libraries can help sorting out practical issues like rights management, guiding towards repositories, workflow planning, ...
The data publishing process
'Supporting scientific communities by publishing data by Peggy Schaeffer (Dryad)
After giving a comprehensive overview of what Dryad does, Peggy from Dryad outlined some good pointers for libraries and data
'Data Publication: a Publisher's perspective' by Fiona Murphy (Wiley Publishers)
Fiona Murphy of Wiley spoke about what publishing data is all about: why it is important in terms of being cited and credited. The growing pressure funder mandates also plays a role.
- Surprise researchers by proving how much their data is consulted
- Just like publishers, libraries should re-think the role they can play in the research publication process in all its aspects
- Get acquainted with data journals
Actively Supporting Data Management : Learning from the Approach of Three Research Institutions
These talks gave best-practice examples of how three institutions have supported research data management.Three very different library approaches to support RDM were presented. It would be interesting to see how they can combine their resources in future training materials!
Developing online data management resources for researchers and librarians' by Robin Rice (Data Librarian, Edinburgh University Data Library) [Presentation]
'3TU.Datacentrum and Data Intelligence for Librarians: support to staff and researchers on data management' - Ellen Verbakel (Data Librarian, 3TU.Datacentrum, Delft University) [Presentation]
RDMRose -an open educational resource on Research Data Management tailored for information professionals by Andrew Cox (University of Sheffield) [Presentation]
- Help participants gain confidence by bringing them in contact with researchers during training
- Be flexible and provide variation in your training materials: no use for a 'one size fits all' approach
- Combine training resources, get in touch with other institutions organizing similar courses
Services and Tools
‘ZENODO - A new innovative service for sharing all research outputs’ by Lars Holm Nielsen (OpenAIRE)Zenodo is the re-branded OpenAIRE orphan repository for both publications and datasets. In terms of data it is suitable for long-term data
The strength of Zenodo is the ‘Community collections’ element which will flag up the relationships between publications and data. The collections can also be exported into local systems, including the links between objects.
CKAN - A Research Data Management Tool. The University of Lincoln's experience in implentation' by Joss Winn (Orbital project, University of Lincoln library)
- While RDM might be fairly new, it involves many processes that have existed in libraries for years –visibility, discoverability, curation, preservation
LIBER/OpenAIRE Panel Session
Wilma van Wezenbeek (TU Delft, Amsterdam)
Lukas Koster (Library, University of Amsterdam)
Wolfram Horstmann (University of Oxford Library)
Chair: Kevin Ashley, Digital Curation Centre (DCC)
Some key points from the Panel Session
Libraries can give practical advice, training and advocacy
Libraries can help with discoverability, visibility, and offering new contextual information to researchers
Library can help at the beginning and end of the life-cycle (planning and curation), especially when researchers move onto another institution
Libraries are not best suited to carry out storage and active data management. This is more the role of data centres and data libraries.
Starting to carry out a Data Audit would ‘break the ice’ to start data management planning
In the long term, libraries should mediate with publishers to work on standards for extracting metadata
LIBER’s ‘10 recommendations for data management’ are a key resource for the community (URL)
Another valuable resource is the DCC’s guide on ‘Setting up a research data service’ (URL)
Concluding Points from the workshop:
There is a perceived lack of confidence about dealing with data in research libraries
- acting in an advisory role
- guiding researchers to repositories
- making data visible
- support in RDM planning
- The library shouldn’t work in isolation – a collaborative approach with other units and other libraries might be the way forward in order to harness richer research information
The library should play a key role implementing standards. Putting standards in place is crucial, enabling institutions to implement standards.
Training: Embedding RDM in library courses so that a new breed of data librarians is integrated from the start