News
PANGAEA®: Data Publisher for Earth & Environmental Science
Interview with Michael Diepenbroek, PANGAEA Data Publisher for Earth & Environmental Science, MARUM- Center for Marine Environmental Sciences, University Bremen
Q: What is PANGAEA?
Data center, long-term repository, and data publisher (scope as in our name).
Q: How do you contact researchers and encourage them to deposit and share data?
We are embedded in science institutes and participate in science projects.
Q: How does one data library/archive handle the diversity of datatypes, formats, technical standards and so on?
We have a flexible data model and we map our internal formats to the mostly used standards.
Q: How are the data licensed?
CC-BY mostly.
Q: How do you think a data library/archive should measure success?
Usage, relevance to science.
Q: Thomson Reuters announced in June Data Citation Index for discovering global data sets – an upcoming research resource within the Web of KnowledgeSM to facilitate the discovery, use and attribution of data sets and data studies, and link those data to peer-reviewed literature. Do you cooperate with Thompson Reuters?
Thompson is harvesting our metadata to be integrated into the web of knowledge.
A note: More information about PANGAEA® impact is in the study of the impact of sharing data underlying publications (Data Sharing Effect on Article Citation Rate in Paleoceanography) in Jon Sears (AGU) - Abstract for the AGU 2011: “The validation of scientific results requires reproducible methods and data. Often, however, data sets supporting research articles are not openly accessible and interlinked. This analysis tests whether open sharing and linking of supporting data through the PANGAEA® data library measurably increases the citation rate of articles published between 1993 and 2010 in the journal Paleoceanography as reported in the Thomson Reuters Web of Science database. The 12.85% (171) of articles with publicly available supporting data sets received 19.94% (8,056) of the aggregate citations (40,409). Publicly available data were thus significantly (p=0.007, 95% confidence interval) associated with about 35% more citations per article than the average of all articles sampled over the 18-year study period (1,331), and the increase is fairly consistent over time (14 of 18 years). This relationship between openly available, curated data and increased citation rate may incentivize researchers to share their data.”