“Open data and content can be freely used, modified and shared by anyone for any purpose”
Tim Berners-Lee’s proposal for five star open data http://5stardata.info
|One star||make your stuff available on the Web (whatever format) under an open licence|
|Two stars||make it available as structured data (e.g. Excel instead of a scan of a table)|
|Three stars||use non-proprietary formats (e.g. CSV instead of Excel)|
|Four stars||use URIs to denote things, so that people can point at your stuff|
|Five stars||link your data to other data to provide context|
4 Key steps in very approximate order — many of the steps can be done simultaneously.
Basically, everything that is needed to replicate a study should be available. Plus everything that is potentially useful for others.
"It was a mistake in a spreadsheet that could have been easily overlooked: a few rows left out of an equation to average the values in a column. The spreadsheet was used to draw the conclusion of an influential 2010 economics paper: that public debt of more than 90% of GDP slows down growth. This conclusion was later cited by the International Monetary Fund and the UK Treasury to justify programmes of austerity that have arguably led to riots, poverty and lost jobs."
A study that analysed the citation counts of 10,555 papers on gene expression studies that created microarray data, showed: “studies that made data available in a public repository received 9% more citations than similar studies for which the data was not made available” (Source: Data reuse and the open data citation advantage, Piwowar, H. & Vision, T.)
A Data Management Plan (DMP) is a brief plan to define:
DMPs are often submitted as part of grant applications, but are useful whenever researchers are creating data.
Scientific research data should be easily: