Skip to main content

Guides for Researchers

How do I know if my research data is protected?

Learn more about what is research data and their protection by intellectual property rights

This guide, a user FAQs for Researchers, about the protection of research data, is part of the "User guide on copyright, open science and data", meant to offer a state of the art, legally advanced, but still manageable set of rules, guidelines, and resources to enable the full potential of Open Science in the EU research field with a view to addressing copyright and related rights issues.

  • What is Research Data?

    Research data are the evidence that underpins the answer to the research question, and can be used to validate findings regardless of its form (e.g. print, digital, or physical). These might be quantitative information or qualitative statements collected by researchers in the course of their work by experimentation, observation, modelling, interview or other methods, or information derived from existing evidence. Data may be raw or primary (e.g. direct from measurement or collection) or derived from primary data for subsequent analysis or interpretation (e.g. cleaned up or as an extract from a larger data set), or derived from existing sources where the rights may be held by others. Data may be defined as ‘relational’ or ‘functional’ components of research, thus signalling that their identification and value lies in whether and how researchers use them as evidence for claims. They may include, for example, statistics, collections of digital images, sound recordings, transcripts of interviews, survey data and fieldwork observations with appropriate annotations, an interpretation, an artwork, archives, found objects, published texts or a manuscript.

    In this information guide, the definition of research data is the one used in the Concordat on Open Research Data.
  • Protection of Research Data

    When is research data protected?

    Research data are likely to be a ‘bundle’ of different types of information and content, sourced from third parties, or created by the researcher. Research data may be factual and/or creative. Data as such, like facts, principles, mathematical concepts and methods are not protected by copyright. However, there are cases in which data, not as such but part of collections, can be protected. Additionally, whereas data as such are not protected by copyright, that does not mean data are not protected by other laws (e.g. confidential information or personal data). This guide will offer guidance in this area with a specific focus on research data.

    This information guide will focus on those elements of research data that, might be automatically protected by intellectual property rights such as copyright protected works and databases protected by copyright and/or the Sui Generis Database Right (SGDR).

    How do rules on research data impact on use?

    Research data may be:

    • Automatically protected by the law;
    • Regulated by contract;
    • Subject to community norms such as academic best practices.

    Multiple types of protection might exist in research data, or there may be elements that have no legal protection. As explained in this information guide, the default position is that the owners of protected rights can restrict the use of those rights. Licensing is a way to stipulate when and how protected rights in research data can be used by others. Before thinking about licensing or using research data, it is important to understand the rights that can arise automatically in research data. The rest of this guide will discuss copyright and the database right. Sui Generis Database Right – SGDR is a right protecting databases where there has been a substantial investment in obtaining, verifying or presenting the data.

    Copyright is a property right in certain types of original literary, artistic and scientific works. Copyright does not protect ideas. Confidentiality protects confidential information. This might be imposed by a contract or if the information is marked confidential. Use of confidential information might give rise to a claim for compensation if confidentiality is breached. Data Subject Rights arise in information that identifies individuals and are recognised by data protection laws in the EU. Patents are registered rights in novel inventions of products or processes. Patent protection is not specifically discussed in this FAQs. Some research data may not benefit from any legal protection, although moral and ethical considerations may apply.

  • Sui Generis Database Right (SGDR)

    What is the Sui Generis Database Right (SGDR) and how does it relate to other rights in Databases?

    For legal purposes, a ‘database’ means “a collection of independent works, data or other materials arranged in a systematic or methodical way and individually accessible by electronic or other means.”

    Databases are automatically protected by law if:

    1. the selection or arrangement of the contents are the author’s own intellectual creation – in which case copyright protection applies to the structure of the database (not to its content); or
    2. they qualify for a special IP right (called the sui generis database right (SGDR)) because there has been a substantial investment in obtaining, verifying or presenting the contents of a database (e.g. a database of poetry titles). Importantly, the investment in creating data does not count towards the threshold for SGDR protection. This may be quite confusing (especially because the difference between creating and obtaining data should not be read with the eyes of a scientists, i.e. it is not an epistemological issue, but with the eyes of competition law, i.e. the goal is to limit as much as possible so called single source databases). At present time, the best we know about how to distinguish creation from obtaining is in the field of sporting events: match fixtures are created data, therefore sports organisers do not have a SGDR in that. Live sporting event data (e.g. who scores at what minutes) are obtained data and therefore can be protected by SGDR. The SGDR has been heavily criticised especially from an innovation and scientific point of view, but as for now it is part of EU law and the above example is as accurate as it can be right now.

    The content of a database can also be composed of copyright protected works in the first place, such as a database of scholarly articles. However, in this case it is not the copyright in the database that extends to the content, there are simply two independent copyrights in the same database: the copyright in the database structure if it is sufficiently original, and the (independent) copyright in the element constituting the database. In addition, there may also be a SGDR right in the substantial investment in the database. The latter protects the database from substantial extractions, therefore effectively offering a form of protection to data that would otherwise not be protected.

    In conclusion, a database may be protected by up to 3 different rights that regulate 3 different uses. They may belong to 3 different authors/owners and follow 3 different rules in terms of exceptions and limitations. Easy isn’t it?

    Who owns SGDR?

    SGDR is owned by the maker of the database, which is usually the person or entity who bears the financial risk. Financial risk in academic research. In academic research the financial risk of projects is usually borne by the institution. It is therefore possible that the SGDR in a database produced during research may be owned by the institution, even where copyright arising in the database may be owned by the researcher. However, contracts among the different parties (funding grants, universities IP policies, researchers employment contracts, etc.) may provide differently.

    It is important to check the terms of any funding or collaboration agreement to see whether ownership of the deliverables is specified.

    How long does the SGDR last?

    15 years, renewable if a new substantial investment is made.

    What can SGDR owners do?

    The SGDR gives the right holder the right to perform or to authorise the extraction (copying) of substantial amount of data, or repeated extraction of insubstantial amount, and their reuse therefore effectively extending protection to the data.

  • Copyright

    What is Copyright law?

    Original literary, musical, dramatic and artistic works may be protected by copyright law. Certain non-original works such as the typographical arrangement of published editions, sound recordings, broadcasts, non-original photographs, the first fixation of films, etc. are also protected by so called rights related to copyright (or neighbouring rights). Copyright is automatic, there no need to register, deposit, apply a © symbol, etc., although all these activities are permitted and may be even helpful. However, they cannot be a condition for copyright protection (this rule applies almost everywhere in the world, as it is enshrined in the Berne Convention, the most relevant international copyright conventions). Copyright law does not protect ideas that are not expressed in an original form of expression, and does not protect data as such, principles, facts, etc.

    Copyright law grants the owner the exclusive right to control certain rights such as reproduction, redistribution, communication to the public etc.

    For example, when you write a research paper you are automatically the author of that paper. If you write it with others you all become co-authors (as long as everyone made an “authorial” contribution: proofreading, commenting, etc. are usually not considered authorial contributions).

    Who owns copyright?

    The author is usually the person creating the work, but if the work is created jointly, it may be jointly owned, and if a work is created by someone in the course of a contract of employment it may be owned by the employer (depending on the contract).

    How long does copyright last?

    The default rule in the EU is the life of the author plus 70 years. Related rights (e.g. sound recording, film fixations, non-original photographs, etc.) all have different periods, which are more or less harmonised in the EU.

    What can copyright owners do?

    Copyright owners have rights to restrict certain acts in relation to the protected work such as copying, distributing and communicating the work to the public. The effect is that in most instances use of copyright protected works requires permission from the owner.
    Moral rights are additional rights in copyright works. They are not harmonised at the EU level therefore each EU Member State can have different rules on moral rights. However, the Berne Convention stipulates some minimum standards that must be met. These are:

    • The right to be attributed as the author
    • The right to protect the integrity of the work (i.e. to object to alteration or derogatory treatment of the work that are prejudicial to the honour or reputation).

    Moral rights cannot be transferred, but in certain jurisdictions can be waived. They must last at least as long as the economic rights (in the EU 70 + author’s life), but can last longer and in certain jurisdictions they are perpetual.

    How can copyright protected works be used?

    In the EU, there is no concept of ‘fair use’ of works protected by copyright (or SGDR).

    There are however exceptions and limitations that play a similar role, however they do not reach the same flexibility or width as fair use (which is a US concept). Art. 5 of the Copyright in the Information Society Directive of 2001 has a list of about 21 exceptions and limitation to copyright. However, whereas this is a closed list (MS cannot create new exceptions) it is not mandatory (except for the case of temporary acts of reproduction), meaning that each MS can choose which ones to implement. This aspect has attracted strong criticisms as it is not conducive to a level playing field in the EU scientific landscape. For a list of the exceptions and limitations implemented in your country you should refer to your NOAD, which may maintain an updated list.

    In absence of an exception or limitation, uses covered by copyright are reserved to their authors. You may however still be able to reuse a work if it is licensed to you or it is generally licensed to the public with a public licence. Creative Commons Licences or FLOSS licences are examples of this type of public (and usually conditional) permissions.

    Depending on the type of licence, free use of the research data may be permitted, allowing for use/re-use or re-mixing. Other research data may be subject to certain conditions such as attribution of the original research data owner, or attribution of the owners of other rights in the research data. More restrictive licences may prevent using the research data for commercial purposes or not altering the research data.
    If the research data does not specify a licence, it is necessary to consider whether use of any protected content within the research data would infringe the rights of anyone.

    Are there any automatic legal rights in the dataset?

    Copyright and the sui generis database right (SGDR) arise automatically in qualifying works in the EU. If a dataset contains any protected works, the default position is that these rights are owned by the author or maker.

    What if the researcher is an employee of a research institution or a student?

    Depending on the employment contract and the institution’s intellectual property policies, the law may recognise an employee’s copyright as being owned by the employer.

    What if there is a contractual relationship between the researcher and any other party (e.g. funder)?

    A collaboration or funding agreement may specify that the ownership of the deliverable research lies with the funder, or that the funder has an automatic licence in the research.

    What if any of the data has been obtained from a third party who imposed terms and conditions?

    For example, datasets collated through social media platforms will be subject to that platform’s terms and conditions which may specify how the data can be used or made available.

    Who took the financial risk for the research?

    While copyright is usually owned by the author of the work, the SGDR in a database is owned by the person or organisation that bore the financial risk in creating the database. This should be verified on a case by case basis, but it is commonly the employer or the unit that has budgetary autonomy.

  • Training materials


Toolkit for researchers on legal issues

How do I know if my research data is protected

Learn more about what is research data and their protection by intellectual property rights

How do I license my research data

Learn more about licenses for research data and how to apply it

Can I reuse someone else’s research data

Learn more on how to reuse research data

Still have questions?

Contact us via our Helpdesk.
We try to respond within 48 hours.