LOGIN TO YOUR ACCOUNT

Username
Password
Remember Me
Or use your Academic/Social account:

CREATE AN ACCOUNT

Or use your Academic/Social account:

Congratulations!

You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.

Important!

Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message

CREATE AN ACCOUNT

Name:
Username:
Password:
Verify Password:
E-mail:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Publisher: Zenodo
Languages: English
Type: dataset
Subjects: Mars, information extraction, named entity recognition

This data set contains annotated text versions of 2-page abstracts published at the Lunar and Planetary Science Conference in 2015 and 2016.

The original PDF abstracts are available at:

  • https://www.hou.usra.edu/meetings/lpsc2015/programAbstracts/view/
  • https://www.hou.usra.edu/meetings/lpsc2016/programAbstracts/view/

The text files in this archive were extracted using the Apache Tika PDF parsing tool.  The text is provided here so that the annotations can be viewed.  The text content remains copyright of the original abstract authors.

The annotations (entities and relations) are provided in the format used by the brat annotation tool.  To view the annotations in a web-based graphical form, install the brat tool (http://brat.nlplab.org/).  These annotations were generated using brat v1.3.  The annotation files are also human-readable and can be parsed in to be used directly in code.

Contents:

  • lpsc15/: 62 abstracts
  • lpsc16/: 55 abstracts

Each directory contains a .txt and .ann file for each abstract.  The .ann file is in brat standoff format (http://brat.nlplab.org/standoff.html).

Additional .conf files are provided to generate color highlighting and keyboard shortcuts.  These are used by the brat tool.

Attribution:

If you use this data set in your own work, please cite this DOI:

10.5281/zenodo.1048419

Please also cite this paper, which provides additional details about the data set.

Kiri L. Wagstaff, Raymond Francis, Thamme Gowda, You Lu, Ellen Riloff, Karanjeet Singh, and Nina Lanza. "Mars Target Encyclopedia: Rock and Soil Composition Extracted from the Literature."  Proceedings of the Thirtieth Annual Conference on Innovative Applications of Artificial Intelligence, 2018.

  • No related publications.
  • No related research data.

Share - Bookmark

Download from

Funded by projects

No projects found

Cite this research data

Collected from