LOGIN TO YOUR ACCOUNT

Username
Password
Remember Me
Or use your Academic/Social account:

CREATE AN ACCOUNT

Or use your Academic/Social account:

Congratulations!

You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.

Important!

Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message

CREATE AN ACCOUNT

Name:
Username:
Password:
Verify Password:
E-mail:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Name
LINDAT/CLARIN repository
Type
Data Repository
Items
40 Research Data
Compatibility
OpenAIRE Data (funded, referenced datasets)
OAI-PMH
http://lindat.mff.cuni.cz/repository/oai/openaire_data
More information
Detailed data provider information (Re3data)

 

  • No data provider publications found
  • WMT16 APE Shared Task Data

    Turchi, Marco; Chatterjee, Rajen; Negri, Matteo (2016)
    Publisher: Fondazione Bruno Kessler, Trento, Italy
    Projects: EC | QT21 (645452)
    Embargo end date: 2016/02/21
    Training, development and text data (the same used for the Sentence-level Quality Estimation task) consist in English-German triplets (source, target and post-edit) belonging to the IT domain and already tokenized. Training and development respectively contain 12,000 and 1,000 triplets, while the test set 2,000 instances. All data is provided by the EU project QT21 (http://www.qt21.eu/).

    Khresmoi Summary Translation Test Data 2.0

    Dušek, Ondřej; Hajič, Jan; Hlaváčová, Jaroslava; Libovický, Jindřich; Pecina, Pavel; Tamchyna, Aleš; Urešová, Zdeňka (2017)
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Projects: EC | KHRESMOI (257528)
    Embargo end date: 2017/04/03
    This package contains data sets for development (Section dev) and testing (Section test) of machine translation of sentences from summaries of medical articles between Czech, English, French, German, Hungarian, Polish, Spanish and Swedish. Version 2.0 extends the previous version by adding Hungarian, Polish, Spanish, and Swedish translations.

    Many Czech References for 50 Sentences Selected from WMT11 Data

    Bojar, Ondřej; Macháček, Matouš; Tamchyna, Aleš; Zeman, Daniel (2013)
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Projects: EC | MOSESCORE (288487)
    Embargo end date: 2013/12/10
    This dataset contains the whole set of very many Czech translations for 50 English source sentences coming from WMT11 test set (http://www.statmt.org/wmt11). In total, there are 15431447 Czech sentences, i.e. 300k reference translations per source English sentence on average, but the exact number greatly varies across sentences. You can find more details in included README file. If you use this dataset, please cite the following paper which describes the technique used to construc...

    Prague Czech-English Dependency Treebank 2.0 Coref

    Nedoluzhko, Anna; Novák, Michal; Cinková, Silvie; Mikulová, Marie; Mírovský, Jiří (2016)
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Projects: EC | QTLEAP (610516)
    Embargo end date: 2016/03/30
    The Prague Czech-English Dependency Treebank 2.0 Coref (PCEDT 2.0 Coref) is a parallel treebank building upon the original PCEDT 2.0 release and enriching it with the extended manual annotation of coreference, as well as with an improved automatic annotation of the coreferential expression alignment.

    Khresmoi Query Translation Test Data 1.0

    Pecina, Pavel; Dušek, Ondřej; Hajič, Jan; Urešová, Zdeňka (2013)
    Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
    Projects: EC | KHRESMOI (257528)
    Embargo end date: 2014/04/02
    This package contains data sets for development and testing of machine translation of medical search short queries between Czech, English, French, and German. The queries come from general public and medical experts.
  • Latest Documents Timeline

    Chart is loading... It may take a bit of time. Please be patient and don't reload the page.

    Document Types

    Chart is loading... It may take a bit of time. Please be patient and don't reload the page.

    Projects with most Research Data

    Chart is loading... It may take a bit of time. Please be patient and don't reload the page.

Share - Bookmark