Remember Me
Or use your Academic/Social account:


Or use your Academic/Social account:


You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.


Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message


Verify Password:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Biblio at Institute of Formal and Applied Linguistics
Institutional Repository
293 Publications
OpenAIRE 3.0 (OA, funding)
More information
Detailed data provider information (OpenDOAR)


  • Querying Diverse Treebanks in a Uniform Way

    Štěpánek, Jan; Pajas, Petr (2010)
    Projects: EC | EUROMATRIXPLUS (231720)
    The paper presents a system for querying treebanks in a uniform way. The system is able to work with both dependency and constituency based treebanks in any language. We demonstrate its abilities on 11 different treebanks. The query language used by the system provides many features not available in other existing systems while still keeping the performance efficient. The paper also describes the conversion of ten treebanks into a common XML-based format used by the system, touching t...

    NMT at CUNI

    Bojar, Ondřej (2017)
    Projects: EC | QT21 (645452)
    A summary of the development of neural MT at Charles University.

    Selecting Data for English-to-Czech Machine Translation

    Bojar, Ondřej; Tamchyna, Aleš; Kamran, Amir; Galuščáková, Petra; Stanojević, Miloš (2012)
    Projects: EC | EUROMATRIXPLUS (231720)
    We provide a few insights on data selection for machine translation. We evaluate the quality of the new CzEng 1.0, a parallel data source used in WMT12. We describe a simple technique for reducing out-of-vocabulary rate after phrase extraction. We discuss the benefits of tuning towards multiple reference translations for English-Czech language pair. We introduce a novel approach to data selection by full-text indexing and search: we select sentences similar to th...

    Maximum Entropy Translation Model in Dependency-Based MT Framework

    Popel, Martin; Žabokrtský, Zdeněk; Mareček, David (2010)
    Projects: EC | FAUST (247762), EC | EUROMATRIXPLUS (231720)
    Maximum Entropy Principle has been used successfully in various NLP tasks. In this paper we propose a forward translation model consisting of a set of maximum entropy classifiers: a separate classifier is trained for each (sufficiently frequent) source-side lemma. In this way the estimates of translation probabilities can be sensitive to a large number of features derived from the source sentence (including non-local features, features making use of sentence s...

    Incorporation of a valency lexicon into a TectoMT pipeline

    Kuboň, Vladislav; Klyueva, Natalia (2016)
    Projects: EC | QTLEAP (610516)
    In this paper, we focus on the incorporation of a valency lexicon into TectoMT system for Czech-Russian language pair. We demonstrate valency errors in MT output and describe how the introduction of a lexicon influenced the translation results. Though there was no impact on BLEU score, the manual inspection of concrete cases showed some improvement.
  • No data provider research data found
  • Latest Documents Timeline

    Chart is loading... It may take a bit of time. Please be patient and don't reload the page.

    Document Types

    Chart is loading... It may take a bit of time. Please be patient and don't reload the page.

    Funders in data provider publications

    Chart is loading... It may take a bit of time. Please be patient and don't reload the page.

    Projects with most Publications

    Chart is loading... It may take a bit of time. Please be patient and don't reload the page.

Share - Bookmark