|
|
|
||
Last update: T_UFAL (09.05.2012)
|
|
||
Last update: T_UFAL (09.05.2012)
• Martin Popel, Zdeněk Žabokrtský: TectoMT: Modular NLP Framework. In Proceedings of IceTAL • Antonio M. Corbí-Bellot, Mikel L. Forcada, Sergio Ortiz-Rojas, Juan Antonio Pérez-Ortiz, Gema Sánchez-Ramírez, Felipe Sánchez-Martínez, Iñaki Alegria, Aingeru Mayor, Kepa Sarasola (2005) "An open-source shallow-transfer machine translation engine for the romance languages of Spain • Philip Resnik, Noah A. Smith, The Web as a parallel corpus • Rayid Ghani, Rosie Jones, Dunja Mladenic: "Building Minority Language Corpora by Learning to Generate Web Search Queries" KAIS |
|
||
Last update: T_UFAL (09.05.2012)
Model scenario 1: We want to construct a machine-translation system from/to a new language. First of all we need a parallel corpus of the language and English (or Czech or something else that is available). Further on, we will be interested in tools for morphological and syntactic analysis, named entity recognition etc. in order to improve translation quality.
Model scenario 2: We have little or no parallel data, thus we will focus on rule-based systems. We will propose a set of morphological tags, create a simple morphological and syntactic analyzer, if possible also bilingual lexicon. We will try to put all these pieces together in a rule-based translation system (Treex, Apertium) and use it for primitive translation. |