Automatické osvojení vzorů s minimální supervizí
Thesis title in Czech: | Automatické osvojení vzorů s minimální supervizí |
---|---|
Thesis title in English: | Acquisition of inflectional paradigms with minimal supervision |
Key words: | strojové učení, morfologie, fonologie, vzory |
English key words: | machine learning, morphology, phonology, inflectional paradigms |
Academic year of topic announcement: | 2010/2011 |
Thesis type: | diploma thesis |
Thesis language: | angličtina |
Department: | Institute of Formal and Applied Linguistics (32-UFAL) |
Supervisor: | RNDr. Jiří Hana, Ph.D. |
Author: | hidden - assigned and confirmed by the Study Dept. |
Date of registration: | 17.06.2010 |
Date of assignment: | 17.06.2011 |
Confirmed by Study dept. on: | 03.10.2011 |
Date and time of defence: | 10.05.2012 13:00 |
Date of electronic submission: | 13.04.2012 |
Date of submission of printed version: | 13.04.2012 |
Date of proceeded defence: | 10.05.2012 |
Opponents: | RNDr. Jaroslava Hlaváčová, Ph.D. |
Guidelines |
The goal of this thesis is the creation of a system automatically discovering inflectional paradigms including basic phonological/graphemic alternations (e.g., palatalization of 'r' into 'ř' in 'dobr-ý' - 'dobř-í'). It should be applicable to any fusional language, such as Czech. The system should take advantage of a limited amount of data provided manually - i.e. it should be neither a traditional labor and knowledge intensive system, nor a completely unsupervised system. |
References |
John Goldsmith. 2001. Unsupervised learning of the morphology of a natural language. Comput. Linguist. 27, 2 (Jun. 2001), 153-198. DOI=http://dx.doi.org/10.1162/089120101750300490
Jirka Hana. 2008. Knowledge- and labor-light morphological analysis. OSUWPL, Volume 58, Fall 2008, pp. 52-84. http://ufal.mff.cuni.cz/~hana/bib/hana-2008-wp-morph.pdf Kemal Oflazer, Sergei Nirenburg, and Marjorie McShane. 2001. Bootstrapping morphological analyzers by combining human elicitation and machine learning. Comput. Linguist. 27, 1 (March 2001), 59-85. DOI=10.1162/089120101300346804 http://dx.doi.org/10.1162/089120101300346804 www.aclweb.org/anthology/J01-1003.pdf David Yarowsky and Richard Wicentowski. 2000. Minimally supervised morphological analysis by multimodal alignment. In Proceedings of the 38th Annual Meeting on Association for Computational Linguistics (ACL '00). Association for Computational Linguistics, Morristown, NJ, USA, 207-216. DOI=http://dx.doi.org/10.3115/1075218.1075245 |