Assignment details
Thesis title in Czech: Výpočetní modely slovotvorby
Thesis title in English: Computational Models of Word Formation
Key words: vektorová reprezentace slov, slovotvorba, morfologie
English key words: vector space models, word formation, morphology
Academic year of topic announcement: 2023/2024
Type of assignment: dissertation
Thesis language:
Department: Institute of Formal and Applied Linguistics (32-UFAL)
Supervisor: doc. Ing. Zdeněk Žabokrtský, Ph.D.
Word formation data resources harmonized across multiple natural languages were almost non-existent until very recently ([1], [2]), which limited the development of models whose validity could be tested empirically in a multilingual setting. The aim of the thesis is to develop, implement, and evaluate word formation models that make use of modern distributional vector space word representations (word embedding models), with a special focus on derivational morphology ([3]) and on multilingual aspects ([4]). Optionally, the optimization criteria used in the models can be interpreted in terms of information theory, and might reflect hierarchical interactions in a language’s vocabulary, biological and cognitive biases relevant to natural languages, and perspectives from language evolution.
