Processing of Turkic Languages
Thesis title in Czech: | Zpracování tureckých jazyků |
---|---|
Thesis title in English: | Processing of Turkic Languages |
Key words: | morfologická analýza, valence, slovník |
English key words: | morphological analysis, valency, dictionary |
Academic year of topic announcement: | 2012/2013 |
Thesis type: | diploma thesis |
Thesis language: | angličtina |
Department: | Institute of Formal and Applied Linguistics (32-UFAL) |
Supervisor: | RNDr. Daniel Zeman, Ph.D. |
Author: | hidden - assigned and confirmed by the Study Dept. |
Date of registration: | 06.11.2012 |
Date of assignment: | 07.11.2012 |
Confirmed by Study dept. on: | 21.11.2012 |
Date and time of defence: | 02.09.2013 00:00 |
Date of electronic submission: | 02.08.2013 |
Date of submission of printed version: | 02.08.2013 |
Date of proceeded defence: | 02.09.2013 |
Opponents: | doc. RNDr. Markéta Lopatková, Ph.D. |
Guidelines |
Turkic languages, such as Turkish, pose a specific set of challenges for computational processing. Complex agglutinating morphology is central to many tasks. The goal of the thesis is to explore and evaluate existing publicly available tools for analysis of one selected Turkic language, to consider possibilities of their improvement, extension and/or application in higher-level tasks. In particular, we will focus on existing morphological analyzer(s) for Turkish and work with them. Higher levels include but are not limited to syntax (valency lexicon) and transfer-based machine translation. |
References |
A Freely Available Morphological Analyzer for Turkish, Çagrı Çöltekin, 2010
Projective and Non-Projective Turkish Parsing, Ruket Çakıcı, Jason Baldridge, 2006 Dependency Parsing of Turkish, Gulsen Eryigit, Joakim Nivre, Kemal Oflazer, 2008 Design and implementation of a computational lexicon for Turkish, Thesis by Abdullah Kurtulus Yorulmaz, 1997 Integrating Morphology with Multi-word Expression Processing in Turkish, Kemal Oflazer and Özlem Çetinoğlu, Bilge Say (2004) Automatic Acquisition of Subcategorization Frames for Turkish with Purely Statistical Methods, Yılmaz Kılıçaslan, Erdinç Uzun, Volkan Agun, Erdem Uçar (2007) Web-based Acquisition of Subcategorization Frames for Turkish, Erdinç Uzun, Yılmaz Kılıçaslan, Volkan Agun, Erdem Uçar (2008) |