Thesis (Selection of subject)Thesis (Selection of subject)(version: 368)
Thesis details
   Login via CAS
Language Resources for Yoruba
Thesis title in Czech: Jazykové zdroje pro jorubštinu
Thesis title in English: Language Resources for Yoruba
Key words: jorubština, nigerokonžské jazyky, morfologie, závislostní syntaxe
English key words: Yoruba, Niger-Congo languages, morphology, dependency syntax
Academic year of topic announcement: 2019/2020
Thesis type: dissertation
Thesis language: angličtina
Department: Institute of Formal and Applied Linguistics (32-UFAL)
Supervisor: RNDr. Daniel Zeman, Ph.D.
Author: hidden - assigned by the advisor
Date of registration: 08.10.2019
Date of assignment: 08.10.2019
Guidelines
The core of the work will be design of annotation guidelines specific for Yoruba, within the Universal Dependencies framework, and creation of annotated data that enable training of at least a small model for automatic tokenization, tagging and dependency parsing of this language.

Jádrem práce bude návrh anotačních pravidel specifických pro jorubštinu v rámci formalismu Universal Dependencies, a tvorba anotovaných dat umožňujících natrénování alespoň malého modelu pro automatickou tokenizaci, značkování a závislostní syntaktickou analýzu tohoto jazyka.
References
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajič, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman. 2016. Universal Dependencies v1: A Multilingual Treebank Collection. In Proceedings of LREC.
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html