Course, academic year 2023/2024
Dependency Grammars and Treebanks - NPFL075
Czech title: Závislostní gramatiky a korpusy
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Valid: from 2022
Semester: summer
E-Credits: 3
Hours per week, examination: summer s.:1/1, MC [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: taught
Language: Czech, English
Teaching methods: full-time
Additional information: https://ufal.mff.cuni.cz/courses/npfl075
Guarantor: RNDr. Daniel Zeman, Ph.D.
RNDr. Jiří Mírovský, Ph.D.
Class: Informatika Mgr. - Matematická lingvistika
Classification: Informatics > Computer and Formal Linguistics
Incompatibility: NPFX075
Interchangeability: NPFX075
Is incompatible with: NPFX075
Is interchangeable with: NPFX075
Annotation -
Last update: doc. Mgr. Barbora Vidová Hladká, Ph.D. (19.05.2020)
The goal of the course is to introduce the dependency-based description of natural languages, the principles of dependency-based grammar formalisms, and their application in morphologically and syntactically annotated corpora. The course focuses on the Prague Dependency Treebank and Universal Dependencies projects. Emphasis is also placed on annotation schemata and data formats, and on practical work with treebanks and useful tools. The course is designed for students with a computer science background as well as for linguists with some CS experience.
Course completion requirements -
Last update: RNDr. Jiří Mírovský, Ph.D. (10.05.2022)

To pass the course, you must actively participate in classes and submit all homework tasks. The quality of your homework solutions will determine your grade. More detailed information on grading can be found on the course webpage: https://ufal.mff.cuni.cz/courses/npfl075

Literature -
Last update: RNDr. Jiří Mírovský, Ph.D. (10.05.2022)
  • Hajič J., Hajičová E., Mikulová M., Mírovský J.: Prague Dependency Treebank. In N. Ide & J. Pustejovsky (eds.): Handbook of Linguistic Annotation. Dordrecht 2017, pp. 555-594.
  • PDT Guide - http://ufal.mff.cuni.cz/pdt2.0/
  • PDT documentation - https://ufal.mff.cuni.cz/pdt3.5/documentation
  • Nivre, J., de Marneffe, M.C., Ginter, F., Goldberg, Y., Hajič, J., Manning, C., McDonald, R., Petrov, S., Pyysalo, S., Silveira, N., Tsarfaty, R., Zeman, D. (2016) Universal Dependencies v1: A Multilingual Treebank Collection. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), ELRA, Paris, pp. 1659-1666.
  • Universal Dependencies - http://universaldependencies.org/
  • Zeman, D.: The World of Tokens, Tags and Trees. Studies in Computational and Theoretical Linguistics, vol. 19. ÚFAL, Praha, 2018, ISBN 978-80-88132-09-7.
  • Partee, B. H., ter Meulen, A., Wall, R.E.: Mathematical Methods in Linguistics. Studies in Linguistics and Philosophy, vol. 30, Kluwer Academic Publishers, Dordrecht, Boston, London, 1990 (chapter 16)

Syllabus -
Last update: doc. Mgr. Barbora Vidová Hladká, Ph.D. (29.01.2019)
  1. Trees as a data structure (dependency trees, phrase-based trees, word order, projectivity)
  2. Dependency and non-dependency relations in natural languages
  3. Family of Prague Dependency Treebanks - introduction and principles; Functional Generative Description as their theoretical basis
  4. Universal Dependencies - introduction and principles
  5. Stratificational approach to natural language description:
    • morphology and its annotation in dependency treebanks
    • (surface) syntax and its annotation in dependency treebanks
    • (deep) syntax and its annotation in dependency treebanks
  6. Annotation of selected deep syntactic phenomena
  7. Annotation schemata, data formats
  8. Tools (TrEd, PML-TQ, Udapi)
 