SubjectsSubjects(version: 901)
Course, academic year 2021/2022
Introduction to Computer Linguistics - NPFL012
Title: Úvod do počítačové lingvistiky
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Actual: from 2014
Semester: winter
E-Credits: 3
Hours per week, examination: winter s.:2/0 Ex [hours/week]
Capacity: unlimited
Min. number of students: unlimited
Virtual mobility / capacity: no
State of the course: taught
Language: Czech, English
Teaching methods: full-time
Guarantor: doc. RNDr. Vladislav Kuboň, Ph.D.
Class: Informatika Bc.
Classification: Informatics > Computer and Formal Linguistics
Is pre-requisite for: NPFL030, NPFL028
Annotation -
Last update: T_UFAL (10.05.2001)
The main goal of this course is to provide an overview of individual subfields of computational linguistics. Main issues being solved by these subfields are also mentioned. Among the subfields the course stresses are machine translation, syntactic parsing, morphology and corpus linguistics.
Course completion requirements -
Last update: doc. RNDr. Vladislav Kuboň, Ph.D. (15.10.2017)

The course finishes with a written exam. It contains 8-10 questions which concern the topics covered in lectures. One of the questions requires a detailed description of one of the algorithms described in the course of the semester.

Literature -
Last update: doc. RNDr. Vladislav Kuboň, Ph.D. (21.05.2007)

R. Grishman. Computational Linguistics: An Introduction. ACL Studies in Nantural Language Processing. Cambridge University Press, 1986.

B. Grosz, K. Sparck-Jones and B.L. Webber (eds.). Readings in Natural Language Processing. Morgan Kaufmann, los Altos, CA, 1986.

R. Hauser. Introduction to NLP. Springer-Verlag, 1987

KIRSCHNER. Z. (1983). MOSAIC - A Method of Automatic Extraction of Significant Terms from Texts. Praha : MFF UK, 1983. 124 s.

Králíková, K., Panevová, J. (1990). "ASIMUT - A Method for Automatic Information Retrieval from Full Texts", Explizite Beschreibung der Sprache und automatische Textbearbeitung XVII, Faculty of Mathematics and Physics, Charles University, Prague.

Jan Hajič, Alena Böhmová, Eva Hajičová, Barbora Vidová Hladká: "The Prague Dependency Treebank: A Three-Level Annotation Scenario". In: A. Abeillé (ed.): Treebanks: Building and Using Parsed Corpora, Amsterdam:Kluwer, 2000, pp. 103-127

Syllabus -
Last update: doc. RNDr. Vladislav Kuboň, Ph.D. (21.05.2007)

1. Introduction

2. Natural language, its function and structure

3. Introduction to morphology

4. Morphological applications - ASIMUT, MOSAIC, spellchecking

5. Natural language syntax - theories and tools

6. Machine translation

7. Corpora - Brown corpus, PennTreebank, Czech National Corpus, Prague Dependency Treebank

8. Statistical methods

9. Introduction to semantics

10. Wordnet and ontologies

11. Stock of Shared Knowledge

Charles University | Information system of Charles University |