Course, academic year 2023/2024
Introductory Seminar on Mathematical Linguistics II - NPFL031
Title: Úvodní seminář matematické lingvistiky II
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Valid from: 2020
Semester: summer
E-Credits: 3
Hours per week, examination: summer s.:0/2, C [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: cancelled
Language: Czech
Teaching methods: full-time
Guarantor: doc. RNDr. Vladimír Petkevič, CSc.
Class: Informatics, Master's - elective
Classification: Informatics > Computer and Formal Linguistics
Co-requisite : NPFL002
Annotation -
Last update: T_UFAL (25.05.2003)
The seminar follows on from the Introductory Seminar on Mathematical Linguistics I. It covers the following topics: morphological and syntactic analysis of natural languages; the Functional Generative Description of language (FGD); the main features of a formal description of sentence structure; an introduction to unification-based formalisms and grammars; prominent Western grammatical theories for describing natural language; and an introduction to corpus linguistics.
Literature - Czech
Last update: T_UFAL (23.05.2003)

Čermák F., J. Klímová, V. Petkevič (2000): Studie z korpusové lingvistiky. Nakladatelství Karolinum, Univerzita Karlova v Praze, Praha.

Český národní korpus (2000): Úvod a příručka uživatele. Ústav Českého národního korpusu FFUK. http://ucnk.ff.cuni.cz

Covington M. A. (1994): Natural Language Processing for Prolog Programmers. Prentice Hall, Englewood Cliffs, New Jersey.

Garside R., G. Leech, A. McEnery (eds.) (1997): Corpus Annotation. Linguistic Information from Computer Text Corpora. Longman, London, New York.

Karlsson F., A. Voutilainen, J. Heikkilä, A. Anttila (eds.) (1995): Constraint Grammar. A Language-Independent System for Parsing Unrestricted Text. Mouton de Gruyter, Berlin - New York.

Kennedy G. (1998): An Introduction to Corpus Linguistics. Longman, London.

Kuboň V. (2001): Problems of Robust Parsing of Czech. Doctoral dissertation. ÚFAL MFF UK.

Květoň P., K. Oliva (2002): Achieving an Almost Correct PoS-Tagged Corpus. Text, Speech and Dialogue. Proceedings of the Fifth International Conference, TSD 2002, LNAI 2448, Brno, Czech Republic 2002, 19-26.

McEnery A.M. (1992): Computational Linguistics. Sigma Press, Wilmslow.

Panevová J. (1980): Formy a funkce ve stavbě české věty. Academia, Praha.

Partee B. H., A. ter Meulen, R. E. Wall (1990): Mathematical Methods in Linguistics. Kluwer Academic Publishers, Dordrecht, Boston, London.

Petkevič V. (1995): A New Formal Specification of Underlying Structures. Theoretical Linguistics 21, No. 1, 1-61.

Sells P. (1985): Lectures on Contemporary Syntactic Theories. (An Introduction to Government-Binding Theory, Generalized Phrase Structure Grammar and Lexical Functional Grammar.) CSLI Lecture Note Series No. 3. CSLI Publications, Stanford (California).

Sgall P., E. Hajičová, E. Buráňová (1980): Aktuální členění věty v češtině. Academia, Praha.

Sgall P. (ed.) (1984): Contributions to Functional Syntax, Semantics, and Language Comprehension. Academia, Praha.

Sgall P. et al. (1986): Úvod do syntaxe a sémantiky. Academia, Praha.

Shieber S. (1986): An Introduction to Unification-Based Approaches to Grammar. CSLI Lecture Note Series No. 4. CSLI, Stanford (California).

Wintner S. (1998): Unification-based Linguistic Formalisms. 10th European Summer School in Logic, Language and Information. Saarbruecken, BRD.

Syllabus -
Last update: T_UFAL (23.05.2003)
I. Syntactic analysis (parsing) of natural languages

  • main objectives and problems of parsing in general and of parsing natural languages in particular
  • the input text to be analyzed, the grammar and the parser - mutual relations
  • declarative and procedural aspects of parsing
  • definite-clause grammars (DCG)
  • chunking, shallow parsing
  • some algorithms of syntactic analysis: top-down parsing, bottom-up parsing, left-corner parsing, chart-parsing, Earley algorithm
  • the relation of syntactic analysis and morphological disambiguation
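
As an illustration of the chart-based algorithms listed above, the following is a minimal sketch of an Earley recognizer. The toy grammar, the names Item, GRAMMAR and earley_recognize, and the sample sentences are assumptions made for this example and are not taken from the course materials. The sketch assumes a grammar without empty productions and only decides whether a sentence is accepted; it does not build parse trees.

```python
from collections import namedtuple

# A dotted rule "lhs -> rhs" with the dot before rhs[dot];
# origin is the input position where recognition of this rule began.
Item = namedtuple("Item", "lhs rhs dot origin")

# Hypothetical toy grammar: dictionary keys are nonterminals,
# every other symbol is treated as a terminal (a word).
GRAMMAR = {
    "S": [("NP", "VP")],
    "NP": [("Det", "N"), ("N",)],
    "VP": [("V", "NP"), ("V",)],
    "Det": [("the",)],
    "N": [("dog",), ("cat",)],
    "V": [("sees",), ("sleeps",)],
}


def earley_recognize(tokens, grammar=GRAMMAR, start="S"):
    """Return True if tokens can be derived from start (no empty rules assumed)."""
    chart = [set() for _ in range(len(tokens) + 1)]
    chart[0] = {Item(start, rhs, 0, 0) for rhs in grammar[start]}
    for i in range(len(tokens) + 1):
        agenda = list(chart[i])

        def add(item):
            # Add an item to the current chart column only once.
            if item not in chart[i]:
                chart[i].add(item)
                agenda.append(item)

        while agenda:
            item = agenda.pop()
            if item.dot == len(item.rhs):
                # Completer: advance every item that was waiting for item.lhs.
                for parent in list(chart[item.origin]):
                    if (parent.dot < len(parent.rhs)
                            and parent.rhs[parent.dot] == item.lhs):
                        add(parent._replace(dot=parent.dot + 1))
            else:
                symbol = item.rhs[item.dot]
                if symbol in grammar:
                    # Predictor: expand the expected nonterminal (top-down step).
                    for rhs in grammar[symbol]:
                        add(Item(symbol, rhs, 0, i))
                elif i < len(tokens) and tokens[i] == symbol:
                    # Scanner: the next word matches the expected terminal.
                    chart[i + 1].add(item._replace(dot=item.dot + 1))
    return any(
        it.lhs == start and it.dot == len(it.rhs) and it.origin == 0
        for it in chart[len(tokens)]
    )
```

Calling earley_recognize("the dog sees the cat".split()) accepts the sentence, while a scrambled input such as "the sees dog" is rejected. The predictor corresponds to the top-down component and the completer to the bottom-up component of the algorithm.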

II. The foundations of the Functional Generative Description of language (FGD)

  • stratificational linguistics
  • FGD as a representative of stratificational descriptions of natural language
  • characteristic properties of the levels in FGD, elementary and complex units
  • the relation of representation as the relation of form and function, asymmetric dualism
  • homonymy and synonymy
  • relationships between levels, transductive components

III. The level of language meaning in FGD (level of tectogrammatics)

  • language meaning and extralinguistic content
  • deep structure and surface structure
  • main features of the level of language meaning: syntactic structure of sentence - dependency, projectivity; description of coordination; topic-focus articulation (TFA); deep word order; negation and focalizing particles, their scope and their relation to TFA; grammatical and textual coreference
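
The notion of projectivity mentioned above can be made concrete with a short check. This is a sketch under stated assumptions: the dependency tree is encoded as a list of head indices (a hypothetical encoding chosen for this example), the tree is well formed, and the commonly used definition is applied, under which every word lying between a head and its dependent must itself be dominated by that head. The edge from the artificial root is not checked.

```python
def is_projective(heads):
    """Check projectivity of a dependency tree.

    heads[i - 1] is the head of word i (words are numbered from 1);
    0 marks the root of the sentence. Assumes a well-formed tree.
    """

    def dominated_by(node, ancestor):
        # Walk up the tree from node and look for ancestor.
        while node != 0:
            if node == ancestor:
                return True
            node = heads[node - 1]
        return False

    for dependent in range(1, len(heads) + 1):
        head = heads[dependent - 1]
        if head == 0:
            continue  # the root has no governing word to check against
        left, right = sorted((head, dependent))
        for between in range(left + 1, right):
            if not dominated_by(between, head):
                return False
    return True
```

For a three-word sentence, heads [2, 0, 2] describes a projective tree, whereas heads [3, 0, 2] is non-projective: the edge from word 3 to word 1 spans word 2, which is the root and therefore not dominated by word 3.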

IV. Executive parts of FGD

  • generative component and its mathematical expression: dependency grammar, pushdown dependency grammar
  • transductive components of FGD

V. FGD and the Prague Dependency Treebank

  • the objective and main features of the Prague Dependency Treebank
  • analytical level, tectogrammatical level

VI. Introduction to unification-based formalisms and grammars

  • declarative and procedural description of language, nonstratificational descriptions of language
  • motivation for the development of unification-based formalisms and their relation to context-free languages
  • feature structures
  • underspecification, subsumption, unification, generalization
  • directed acyclic graph
  • design of unification grammars for the description of various syntactic structures
  • types and their hierarchy
  • lexicalized grammars, relationship of dictionary and grammar
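
Unification and subsumption of feature structures, listed above, can be sketched as follows. Feature structures are modelled here as nested Python dictionaries with atomic string values. This is a deliberate simplification: reentrancy (structure sharing), which is why such formalisms are usually defined over directed acyclic graphs, is not modelled. The function names and feature names are assumptions made for this example.

```python
def unify(a, b):
    """Return the unification of two feature structures, or None if they clash."""
    if isinstance(a, dict) and isinstance(b, dict):
        result = dict(a)
        for feature, value in b.items():
            if feature in result:
                merged = unify(result[feature], value)
                if merged is None:
                    return None  # incompatible values under the same feature
                result[feature] = merged
            else:
                result[feature] = value
        return result
    # Atomic values (or an atom against a complex value) unify only if equal.
    return a if a == b else None


def subsumes(general, specific):
    """Return True if general carries no information that specific lacks."""
    if isinstance(general, dict) and isinstance(specific, dict):
        return all(
            feature in specific and subsumes(value, specific[feature])
            for feature, value in general.items()
        )
    return general == specific
```

For example, unifying {"cat": "NP", "agr": {"num": "sg"}} with {"agr": {"per": "3"}} yields a structure containing both agreement features, while unifying {"agr": {"num": "sg"}} with {"agr": {"num": "pl"}} fails. An underspecified structure such as {"cat": "NP"} subsumes every more specific structure that extends it.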

VII. Short survey of the prominent grammatical theories in the West

  • Government and Binding theory
  • Generalized Phrase-Structure Grammar
  • Lexical Functional Grammar
  • Head-Driven Phrase-Structure Grammar

VIII. Introduction to Corpus Linguistics

  • subject of corpus linguistics
  • a computer corpus as a collection of electronic texts
  • design and construction of a corpus
  • types of corpora
  • administrative and textual markup of corpora, their linguistic annotation (tagging); markup languages (SGML and XML)
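
To illustrate how markup and linguistic annotation can coexist in a corpus file, the sketch below reads a small XML fragment with Python's standard xml.etree.ElementTree module. The element and attribute names (doc, s, w, lemma, tag) and the tag values are hypothetical and do not reproduce the format of any particular corpus. The attributes on doc stand for administrative markup, the s elements for textual markup, and the lemma and tag attributes for linguistic annotation.

```python
import xml.etree.ElementTree as ET

# Hypothetical corpus fragment, invented for this example.
SAMPLE = """<doc id="sample1" source="hypothetical">
  <s id="s1">
    <w lemma="dog" tag="NOUN">dogs</w>
    <w lemma="sleep" tag="VERB">sleep</w>
  </s>
</doc>"""


def read_tokens(xml_text):
    """Return (word form, lemma, tag) triples for every w element."""
    root = ET.fromstring(xml_text)
    return [(w.text, w.get("lemma"), w.get("tag")) for w in root.iter("w")]
```

Calling read_tokens(SAMPLE) returns one triple per token, which keeps the word form apart from its annotation.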

 