Course, academic year 2023/2024
Corpus Linguistics - Introduction Focused on Czech - ALF400221
Title: Korpusová lingvistika - úvod zaměřený na češtinu
Guaranteed by: Institute of Linguistics (21-ULING)
Faculty: Faculty of Arts
Actual: from 2017
Semester: both
Points: 0
E-Credits: 4
Examination process:
Hours per week, examination: 0/2, Ex [HT]
Capacity: winter:unknown / unknown (unknown)
summer:unknown / unknown (unknown)
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
Key competences:  
State of the course: cancelled
Language: Czech
Teaching methods: full-time
Level:  
Is provided by: AMLV00008
Note: you can enroll for the course in winter and in summer semester
Guarantor: prof. Mgr. Václav Cvrček, Ph.D.
Incompatibility: ALF400086, ALF400222, ALF400223
Is incompatible with: ALF400223, ALINV154B, ALINV153B, ALF400222
Annotation -
Last update: Mgr. Michal Křen, Ph.D. (28.05.2018)

The course is aimed primarily at students of Czech studies. Students will get to know the language corpora available at the Czech National Corpus and learn how to use them in their own research. They will also learn to work with the KonText query interface and other web applications in order to search for, identify and interpret language phenomena.

Credit requirements: active participation, test, analysis of a language phenomenon using corpus linguistic methods.
Literature -
Last update: Mgr. Michal Křen, Ph.D. (28.05.2018)

Compulsory reading

Baker, P.: Using Corpora in Discourse Analysis. Continuum, London 2006. (initial chapter)

Čermáková, A.: Valence českých substantiv. Studie z korpusové lingvistiky, volume 9. NLN, Praha 2009. (initial chapter)

Cvrček, V. - Kováříková, D.: Možnosti a meze korpusové lingvistiky. Naše řeč 94/3, 2011 (pp. 113-133).

Recommended reading

Bartoň, T. et al.: Statistiky češtiny. NLN, Praha 2009.

Cvrček, V. et al.: Mluvnice současné češtiny /Grammar of Contemporary Czech/. Karolinum, Praha 2010 (353 pages).

Biber, D. et al.: Corpus Linguistics: Investigating Language Structure and Use. Cambridge University Press, Cambridge 1998.

Oakes, M. P.: Statistics for Corpus Linguistics. Edinburgh University Press, Edinburgh 1998.

Teubert, W. - Krishnamurthy, R. (eds.): Corpus Linguistics Vol. I-VI. Critical Concepts in Linguistics, Routledge 2007.

Syllabus -
Last update: Mgr. Michal Křen, Ph.D. (28.05.2018)

Topics

The course covers the following topics. Each lecturer has an individual approach, so the order of the topics and the emphasis placed on each may vary.

  • What is a corpus; CNC corpora
  • Corpus linguistics
  • Representativeness of written and spoken corpora, register variation
  • Corpus annotation and structure
  • Corpus querying and interpretation of a concordance
  • Frequency analysis
  • Regular expressions and advanced CQL queries
  • Collocation, colligation and semantic prosody
  • Corpus material in the research of individual language layers
  • Basic foundations of data processing (MS Excel, tables and figures)
  • Basic statistics for working with corpora
  • Corpus tools SyD, Morfio, KWords
  • Specialized corpora (Diakorp, InterCorp, author corpora)
  • Designing and carrying out linguistic research based on corpus data
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html