Course, academic year 2023/2024
Work with Language Corpora - ADE110018
Title: Práce s jazykovými korpusy
Guaranteed by: Institute of Germanic Studies (21-UGS)
Faculty: Faculty of Arts
Actual: from 2023
Semester: winter
Points: 0
E-Credits: 3
Examination process: winter s.:
Hours per week, examination: winter s.:1/1, C [HT]
Capacity: unknown / unknown (unknown)
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
Key competences:  
State of the course: not taught
Language: Czech
Teaching methods: full-time
Level:  
Note: course can be enrolled in outside the study plan
enabled for web enrollment
Guarantor: Mgr. Věra Hejhalová, Ph.D.
Interchangeability : ADE100136
Is interchangeable with: ADE100136
Annotation -
Last update: Mgr. Věra Hejhalová, Ph.D. (21.09.2022)
Work with Language Corpora introduces undergraduate students to language corpora and to the ways they can be used in linguistic practice. The course focuses on practical work and prepares students for the Seminar on Corpus Linguistics, which follows in the follow-up master's (NMgr.) programme. Besides introducing the basic types and properties of corpora, the course emphasizes their practical use in a linguist's everyday work. It concentrates on corpora of Czech (ČNK) and German (DeReKo and DWDS), including associated applications such as Kookkurrenzanalyse, CCDB and SOM, or Treq, SyD, Morfio, KWords and WaG. The practical component takes into account students' needs in their research and in writing seminar papers and final theses, including those in other disciplines.
Aim of the course -
Last update: Mgr. Věra Hejhalová, Ph.D. (21.09.2022)

The aim of the course is to familiarize students with the most common language corpora that are relevant to their field of study, to teach them basic work with corpus search engines and applications, and to enable them to gain insight into the possibilities of using corpus data for study and practice.

Course completion requirements -
Last update: Mgr. Věra Hejhalová, Ph.D. (21.09.2022)

Completion: course credit (zápočet)

1) no more than 3 absences

2) continuous preparation for classes

3) active participation in solving assigned tasks

4) a final paper of approximately 3-5 pages presenting an assigned corpus analysis, or a test of practical corpus skills in the last week.

If teaching moves to distance learning because the epidemiological situation worsens:

For the duration of mandated distance learning, the course will be taught in a combined form: online classes will take place in MS Teams at the scheduled time, combined with home preparation using the course in LMS Moodle.

If you choose the final paper, you can submit it remotely (by e-mail) at any time. If you choose the practical test, it will take place remotely (similar to the continuous tasks in Moodle): the test will be assigned and submitted by e-mail on the dates listed in SIS.

Literature -
Last update: Mgr. Věra Hejhalová, Ph.D. (21.09.2022)

Recommended reading:

BUBENHOFER, Noah: Einführung in die Korpuslinguistik: Praktische Grundlagen und Werkzeuge. – Online-Kurs

ČERMÁK, F. et al.: Studie z korpusové lingvistiky, úvod, edice a překlad, Karolinum 2000.

ČERMÁK, F./BLATNÁ, R. (ed.): Studie z korpusové lingvistiky 1, Korpusová lingvistika, stav a modelové přístupy, NLN Praha 2006.

ČERMÁK, F./BLATNÁ, R.: Jak využívat Český národní korpus, NLN Praha 2005.

ČERMÁK, F./ŠULC, M.(ed.): Studie z korpusové lingvistiky 2, Kolokace, NLN Praha 2006.

ČERMÁK, F.: Korpusová lingvistika Praha 2011. 1st ed. Praha: NLN, Nakladatelství Lidové noviny, 2011, 372 pp. ISBN 9788074221149.

ČERMÁK, František. Korpusová lingvistika Praha 2011. 1st ed. Praha: NLN, Nakladatelství Lidové noviny, 2011, 323 pp. ISBN 9788074221156.

KOCEK, J./KOPŘIVOVÁ, M./KUČERA, K. (ed.): Český národní korpus. Úvod a příručka uživatele, Praha 2000.

KOPŘIVOVÁ, Marie: Jak využívat Český národní korpus, Praha 2005.

LEMNITZER, Lothar and Heike ZINSMEISTER. Korpuslinguistik. Eine Einführung. Tübingen: Gunter Narr, 2006, 220 pp. ISBN 3-8233-6210-0.

LÜDELING, Anke / WALTER, Maik: Korpuslinguistik für Deutsch als Fremdsprache. Sprachvermittlung und Spracherwerbsforschung. (PDF, available at https://www.linguistik.hu-berlin.de/de/institut/professuren/korpuslinguistik/mitarbeiter-innen/anke/pdf/LuedelingWalterDaF.pdf)

LÜDELING, Anke and Merja KYTÖ. Corpus linguistics: an international handbook. Berlin: Walter de Gruyter, c2008, xvii, 776 pp. ISBN 9783110180435.

LÜDELING, Anke and Merja KYTÖ. Corpus linguistics: an international handbook. Berlin: Walter de Gruyter, c2008, xvii, pp. 778-1353. ISBN 9783110207330.

PERKUHN, Rainer and Cyril BELICA. Korpuslinguistik - das unbekannte Wesen oder Mythen über Korpora und Korpuslinguistik. In: Sprachreport 1/2006, pp. 2-8. Mannheim: 2006.

PERKUHN, Rainer, Holger KEIBEL and Marc KUPIETZ. Korpuslinguistik. Paderborn: Wilhelm Fink, 2012, 144 pp. ISBN 9783825234331.

PETKEVIČ, Vladimír and Alexandr ROSEN. Korpusová lingvistika Praha 2011. 1st ed. Praha: Nakladatelství Lidové noviny, 2011, 225 pp. ISBN 9788074221163.

ŠEBESTA, K./ŠORMOVÁ, K. et al.: Korpusy v jazykovém vyučování, Praha: FF UK, 2019, free download at https://e-shop.ff.cuni.cz/e_knihy/e_publikace/korpusy_v_jazykovem_vyucovani_pdf_-1761

ŠEBESTA, K./ŠORMOVÁ, K. et al.: Korpusy v jazykovém vyučování II, Praha: FF UK, 2019, free download at https://e-shop.ff.cuni.cz/e_knihy/e_publikace/korpusy_v_jazykovem_vyucovani_ii_pdf_-1908

ŠULC, Michal: Korpusová lingvistika. První vstup, Praha 1999.

Internet links:

www.ids-mannheim.de

https://cosmas2.ids-mannheim.de/cosmas2-web/

http://corpora.ids-mannheim.de/ccdb/

http://ucnk.ff.cuni.cz

https://www.korpus.cz/

Other recommended reading:

ABEL, Andrea/Renata ZANIN: Korpora in Lehre und Forschung. Bozen-Bolzano University Press, Bozen 2011, ISBN 978-88-6046-040-0

BELICA, Cyril and Kathrin STEYER. Korpusanalytische Zugänge zu sprachlichem Usus. In: Beiträge zur bilingualen Lexikographie, M. Vachková (ed.), AUC (Acta Universitatis Carolinae), GERMANISTICA PRAGENSIA XX, Praha: Karolinum, 2008.

BUHOFER, Annelies and Gertrud GRÉCIANO (eds.): Flut von Texten - Vielfalt der Kulturen. Ascona 2001 zur Methodologie und Kulturspezifik der Phraseologie. (=Phraseologie und Parömiologie 14). Baltmannsweiler, pp. 33-46. 2003.

KÁŇA, Tomáš and Hana PELOUŠKOVÁ (eds.): Deutsch und Tschechisch im Vergleich – korpusbasierte linguistische Studien, Brno: Masarykova univerzita, 2009.

KOPŘIVOVÁ, Marie: Využití korpusu při zpracování frazeologie ve výkladovém slovníku. In: Frazeografia słowiańska. 1st ed. Opole: Wydawnictwo Uniwersytetu Opolskiego, 2001.

KRATOCHVÍLOVÁ, Iva and Norbert Richard WOLF (eds.): Kompendium Korpuslinguistik: eine Bestandsaufnahme aus deutsch-tschechischer Perspektive, Heidelberg: Winter, 2010.

KRATOCHVÍLOVÁ, Iva. Zum aktuellen Thema: Die Korpuslinguistik im deutsch-tschechischen Sprachvergleich. In: Acta Facultatis Ostraviensis, Studia Germanistica, Ostrava: Ostravská univerzita v Ostravě, 2006, pp. 35–45.

MOON, Rosamund: Fixed Expressions and Idioms in English: A Corpus-Based Approach. Oxford Studies In Lexicography & Lexicology. Oxford University Press, 1998.

PELOUŠKOVÁ, Hana and Tomáš KÁŇA: Elektronische Korpora in Tschechien und das tschechisch-deutsche Parallelkorpus. In: Planning, Gluing and Painting Corpora, B. Kettemann - G. Marko (eds.), Frankfurt am Main: Peter Lang GmbH, 2006, pp. 27–46.

STEYER, Kathrin: Korpus, Statistik, Kookkurrenz. Lässt sich Idiomatisches "berechnen"? In: Burger, Harald/Häcki Buhofer, Annelies/Gréciano, Gertrud (eds.): Flut von Texten - Vielfalt der Kulturen. Ascona 2001 zur Methodologie und Kulturspezifik der Phraseologie. (=Phraseologie und Parömiologie 14). Baltmannsweiler, pp. 33-46. 2003.

Teaching methods -
Last update: Mgr. Věra Hejhalová, Ph.D. (21.09.2022)

- a combination of theoretical explanation with practical exercises in class and during home preparation

- the possibility to influence the content of the teaching according to the needs of the students

Syllabus -
Last update: Mgr. Věra Hejhalová, Ph.D. (21.09.2022)

1. 3. 10. What is a corpus? What is it used for? How can it be used? Basic types of corpora. Czech and German corpora - comparison (DeReKo, ČNK, InterCorp) in the form of a presentation. Prerequisites and setup for working with corpora (laptops, internet access, registration in COSMAS II and ČNK).

2. 10. 10. Getting to know the corpus search engine environment (COSMAS II) – selecting an archive and a corpus, formulating the simplest queries

3. 17. 10. DeReKo (IDS Mannheim) - COSMAS II - basic functions of the corpus search engine; practicing basic search in COSMAS II

4. 24. 10. Advanced search and its practice in COSMAS II; multi-word expressions

5. 31. 10. Advanced search and its practice in COSMAS II; multi-word expressions II; regular characters; setting options (Optionen)

6. 7. 11. Extended functions offered by COSMAS II; working with tagged corpora

7. 14. 11. COSMAS II - practicing CQL (Corpus Query Language); associated applications – CCDB, SOM, etc.; DWDS

8. 21. 11. Co-occurrence analysis - how to set it up and evaluate the generated data; idioms and collocations - a special task for corpus linguistics

9. 28. 11. Czech National Corpus - basic information, basic search

10. 5. 12. Czech National Corpus - search practice, explanation of extended functions

11. 12. 12. InterCorp; ČNK applications

12. 19. 12. Practical training based on the focus and interests of the course participants

13. 2. 1. Credit test

Learning resources -
Last update: Mgr. Věra Hejhalová, Ph.D. (21.09.2022)

Moodle course https://dl2.cuni.cz/course/view.php?id=3891

 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html