|
|
|
||
The main objective of the course is to provide beginner-level digital linguistics students with all the necessary
information on text processing and analysis. Starting with basic topics, such as characteristics of a plain text format and the difference between data and metadata, the course goes on to explain the specifics of XML and different types of text annotation, to introduce the process of tokenization, segmentation and morphological analysis, to describe the limits and possibilities of syntactic and semantic tagging and, finally, to summarize the principles of CQL and corpus querying, including the use of regular expressions and querying parallel corpora. Poslední úprava: Lukešová Lucie, Mgr., Ph.D. (12.07.2018)
|
|
||
Poslední úprava: Lukešová Lucie, Mgr., Ph.D. (12.07.2018)
|
|
||
Poslední úprava: Lukešová Lucie, Mgr., Ph.D. (12.07.2018)
|