NLP Technology - NPFL092
Title: Technologie pro NLP
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Actual: from 2021
Semester: winter
E-Credits: 5
Hours per week, examination: winter s.:1/2, MC [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: cancelled
Language: English, Czech
Teaching methods: full-time
Teaching methods: full-time
Additional information: https://ufal.mff.cuni.cz/courses/npfl092
Guarantor: doc. Ing. Zdeněk Žabokrtský, Ph.D.
Mgr. Rudolf Rosa, Ph.D.
Class: Informatika Bc.
Informatika Mgr. - Matematická lingvistika
Classification: Informatics > Computer and Formal Linguistics
Incompatibility : NPFL125
Interchangeability : NPFL125
Is incompatible with: NPFL125
Is interchangeable with: NPFL125
Opinion survey results   Examination dates   Schedule   Noticeboard   
Annotation -
Last update: T_UFAL (10.05.2010)
The aim of the course is to get students familiar with basic software tools used in natural language processing.
Course completion requirements -
Last update: Mgr. Rudolf Rosa, Ph.D. (02.10.2018)

To pass the course. you will need to submit homework assignments and do a written test. A brief overview follows; detailed information on grading can be found at https://ufal.mff.cuni.cz/courses/npfl092#grading

Homework assignments

  • Assignments will be set in the class and specified on the website.
  • To pass the course, you need to get at least 50% of the total achievable points for the assignments.
  • If you miss the deadline, there is a second deadline in 2 weeks, but your points for the assignment will be multiplied by 0.5; after the second deadline, you get 0 points.

Test

  • There will be a written test at the end of the semester.
  • To pass the course, you need to get at least 50% of the total points from the test.

Grading

  • Your grade is based on the average of your performance; the test and the homework assignments are weighted 1:1.
  • ≥ 90%: grade 1 (excellent)
  • ≥ 70%: grade 2 (very good)
  • ≥ 50%: grade 3 (good)
  • < 50%: grade 4 (fail)
Literature -
Last update: T_UFAL (05.05.2017)

Mendel Cooper: Advanced Bash-Scripting Guide, http://tldp.org/LDP/abs/html/

Allen B. Downey: Think Python 2e, O'Reilly Media, http://greenteapress.com/wp/think-python-2e/

Steven Bird, Ewan Klein, and Edward Loper: Natural Language Processing with Python, O'Reilly Media, 2015, http://www.nltk.org/book/

Dive into NLTK: http://textminingonline.com/dive-into-nltk-part-i-getting-started-with-nltk

Syllabus -
Last update: T_UFAL (05.05.2017)

Survival in Linux, Bash command line and scripting, text-processing commands.

Introduction to Python, text processing, regular expressions.

Object-oriented interface for processing linguistic structures in Python.

Representing linguistic structures in XML, processing XML in Python.

NLP tools and frameworks, processing morphologically and syntactically annotated data, visualization, search.