Artificial Intelligence for Humanities - NPFL142
Title: Umělá inteligence pro humanitní a společenské vědy
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Actual: from 2023 to 2023
Semester: summer
E-Credits: 4
Hours per week, examination: summer s.:2/2, C+Ex [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
Key competences: data literacy
State of the course: not taught
Language: Czech
Teaching methods: full-time
Teaching methods: full-time
Additional information: https://ufal.mff.cuni.cz/courses/npfl142
Guarantor: doc. Mgr. Barbora Vidová Hladká, Ph.D.
RNDr. Martin Holub, Ph.D.
Opinion survey results   Examination dates   Schedule   Noticeboard   
Annotation -
Artificial Intelligence is a highly topical and growing trend penetrating into various areas of life and most scientific fields, including the humanities and social sciences. This course is a response to the increasing importance of rapidly advancing computer technologies, and it presents the technological foundations of Artificial Intelligence in an understandable way. The course is primarily designed for students in the humanities and social sciences at any level (BSc/MSc/PhD).
Last update: Mírovský Jiří, RNDr., Ph.D. (24.05.2023)
Aim of the course -

During the course, students will acquire theoretical knowledge as well as practical skills necessary for solving practical tasks using available data and Artificial Intelligence methods, particularly in the field of text analysis. To this end, they will learn to use tools implemented in the R software system and independently read technical literature. Upon completion of the course, graduates will have the ability to analyze and process data from various areas of the humanities or social sciences, and use this data for experimenting with Artificial Intelligence.

Last update: Mírovský Jiří, RNDr., Ph.D. (23.05.2023)
Course completion requirements -

The course will be concluded with an exam. Obtaining the course credit is a prerequisite for taking the exam. The credit is awarded for active participation throughout the term and the submission of ongoing homework assignments. Lab session attendance is mandatory. Lecture attendance is, in fact, essential for understanding the content of the lab sessions and completing the assignments.

Last update: Mírovský Jiří, RNDr., Ph.D. (23.05.2023)
Literature -
  • Arnold, Taylor and Lauren Tilton: Humanities Data in R. Exploring Networks, Geospatial Data, Images, and Text. Springer, 2015. [https://link.springer.com/book/10.1007/978-3-319-20702-5]
  • Lantz, Brett: Machine Learning with R. PACKT Publishing. 2013, 2019.
  • Grolemund, Garrett and Hadley Wickham: R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. O'Reilly Media, 2016. [https://r4ds.hadley.nz/]
  • Boehmke, Bradley and Brandon M. Greenwell: Hands-On Machine Learning with R. Chapman & Hall/CRC, 2019. [https://bradleyboehmke.github.io/HOML/]
  • Hvitfeldt, Emil and Julie Silge: Supervised Machine Learning for Text Analysis in R. CRC Press. 2022. [https://smltar.com/]

Last update: Holub Martin, RNDr., Ph.D. (06.06.2023)
Requirements to the exam -

The exam consists of a written and an oral part and we take into account the quality of ongoing homework completions as well. The examination requirements correspond to the course syllabus. More details are available on the course web site.

Last update: Mírovský Jiří, RNDr., Ph.D. (23.05.2023)
Syllabus -

The teaching is conducted through demonstrations of Artificial Intelligence methods on illustrative solutions of intentionally diverse practical tasks. These tasks include automatic authorship recognition, native language identification, text age estimation, predicting the success of advertising campaigns, analyzing texts from social media, conducting shopping cart analysis, analyzing and visualizing citation networks, visualizing image similarities, and various problems in psychometrics. Students are guided towards independent analysis of data sources from the humanities or social sciences and they acquire the knowledge necessary to use Artificial Intelligence methods implemented in the R software system. We particularly focus on the following topics:

Part I - Introduction to Artificial Intelligence methods

General technological principles of Artificial Intelligence and statistical Machine Learning

Historical overview of Artificial Intelligence development from a technological and user perspective

Statistical data analysis

Technologies available for processing textual data

Tools from the tidyverse package in the R software system

Part II - Traditional methods of statistical machine learning

Principles of learning from examples, classification and regression

Use and parameterization of selected learning algorithms

Clustering

Experiment evaluation

Part III - Deep Learning in Neural Networks

Neural Network Architecture

Representation of textual data using embeddings

Training Neural Networks

Last update: Vidová Hladká Barbora, doc. Mgr., Ph.D. (24.05.2023)
Entry requirements -

We expect students to have a willingness to experiment with Artificial Intelligence, including Neural Networks. Prospective participants of this course should have a basic understanding of working with the R system and should possess at least elementary knowledge of systematic data processing and statistical analysis. These prerequisite requirements can be fulfilled by attending the parallel course "Data Processing and Analysis for Humanities" [NPFL143].

Last update: Mírovský Jiří, RNDr., Ph.D. (23.05.2023)