SubjectsSubjects(version: 945)
Course, academic year 2023/2024
   Login via CAS
Numerical Linear Algebra for Data Science and Informatics - NMNV468
Title: Numerical Linear Algebra for data science and informatics
Guaranteed by: Department of Numerical Mathematics (32-KNM)
Faculty: Faculty of Mathematics and Physics
Actual: from 2021
Semester: summer
E-Credits: 5
Hours per week, examination: summer s.:2/2, C+Ex [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: yes / unlimited
Key competences: 4EU+ Flagship 3
State of the course: taught
Language: English
Teaching methods: full-time
Teaching methods: full-time
Additional information: https://dl1.cuni.cz/course/view.php?id=13028
Guarantor: Erin Claire Carson, Ph.D.
Class: M Mgr. NVM
M Mgr. NVM > Volitelné
Classification: Mathematics > Numerical Analysis
Annotation -
Last update: doc. RNDr. Václav Kučera, Ph.D. (12.05.2019)
The goal of this course is to introduce students to underlying concepts of numerical linear algebra which appear in methods for data science and informatics. After an introduction and review of matrix representations of data and basic matrix decompositions, these concepts are illustrated in various applications, including data compression, clustering, classification, and neural networks. The course will also illustrate these concepts via software implementations and an introduction to tools for cluster computing.
Aim of the course
Last update: Erin Claire Carson, Ph.D. (05.02.2020)

The main goal of the course is to understand basic concepts of numerical linear algebra and where such computations arise in data science applications. The focus is on developing an understanding of the mathematical foundation of techniques in informatics and data science and informatics. The goal is also to gain practical experience via basic programming examples and to become familiar with recent research topics in the area.

Course completion requirements
Last update: Erin Claire Carson, Ph.D. (05.02.2020)

Students will complete short assignments given periodically throughout the semester (possibly including, but not limited to, simple programming assignments and written summaries of research articles) as well as a final exam.

Literature -
Last update: Erin Claire Carson, Ph.D. (05.02.2020)

G. Strang, Linear Algebra and Learning From Data, 2019.

A. Blum, J. Hopcroft, and R. Kannan. Foundations of Data Science.

J. Grus. Data Science from Scratch: First Principles with Python, O’Reilly Media, 2015.

G. Strang, Linear Algebra and Its Applications, Thomson/Brooks Cole.

B. Steele, J. Chandler, S. Reddy. Algorithms for Data Science, Springer, 2016.

J. Brownlee. Basics of Linear Algebra for Machine Learning, 2018.

Requirements to the exam
Last update: Erin Claire Carson, Ph.D. (10.01.2022)

The final exam will consist of an oral exam given during the scheduled exam period. In case of distance learning, the exam will be adapted to a distance format.

Syllabus -
Last update: Erin Claire Carson, Ph.D. (05.02.2020)

1. Matrix representations and matrix decompositions

2. Eigenvalue decomposition, least squares regression, singular value decomposition

3. Numerical linear algebra in data science applications

a. principal component analysis, low-rank approximation and compression

b. clustering and classification

c. Pagerank and semantic indexing

d. non-negative matrix decomposition

4. Current research directions and applications

Entry requirements
Last update: Erin Claire Carson, Ph.D. (05.02.2020)

As a preliminary we assume to have basic knowledge of linear algebra as, for example, from the course NMAG101 and experience in programming.

 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html