SubjectsSubjects(version: 964)
Course, academic year 2024/2025
   Login via CAS
Numerical Linear Algebra for Data Science and Informatics - NMNV468
Title: Numerical Linear Algebra for data science and informatics
Guaranteed by: Department of Numerical Mathematics (32-KNM)
Faculty: Faculty of Mathematics and Physics
Actual: from 2021
Semester: summer
E-Credits: 5
Hours per week, examination: summer s.:2/2, C+Ex [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: yes / unlimited
Key competences: 4EU+ Flagship 3
State of the course: taught
Language: English
Teaching methods: full-time
Additional information: https://dl1.cuni.cz/course/view.php?id=13028
Guarantor: Erin Claire Carson, Ph.D.
Teacher(s): Erin Claire Carson, Ph.D.
Class: M Mgr. NVM
M Mgr. NVM > Volitelné
Classification: Mathematics > Numerical Analysis
Annotation -
The goal of this course is to introduce students to underlying concepts of numerical linear algebra which appear in methods for data science and informatics. After an introduction and review of matrix representations of data and basic matrix decompositions, these concepts are illustrated in various applications, including data compression, clustering, classification, and neural networks. The course will also illustrate these concepts via software implementations and an introduction to tools for cluster computing.
Last update: Kučera Václav, doc. RNDr., Ph.D. (12.05.2019)
Aim of the course

The main goal of the course is to understand basic concepts of numerical linear algebra and where such computations arise in data science applications. The focus is on developing an understanding of the mathematical foundation of techniques in informatics and data science and informatics. The goal is also to gain practical experience via basic programming examples and to become familiar with recent research topics in the area.

Last update: Carson Erin Claire, Ph.D. (05.02.2020)
Course completion requirements

Students will complete short assignments given periodically throughout the semester (possibly including, but not limited to, simple programming assignments and written summaries of research articles) as well as a final exam.

Last update: Carson Erin Claire, Ph.D. (05.02.2020)
Literature -

G. Strang, Linear Algebra and Learning From Data, 2019.

A. Blum, J. Hopcroft, and R. Kannan. Foundations of Data Science.

J. Grus. Data Science from Scratch: First Principles with Python, O’Reilly Media, 2015.

G. Strang, Linear Algebra and Its Applications, Thomson/Brooks Cole.

B. Steele, J. Chandler, S. Reddy. Algorithms for Data Science, Springer, 2016.

J. Brownlee. Basics of Linear Algebra for Machine Learning, 2018.

Last update: Carson Erin Claire, Ph.D. (05.02.2020)
Requirements to the exam

The final exam will consist of an oral exam given during the scheduled exam period. In case of distance learning, the exam will be adapted to a distance format.

Last update: Carson Erin Claire, Ph.D. (10.01.2022)
Syllabus -

1. Matrix representations and matrix decompositions

2. Eigenvalue decomposition, least squares regression, singular value decomposition

3. Numerical linear algebra in data science applications

a. principal component analysis, low-rank approximation and compression

b. clustering and classification

c. Pagerank and semantic indexing

d. non-negative matrix decomposition

4. Current research directions and applications

Last update: Carson Erin Claire, Ph.D. (05.02.2020)
Entry requirements

As a preliminary we assume to have basic knowledge of linear algebra as, for example, from the course NMAG101 and experience in programming.

Last update: Carson Erin Claire, Ph.D. (05.02.2020)
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html