SubjectsSubjects(version: 945)
Course, academic year 2023/2024
   Login via CAS
Data Organisation and Processing II - NDBI003
Title: Organizace a zpracování dat II
Guaranteed by: Department of Software Engineering (32-KSI)
Faculty: Faculty of Mathematics and Physics
Actual: from 2014
Semester: summer
E-Credits: 5
Hours per week, examination: summer s.:2/1, C+Ex [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: cancelled
Language: Czech
Teaching methods: full-time
Teaching methods: full-time
Additional information: http://siret.ms.mff.cuni.cz/members/hoksza/lectures/ndbi003
Guarantor: doc. RNDr. David Hoksza, Ph.D.
Class: Informatika Mgr. - Softwarové systémy
Classification: Informatics > Database Systems
Annotation -
Last update: T_KSI (15.04.2003)
Spatial databases - their purpose, differences to relational db; rd-trees, space representations, data structures for storing point objects, data structures usable also for more complex objects, spatial join. Textual databases - inverted file, lemmatization; Term count reduction, Zipf's law; signature methods. Data compression - purpose, basic notions, integer coding, symbol coding methods, basic dictionary methods, index compression and compaction. Semistructured document indexing. Web indexing. Object persistency.
Literature - Czech
Last update: T_KSI (15.04.2003)

Pokorný, J.: Základy implementace souborů a databází. Skripta UK, Vydavatelství Karolinum, 1997.

Pokorný, J., Žemlička, M.: Základy implementace souborů a databází. Skripta UK, Vydavatelství Karolinum, 2003. 2. uprav. vydání.

Syllabus -
Last update: T_KSI (15.04.2003)

1. Spatial databases - services, differences to relational db; rd-trees, space representations (naive, spiral, z-ordering; path and width ordering).

2. Spatial databases - data structures for storing point objects (B-cubes, k-d-trees, buddy-trees).

3. Spatial databases - data structures usable also for more complex objects (R-trees, R*-trees).

4. Spatial databases - spatial join.

5. Textual databases - introduction: inverted file, lemmatization - what is it and trivial implementation; Term count reduction, Zipf's law.

6. Text databases - signature methods (document signature and query signature, superimposed coding, S-trees, multilevel signatures).

7. Data compression - purpose, basic notions, integer coding.

8. Data compression - symbol coding methods (Shannon-Fano, Huffman, arithmetic coding).

9. Data compression - basic dictionary methods 1 (LZ77,LZ78).

10. Data compression - basic dictionary methods 2 (LZW,BSTW), compression and compaction of indexes.

11. Semistructured document indexing.

12. Web indexing.

13. Object persistency.

 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html