|
|
|
||
This course deals with speech recognition and generation tasks and feature extraction of voice and utterance
characteristics. Of particular interest will be topics related to Hidden Markov Models as applied to speech (FFT, n-
dimensional clustering, Gaussian mixtures, parameter value extraction from data, phonetic representation, prosodic
analysis etc.) and to their DNN-HMM hybrid models. Preparation and training of own speech recognition and
generation models.
Last update: Mírovský Jiří, RNDr., Ph.D. (11.05.2022)
|
|
||
Oral examination and project presentation. The practical part is controlled through the preparation and presentation of own models for speech recognition and generation. The presentation is repeatable. Last update: Peterek Nino, Mgr., Ph.D. (10.06.2019)
|
|
||
Gernot A. Fink, Markov Models for Pattern Recognition, Springer, 2014
Steve Young, Dan Kershaw, Julian Odell, Dave Ollason, Valtcho Valtchev, Phil Woodland, The HTK Book, Cambridge, Entropic Ltd. http://htk.eng.cam.ac.uk, 1995-2007 Zdena Palková, Fonetika a fonologie češtiny, Karolinum, Praha, 1997 Dong Yu,Li Deng, Automatic Speech Recognition A Deep Learning Approach, 2015 NPFL038 Details and News Last update: Peterek Nino, Mgr., Ph.D. (11.05.2022)
|
|
||
Exam covers theoretical part of the course (syllabus), there is only oral exam. Finalisation of practical part is not necessary before the exam. Last update: Peterek Nino, Mgr., Ph.D. (13.10.2017)
|
|
||
Introduction to Speech Production and Perception.
General Principles of Automatic Speech Recognition (HMM)
HTK Tools description
Data Preparation
Creating Monophone HMMs
Creating Triphones HMMs
Recogniser Evaluation.
General Principles of Automatic Speech Generation.
Speech Prosody Analysis. Last update: Peterek Nino, Mgr., Ph.D. (13.10.2017)
|