SubjectsSubjects(version: 945)
Course, academic year 2023/2024
   Login via CAS
Digital Signal Processing, Speech Analysis and Synthesis - NPFL041
Title: Číslicové zpracování signálu, analýza a syntéza řeči
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Actual: from 2020
Semester: winter
E-Credits: 3
Hours per week, examination: winter s.:1/1, MC [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: cancelled
Language: Czech
Teaching methods: full-time
Teaching methods: full-time
Additional information: http://epos.ure.cas.cz/pfl041/
Guarantor: Petr Horák
Class: Informatika Mgr. - volitelný
Classification: Informatics > Computer and Formal Linguistics
Annotation -
Last update: T_UFAL (15.05.2002)
Introduction to the digital signal processing with the focus on speech processing, speech acoustics, speech analysis methods in time and frequency domains, speech coding, synthesis of the speech signal in time and frequency domains.
Literature - Czech
Last update: RNDr. Pavel Zakouřil, Ph.D. (05.08.2002)

Digital Processing of Speech Signals (Rabiner, Schafer, 78)

An Introduction to Text-to-Speech Synthesis (Dutoit, 96)

Číslicově zpracování signálů (Uhlíř, Sovka 95)

Transformace Z a některá její použití (Vích 83)

Syllabus -
Last update: T_UFAL (15.05.2002)
  • discrete signals and systems, DFT, FFT, Z-transformation, digital filters
  • speech acoustic, speech characteristics in time and frequency domain
  • speech signal analysis in time domain, spectral analysis, pitch detection, pitch contour analysis and synthesis, formants detection

Practice 1 - Speech analysis

  • speech signal analysis in frequency domain, spectral analysis, Fast Fourier Transform (FFT)
  • speech coding, Linear Predictive Coding (LPC), speech analysis and synthesis, advanced speech coding algorithms (RELP, CELP), GSM speech coding algoritms

Practice 2 - Speech coding

  • speech signals synthesis methods, parametric model of speech synthesis, formant synthesis, linear predictive synthesis, cepstral synthesis, harmonic speech modelling, speech synthesis in time domain

Practice 3 - Speech synthesis

  • cepstral analysis, Dynamic Time Warping (DTW) method, Viterbi algorithm, Hidden Markov Models (HMM)
  • speaker verification and recognition, forensic speaker identification
  • automatic speech recognition methods, recognition of isolated words, continuous speech recognition

Practice 4 - Speech recognition

References:

Digital Processing of Speech Signals (Rabiner, Schafer, 78)

An Introduction to Text-to-Speech Synthesis (Dutoit, 96)

Digital signal processing (in Czech) (Uhlíř, Sovka 95)

Z-Transformation and its using (Vích 83)

 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html