Thesis Topics (Thesis Selection)

Improving text-to-speech in spoken dialogue systems by employing user’s feedback

Thesis title in Czech:	Využití uživatelské odezvy pro zvýšení kvality řečové syntézy
Thesis title in English:	Improving text-to-speech in spoken dialogue systems by employing user’s feedback
Keywords (Czech):	syntéza řeči, fonetický slovník, uživatelská odezva, strojové učení, FST, rozpoznávání řeči
Keywords (English):	speech synthesis, phonetic dictionary, user feedback, machine learning, FST, speech recognition
Academic year of topic announcement:	2016/2017
Thesis type:	diploma thesis
Thesis language:	Czech
Department:	Institute of Formal and Applied Linguistics (32-UFAL)
Supervisor:	doc. Ing. Zdeněk Žabokrtský, Ph.D.
Author:	hidden - assigned and confirmed by the Study Department
Date of registration:	27.01.2017
Date of assignment:	27.01.2017
Date of confirmation by the Study Department:	26.04.2017
Date and time of defence:	07.09.2017 09:30
Date of electronic submission:	19.07.2017
Date of printed submission:	21.07.2017
Date of completed defence:	07.09.2017
Opponents:	Mgr. Nino Peterek, Ph.D.



Consultants:	Mgr. Ondřej Plátek

Guidelines

Although spoken dialogue systems have improved greatly, they remain fragile and still cannot handle conversations involving unknown topics. We will investigate methods that improve spoken dialogue systems by correcting, or even learning, the pronunciation of unknown words. This should provide a better user experience, since mispronounced proper nouns, for example, are highly undesirable. Incorrect pronunciation is caused by an imperfect phonetic representation, typically a phonetic dictionary. We aim to detect incorrectly pronounced words by exploiting user feedback together with prior knowledge of the pronunciation, and to correct the transcriptions accordingly. Furthermore, the learned phonetic transcriptions can be used to improve the speech recognition module by refining its models. Models used in speech recognition cannot handle words that are missing from their vocabulary or that lack a phonetic representation. Extracting such words from the user's utterances and adding them to the vocabulary should lead to better overall performance.
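
The last step described above, finding words that are missing from the phonetic dictionary and adding them, can be illustrated with a minimal Python sketch. It is not the method of the thesis, only an illustration under stated assumptions: the toy dictionary, its phoneme symbols, and the helper names find_oov_words and add_pronunciation are all hypothetical, and the utterance is assumed to be available as plain text.

```python
import re

# Hypothetical toy phonetic dictionary: word -> phoneme sequence.
# The entries and the phoneme symbols are illustrative assumptions only.
phonetic_dict = {
    "play": ["p", "l", "ey"],
    "music": ["m", "y", "uw", "z", "ih", "k"],
    "by": ["b", "ay"],
}


def find_oov_words(utterance, dictionary):
    """Return the words of the utterance that the dictionary lacks.

    Words are lowercased, kept in order of first appearance, and
    reported only once.
    """
    # [^\W\d_]+ matches runs of Unicode letters; an inner apostrophe is allowed.
    words = re.findall(r"[^\W\d_]+(?:'[^\W\d_]+)?", utterance.lower())
    seen = set()
    oov = []
    for word in words:
        if word not in dictionary and word not in seen:
            seen.add(word)
            oov.append(word)
    return oov


def add_pronunciation(dictionary, word, phonemes):
    """Store a transcription for a word, e.g. one confirmed by user feedback."""
    dictionary[word.lower()] = list(phonemes)


# Words of this utterance that have no entry in the toy dictionary.
oov = find_oov_words("Play music by Dvořák", phonetic_dict)
print(oov)
```

Once a transcription for such a word has been obtained, for instance from the user's own pronunciation, add_pronunciation stores it so that the word is no longer reported as unknown. How the transcription is obtained and validated is the actual subject of the work and is deliberately left out of this sketch.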

References

Huang, Xuedong, et al. Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR, 2001.
Psutka, Josef, et al. Mluvíme s počítačem česky. 2006.
Pappu, Aasish. Knowledge Discovery Through Spoken Dialog. Diss. Carnegie Mellon University, 2014.
Pappu, Aasish K., and Alexander I. Rudnicky. "Knowledge acquisition strategies for goal-oriented dialog systems." (2014):