Project management of NTIS P1 Cybernetic Systems and Department of Cybernetics | WiKKY

Project

General

Profile

Bibliografické údaje

Název Automatic Pitch-Synchronous Phonetic Segmentation with Context-Independent HMMs
Autor Matoušek, J.
Typ publikace Článek v časopise, odborném periodiku
Periodikum Lecture Notes in Artificial Intelligence: Text, Speech and Dialogue
Nakladatel Springer / Berlin, Heidelberg
Svazek 5729
Strana 178-185
Rok 2009
ISBN 978-3-642-04207-2
ISSN 0302-9743

Detail, PDF

Abstrakt

This paper deals with an HMM-based automatic phonetic segmentation (APS) system. In particular, the use of a pitch-synchronous (PS) coding scheme within the context-independent (CI) HMM-based APS system is examined and compared to the "more traditional'' pitch-asynchronous (PA) coding schemes for a given Czech male voice. For bootstrap-initialised CI-HMMs, exploited when some (manually) pre-segmented data are available, the proposed PS coding scheme performed best, especially in combination with CART-based refinement of the automatically segmented boundaries. For flat-start-initialised CI-HMMs, an inferior initialisation method used when no pre-segmented data are at disposal, standard PA coding schemes with longer parameterization shifts yielded better results. The results are also compared to the results obtained for APS systems with context-dependent (CD) HMMs. It was shown that, at least for the researched male voice, multiple-mixture CI-HMMs outperform CD-HMMs in the APS task.