Project management of NTIS P1 Cybernetic Systems and Department of Cybernetics | WiKKY

Task #4161

Automatic transcription of MONO data

Added by Zajíc Zbyněk about 7 years ago. Updated over 4 years ago.

Status: Closed
Priority: Normal
Start date: 09.02.2017
Due date:
% Done: 70%
Estimated time:

Description

Once the new LM is available, create a transcription of the data.

Actions #1

Updated by Zajíc Zbyněk about 7 years ago

  • Assignee changed from Pražák Aleš to Psutka Josef V.
  • % Done changed from 0 to 70

First results on the mono data: about 27k words, about 3 h of speech. The recordings are 8 kHz mono and the quality is terrible.

LM (vocabulary)   Corr [%]   Acc [%]
1.2M              51.49      44.86
174k              67.33      60.17
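The note does not say how Corr and Acc were computed. A minimal sketch, assuming the usual HTK-style definitions (Corr = (N - D - S) / N, Acc = (N - D - S - I) / N, where N is the number of reference words and D, S, I are deletions, substitutions and insertions); the function name and the error counts below are hypothetical and serve only to illustrate the arithmetic:

```python
def corr_acc(n_ref, deletions, substitutions, insertions):
    """Return (Corr, Acc) in percent, assuming HTK-style definitions."""
    hits = n_ref - deletions - substitutions
    corr = 100.0 * hits / n_ref
    # Acc additionally penalises inserted words, so Acc <= Corr.
    acc = 100.0 * (hits - insertions) / n_ref
    return corr, acc


# Hypothetical counts, not taken from the experiment.
corr, acc = corr_acc(n_ref=1000, deletions=50, substitutions=200, insertions=70)
print(f"Corr = {corr:.2f} %, Acc = {acc:.2f} %")
```

Under that assumption, the gap between the two columns (6.63 and 7.16 percentage points for the 1.2M and 174k models) would correspond to the insertion rate I / N.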

AM:
The first experiment used the low-quality data in which the channels are not separated (the language counselor and the LCC client are both stored in one channel). We applied our recent triphone HMM acoustic model. The basic speech unit was a three-state HMM with a mixture of 32 multivariate Gaussians for each of the 4969 states. The model was trained on 500 hours of varied spontaneous telephone speech, all converted to low quality (8 kHz, μ-law resolution). We used PLP parameterization as our front-end module (19 band-pass filters, 12 cepstral coefficients with delta and delta-delta features, and CMN).
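To make the feature layout concrete, here is a rough sketch of how cepstral mean normalization (CMN) and delta / delta-delta features are commonly derived from 12 static coefficients. It is not the front end used in the experiment: the regression window of 2 frames, the edge-frame replication, and the absence of an energy or c0 term (giving 36 features per frame) are all assumptions, and the function names are hypothetical.

```python
def cmn(frames):
    """Subtract the per-coefficient mean computed over the whole utterance."""
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    return [[f[d] - means[d] for d in range(dim)] for f in frames]


def deltas(frames, window=2):
    """Regression-based deltas; frames at the edges are replicated (assumed)."""
    n = len(frames)
    dim = len(frames[0])
    denom = 2 * sum(k * k for k in range(1, window + 1))
    out = []
    for t in range(n):
        row = []
        for d in range(dim):
            num = 0.0
            for k in range(1, window + 1):
                nxt = frames[min(t + k, n - 1)][d]
                prv = frames[max(t - k, 0)][d]
                num += k * (nxt - prv)
            row.append(num / denom)
        out.append(row)
    return out


def add_dynamic_features(static):
    """Stack normalized statics, deltas and delta-deltas for each frame."""
    norm = cmn(static)
    d1 = deltas(norm)
    d2 = deltas(d1)
    return [s + a + b for s, a, b in zip(norm, d1, d2)]


# Toy input: 5 frames of 12 coefficients forming a linear ramp.
static = [[float(t)] * 12 for t in range(5)]
features = add_dynamic_features(static)
print(len(features), len(features[0]))
```

With 12 static coefficients this yields 36 values per frame; a real PLP front end would compute the static coefficients from the audio first, which is outside the scope of this sketch.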

LM:
1.2M
Our initial ASR system uses a universal trigram back-off language model (LM) with a mixed-case vocabulary of more than 1.2M words. Our training text corpus contains data from newspapers (520 million tokens), web news (350 million tokens), subtitles (200 million tokens) and transcriptions of some TV programs (175 million tokens).
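As an illustration of what "back-off" means for a trigram model, the sketch below looks up log10 P(w | u, v) in a toy model stored the way ARPA-format files lay it out (n-gram log-probabilities plus back-off weights). The storage layout, the function name and all numbers are assumptions for illustration; the note does not describe how the actual LM is stored or queried.

```python
def backoff_logprob(model, u, v, w):
    """log10 P(w | u, v) with back-off to bigram and then unigram estimates.

    Assumes every queried word is in the vocabulary (no OOV handling).
    """
    if (u, v, w) in model["tri"]:
        return model["tri"][(u, v, w)]
    # Trigram unseen: pay the back-off weight of the history (u, v), if any.
    score = model["bo2"].get((u, v), 0.0)
    if (v, w) in model["bi"]:
        return score + model["bi"][(v, w)]
    # Bigram unseen as well: back off once more to the unigram.
    score += model["bo1"].get(v, 0.0)
    return score + model["uni"][w]


# Toy model with made-up log10 values.
toy = {
    "uni": {"a": -1.0, "b": -0.5, "c": -0.7},
    "bo1": {"a": -0.3, "b": -0.2},
    "bi": {("a", "b"): -0.4},
    "bo2": {("a", "b"): -0.1},
    "tri": {("a", "b", "c"): -0.2},
}

print(backoff_logprob(toy, "a", "b", "c"))  # trigram found directly
print(backoff_logprob(toy, "c", "a", "b"))  # backs off to the bigram
print(backoff_logprob(toy, "a", "b", "a"))  # backs off to the unigram
```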

174k
To target the domain better, we trained a domain LM with a vocabulary of 174k words as a standard trigram language model with Kneser-Ney smoothing. This model was trained on the available transcribed LCC data from the language counselor (220 thousand tokens) and the client (180 thousand tokens), and on the email communication (counselor 3.8 million tokens, client 3.4 million tokens).

Actions #2

Updated by Zajíc Zbyněk over 4 years ago

  • Status changed from Assigned to Closed