Task #4161
Closed: Automatic transcription of MONO data
70%
Description
Once the new LM is ready, create a transcription of the data.
Updated by Zajíc Zbyněk almost 8 years ago
- Assignee changed from Pražák Aleš to Psutka Josef V.
- % Done changed from 0 to 70
First results on the mono data: ~27k words, ~3 hours of speech; it is 8 kHz mono of dreadful quality.
LM (vocabulary) | Corr [%] | Acc [%] |
1.2M            | 51.49    | 44.86   |
174k            | 67.33    | 60.17   |
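The Corr and Acc columns above are presumably the usual word-level recognition measures, Corr = (N - S - D)/N and Acc = (N - S - D - I)/N, where N is the number of reference words and S/D/I are substitutions, deletions and insertions from a minimum-edit-distance alignment. A minimal sketch (helper names are mine, not from the ticket):

```python
# Sketch: word-level Corr[%] and Acc[%] from a Levenshtein alignment.
# Assumed definitions: Corr = (N-S-D)/N, Acc = (N-S-D-I)/N.

def align_counts(ref, hyp):
    """Return (S, D, I) for a minimum-edit-distance word alignment."""
    n, m = len(ref), len(hyp)
    # dp[i][j] = (cost, S, D, I) for ref[:i] vs hyp[:j]; ties broken
    # lexicographically (fewest substitutions first).
    dp = [[(0, 0, 0, 0)] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        p = dp[i - 1][0]
        dp[i][0] = (p[0] + 1, p[1], p[2] + 1, p[3])          # deletions
    for j in range(1, m + 1):
        p = dp[0][j - 1]
        dp[0][j] = (p[0] + 1, p[1], p[2], p[3] + 1)          # insertions
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = 0 if ref[i - 1] == hyp[j - 1] else 1
            diag, up, left = dp[i - 1][j - 1], dp[i - 1][j], dp[i][j - 1]
            dp[i][j] = min(
                (diag[0] + s, diag[1] + s, diag[2], diag[3]),    # match/sub
                (up[0] + 1, up[1], up[2] + 1, up[3]),            # deletion
                (left[0] + 1, left[1], left[2], left[3] + 1),    # insertion
            )
    _, S, D, I = dp[n][m]
    return S, D, I

def corr_acc(ref, hyp):
    S, D, I = align_counts(ref, hyp)
    N = len(ref)
    return 100.0 * (N - S - D) / N, 100.0 * (N - S - D - I) / N
```

Acc can go below Corr (and even negative) because insertions are penalized only in Acc, which is consistent with the gap between the two columns in the table.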
AM:
The first experiment used the low-quality data without channel separation (both the language counselor and the LCC client stored in one channel). We applied our recent triphone HMM acoustic model. The basic speech unit was a three-state HMM with 32 multivariate-Gaussian mixture components for each of the 4969 states. The model was trained on roughly 500 hours of spontaneous telephone speech, all converted to low quality (8 kHz, $\mu$-law resolution). The front-end was PLP parameterization (19 band-pass filters, 12 cepstral coefficients with delta and delta-delta features, and CMN).
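The delta/delta-delta and CMN steps of the front-end can be sketched as follows; the PLP filterbank analysis itself is omitted, `cep` stands for a (frames x 12) matrix of static cepstral coefficients, and the window width K=2 and the regression formula are common defaults I am assuming, not details stated in the ticket:

```python
import numpy as np

def deltas(feat, K=2):
    """Regression deltas: d_t = sum_k k*(c_{t+k} - c_{t-k}) / (2*sum_k k^2),
    with edge padding at utterance boundaries."""
    T = feat.shape[0]
    padded = np.pad(feat, ((K, K), (0, 0)), mode="edge")
    out = np.zeros_like(feat, dtype=float)
    for k in range(1, K + 1):
        out += k * (padded[K + k:K + k + T] - padded[K - k:K - k + T])
    return out / (2 * sum(k * k for k in range(1, K + 1)))

def cmn(feat):
    """Cepstral mean normalization: subtract the per-utterance mean."""
    return feat - feat.mean(axis=0)

def frontend(cep):
    """12 static cepstra -> 36-dim vectors (statics + deltas + delta-deltas),
    then CMN over the utterance."""
    d = deltas(cep)
    return cmn(np.hstack([cep, d, deltas(d)]))
```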
LM:
1.2M
Our initial ASR system uses a universal trigram back-off language model (LM) with a mixed-case vocabulary of more than 1.2M words. The training text corpus contains data from newspapers (520 million tokens), web news (350 million tokens), subtitles (200 million tokens), and transcriptions of selected TV programs (175 million tokens).
174k
To better match the domain, we trained a domain-specific LM with a 174k-word vocabulary as a standard trigram model with Kneser-Ney smoothing. It was trained on the available transcribed LCC data of the language counselor (220 thousand tokens) and the client (180 thousand tokens), and on the e-mail communication (counselor 3.8 million tokens, client 3.4 million tokens).
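For illustration, interpolated Kneser-Ney smoothing discounts observed n-gram counts by a constant and backs off to a continuation probability (how many distinct contexts a word follows), rather than to raw unigram frequency. A toy bigram version with a fixed discount, a simplification of the trigram model actually used:

```python
from collections import Counter

def train_kn_bigram(sentences, d=0.75):
    """Toy interpolated Kneser-Ney bigram model with a fixed discount d.
    P(w|v) = max(c(v,w)-d, 0)/c(v) + lambda(v) * P_cont(w)."""
    unigram, bigram = Counter(), Counter()
    for sent in sentences:
        toks = ["<s>"] + list(sent) + ["</s>"]
        unigram.update(toks)
        bigram.update(zip(toks, toks[1:]))
    # Continuation count: number of distinct contexts each word follows.
    cont = Counter(w for (_, w) in bigram)
    # Number of distinct words following each context (for lambda).
    followers = Counter(v for (v, _) in bigram)
    total_types = len(bigram)

    def prob(w, v):
        p_cont = cont[w] / total_types
        if unigram[v] == 0:
            return p_cont            # unseen context: pure continuation prob
        disc = max(bigram[(v, w)] - d, 0.0) / unigram[v]
        lam = d * followers[v] / unigram[v]
        return disc + lam * p_cont
    return prob
```

The discounted mass plus the interpolation weight sum to one over the vocabulary for any seen context, which is the property that makes the model a proper distribution.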