Project management of NTIS P1 Cybernetic Systems and Department of Cybernetics | WiKKY

Project

General

Profile

Actions

Task #3855

closed

Task #3680: RA4a - Automatic error prediction

Task #3698: Experiment with one-class clasification for join cost enhancements

More data for artefacts collection

Added by Tihelka Dan about 8 years ago. Updated over 7 years ago.

Status:
Postponed
Priority:
Normal
Start date:
06.04.2016
Due date:
10.04.2016
% Done:

0%

Estimated time:

Description

We need more data for listening tests. Especially we need to increase the coverage of rare vowels. Currently we have:

phone total OK artefact
a 78 60 18
e 82 46 36
i 49 30 19
o 92 50 42
u 23 22 1
A 123 17 104
E 4 4 0
I 23 17 6
O 0 0 0
U 4 4 0

We can either try to find additional words in the corpus (shorter, though), or build "artificial" words by joining two halves of words (or words transitions) from the corpus.


Files

prepare_words.py (26.8 KB) prepare_words.py Script to select words of the appropriate length from ASF Tihelka Dan, 06.04.2016 13:39
asf2json_mix.py (5.06 KB) asf2json_mix.py Tihelka Dan, 09.08.2016 15:15
Actions #1

Updated by Tihelka Dan about 8 years ago

Adding (rather messed) script prepare_words.py which was used to select the original list of words used for the listening tests. From the full list, only words starting/ending with unvoiced consonants were used.

Actions #2

Updated by Matoušek Jindřich about 8 years ago

  • Target version changed from RA1: Analysis of artifacts in synthetic speech to RA4: Automatic error prediction and signal modification
Actions #3

Updated by Grůber Martin over 7 years ago

  • Status changed from New to Assigned
  • Assignee changed from Grůber Martin to Tihelka Dan

I would also need a script which is used for word parts combining (and words synthesis) as it will be probably necessary to build "artificial" words.

Actions #4

Updated by Tihelka Dan over 7 years ago

Script asf2json_mix.py should take ASF file with individual word instances and create a set of JSON definitions which can them be passed to TTS scripting to create synthetic words for listening.

Actions #5

Updated by Grůber Martin over 7 years ago

  • Status changed from Assigned to Postponed
Actions

Also available in: Atom PDF