From SpeechWiki

Revision as of 19:49, 13 October 2008 by Arthur (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

This page links to the various things I've done with the Fisher corpus. It may be helpful for quickly building a basic speech recognizer.

Train/Devel/Test partitions

For all the models and experiments, the entire Fisher corpus into 80/10/10 percent for Train/Devel/Test partitions as follows

The utterance id file is in uttIds.txt And the splits are as follows:

Set	Conversation Sides	Lines in uttIds.txt	Lines in wordPerUtteranceTrans.txt
Training	00001A to 09360B	1 to 1775831	1 to 21718060
Devel	09361A to 10530B	1775832 to 1991965	21718061 to 24482529
Test	10531A to 11699B	1991965 to 2223159	24482530 to 27071554

The experiment infrastructure needs its own page.

The experiments

The goal of these experiments is to explore the utility of using mixed units (phones, syllables and whole words) for large vocabulary speech recognition. These experiments are preformed on the Fisher Corpus.

The phonetic and mixed-unit dictionaries, the language models and the front end used in my pronunciation experiments all have their own pages.

The Fisher Baseline Experiments and Mixed Unit Experiments.

Fisher Corpus

From SpeechWiki

Train/Devel/Test partitions

The experiments

Views

Personal tools

Navigation

Toolbox

Search