Revision as of 17:40, 2 October 2008 by Arthur (Talk | contribs)

The fisher corpus is still relatively new and rough, and this page is to help people quickly build a basic speech recognizer with it.

Train/Devel/Test partition

I've split the entire Fisher corpus into 80/10/10 percent for Train/Devel/Test partitions

The utterance id file is in filelists/uttIds.txt And the splits are as follows:

Dictionaries

There is a lot to say about the Fisher Language Models so they get their own page.