Data On Line

From SpeechWiki

(Difference between revisions)
Jump to: navigation, search
(Speech Corpora and dictionaries that we have developed and are distributing)
Line 1: Line 1:
-
==Speech Corpora and dictionaries that we have developed and are distributing==
+
==Databases Distributed by the Statistical Speech Technology Group==
-
These corpora and dictionaries are available via SFTP to interested researchers.
+
 
-
* [http://www.isle.uiuc.edu/AVICAR/home.htm AVICAR] corpus of audio/visual speech recognition in a car environment.
+
Our policy: everything we record is distributed for free. 
-
* [http://www.isle.uiuc.edu/dict/index.html ISLEdict] A rich dictionary for automatic speech recognition suitable in a wide variety of circumstances.
+
* Audiovisual speech is available, through secure ftp, to speech
-
* The [http://asr.cita.uiuc.edu/contacts.php UASPEECH] corpus.
+
researchers at university or government labs.<br>Contact username
 +
avicar at the domain name gmail.com for info.
 +
* Other types of data are posted free, on the web pages listed below.
 +
 
 +
<table border=2><tr>
 +
<tr><td>Audiovisual Speech</td></tr>
 +
<tr><td></td><td><a href="http://www.isle.uiuc.edu/ua/index.html">UASPEECH:</a>
 +
Train automatic recognizers of dysarthric speech</td></tr>
 +
<tr><td></td><td><a href="http://www.isle.uiuc.edu/AVICAR/home.htm">AVICAR:</a>
 +
100 Talkers, 4 Cameras, 8 Microphones, Moving Car</td></tr>
 +
 
 +
<tr><td>Dictionaries</td></tr>
 +
<tr><td></td><td>
 +
<a href="http://www.isle.uiuc.edu/dict/index.html">ISLEX:</a>
 +
International Speech Lexicon Project</td></tr>
 +
 
 +
<tr><td>Audio</td></tr>
 +
<tr><td></td><td>
 +
<a href="http://www.isle.uiuc.edu/virtualaudio/capture/index.html">RIR:</a>
 +
Measured Room Impulse Responses</td></tr>
 +
 
 +
<tr><td>MRI</td></tr>
 +
<tr><td></td><td><a href="http://www.isle.uiuc.edu/mri/index.html">VMRI:</a>
 +
5 Talkers, 10 Vowels, Axial and Coronal MR Image Stacks</td></tr>
 +
<tr><td></td><td>
 +
<a href="http://www.isle.uiuc.edu/physiology/alphabet/index.html">
 +
Alphabet:</a> 1 Talker reciting the alphabet</td></tr>
 +
<tr><td></td><td>
 +
<a href="http://www.isle.uiuc.edu/physiology/coronal_micro/index.html">
 +
Micro-MRI:</a>
 +
Voxel=59x59x49 microns, Human Cadaver Tongue</td></tr>
 +
 
 +
<tr><td>Data Analysis</td></tr>
 +
<tr><td></td><td>
 +
<a href="http://mickey.ifp.uiuc.edu/speechWiki/index.php/Fisher_Corpus">
 +
Fisher:</a>
 +
Everything you want to know about the Fisher corpus</td></tr>
 +
<tr><td></td><td>
 +
<a href="http://www.isle.uiuc.edu/recognition/infogram/index.html">
 +
Infograms:</a>
 +
Mutual information relative to phonetic landmarks (images)</td></tr>
 +
<tr><td></td><td>
 +
<a href="http://mickey.ifp.uiuc.edu/speechWiki/index.php/TIMIT">
 +
TIMIT:</a>
 +
TIMIT files with unusual speech production phenomenon</td></tr>
 +
 
 +
</table>
==Other speech corpora that we work with==
==Other speech corpora that we work with==

Revision as of 15:43, 4 November 2008

Databases Distributed by the Statistical Speech Technology Group

Our policy: everything we record is distributed for free.

  • Audiovisual speech is available, through secure ftp, to speech

researchers at university or government labs.
Contact username avicar at the domain name gmail.com for info.

  • Other types of data are posted free, on the web pages listed below.
Audiovisual Speech
<a href="http://www.isle.uiuc.edu/ua/index.html">UASPEECH:</a> Train automatic recognizers of dysarthric speech
<a href="http://www.isle.uiuc.edu/AVICAR/home.htm">AVICAR:</a> 100 Talkers, 4 Cameras, 8 Microphones, Moving Car
Dictionaries

<a href="http://www.isle.uiuc.edu/dict/index.html">ISLEX:</a>

International Speech Lexicon Project
Audio

<a href="http://www.isle.uiuc.edu/virtualaudio/capture/index.html">RIR:</a>

Measured Room Impulse Responses
MRI
<a href="http://www.isle.uiuc.edu/mri/index.html">VMRI:</a> 5 Talkers, 10 Vowels, Axial and Coronal MR Image Stacks

<a href="http://www.isle.uiuc.edu/physiology/alphabet/index.html">

Alphabet:</a> 1 Talker reciting the alphabet

<a href="http://www.isle.uiuc.edu/physiology/coronal_micro/index.html"> Micro-MRI:</a>

Voxel=59x59x49 microns, Human Cadaver Tongue
Data Analysis

<a href="http://mickey.ifp.uiuc.edu/speechWiki/index.php/Fisher_Corpus"> Fisher:</a>

Everything you want to know about the Fisher corpus

<a href="http://www.isle.uiuc.edu/recognition/infogram/index.html"> Infograms:</a>

Mutual information relative to phonetic landmarks (images)

<a href="http://mickey.ifp.uiuc.edu/speechWiki/index.php/TIMIT"> TIMIT:</a>

TIMIT files with unusual speech production phenomenon

Other speech corpora that we work with

Personal tools