Data On Line

From SpeechWiki

(Difference between revisions)
Jump to: navigation, search
 
(11 intermediate revisions not shown)
Line 1: Line 1:
-
==Speech Corpora and dictionaries that we have developed and are distributing==
+
==Databases Distributed by the Statistical Speech Technology Group==
-
* [http://www.isle.uiuc.edu/AVICAR/home.htm AVICAR] corpus of audio/visual speech recognition in a car environment
+
Our policy: everything we record is distributed for free.
-
* [http://www.isle.uiuc.edu/dict/index.html ISLEdict] A rich dictionary for automatic speech recognition suitable in a wide variety of circumstances
+
* Audiovisual speech is available, through secure ftp, to speech researchers at university or government labs. Contact username avicar at the domain name gmail.com for info.
 +
* Other types of data are posted free, on the web pages listed below.
 +
This page is intended to be the definitive list of data distributed by the SST group, because anybody in the group can edit it to add your own data.
-
==Speech Corpora that we work with==
+
<table border=2><tr>
 +
<tr><td>Audiovisual Speech</td></tr>
 +
<tr><td></td><td>[http://isle.uiuc.edu/sst/data/UASpeech UASPEECH]
 +
Train automatic recognizers of dysarthric speech</td></tr>
 +
<tr><td></td><td>[http://isle.uiuc.edu/sst/AVICAR AVICAR]
 +
100 Talkers, 4 Cameras, 8 Microphones, Moving Car</td></tr>
 +
<tr><td>Dictionaries</td></tr>
 +
<tr><td></td><td>[http://isle.uiuc.edu/sst/data/dict ISLEX]
 +
International Speech Lexicon Project</td></tr>
-
* [[Fisher Corpus]]
+
<tr><td>Audio</td></tr>
 +
<tr><td></td><td>[http://isle.uiuc.edu/sst/data/roomresponses RIR]
 +
Measured Room Impulse Responses</td></tr>
 +
 
 +
<tr><td>MRI</td></tr>
 +
<tr><td></td><td>[http://isle.uiuc.edu/sst/data/mri VMRI:]
 +
5 Talkers, 10 Vowels, Axial and Coronal MR Image Stacks</td></tr>
 +
<tr><td></td><td>
 +
[http://isle.uiuc.edu/sst/research/physiology/coronal_micro Micro-MRI:] Voxel=59x59x49 microns, Human Cadaver Tongue</td></tr>
 +
<tr><td></td><td>
 +
[http://isle.uiuc.edu/sst/research/physiology/histology Micro-MRI:] Histology of the same Human Cadaver Tongue specimen</td></tr>
 +
 
 +
<tr><td>LDC Corpora</td></tr>
 +
<tr><td></td><td>
 +
[[:Category:Fisher Experiments|Fisher]]: Everything you want to know about the Fisher corpus</td></tr>
 +
<tr><td></td><td>
 +
[http://isle.uiuc.edu/sst/research/infograms Infograms:] Mutual information relative to phonetic landmarks (images)</td></tr>
 +
<tr><td></td><td>
 +
[[TIMIT]]: TIMIT files with unusual speech production phenomenon</td></tr>
 +
 
 +
</table>

Latest revision as of 15:23, 26 July 2010

Databases Distributed by the Statistical Speech Technology Group

Our policy: everything we record is distributed for free.

  • Audiovisual speech is available, through secure ftp, to speech researchers at university or government labs. Contact username avicar at the domain name gmail.com for info.
  • Other types of data are posted free, on the web pages listed below.

This page is intended to be the definitive list of data distributed by the SST group, because anybody in the group can edit it to add your own data.

Audiovisual Speech
UASPEECH Train automatic recognizers of dysarthric speech
AVICAR 100 Talkers, 4 Cameras, 8 Microphones, Moving Car
Dictionaries
ISLEX International Speech Lexicon Project
Audio
RIR Measured Room Impulse Responses
MRI
VMRI: 5 Talkers, 10 Vowels, Axial and Coronal MR Image Stacks
Micro-MRI: Voxel=59x59x49 microns, Human Cadaver Tongue
Micro-MRI: Histology of the same Human Cadaver Tongue specimen
LDC Corpora
Fisher: Everything you want to know about the Fisher corpus
Infograms: Mutual information relative to phonetic landmarks (images)
TIMIT: TIMIT files with unusual speech production phenomenon
Personal tools