Computer Resources
From SpeechWiki
(Difference between revisions)
(→Applications) |
(→LVCSR at Illinois Computer Resources) |
||
Line 3: | Line 3: | ||
* Compute Facilities: | * Compute Facilities: | ||
** [http://ifp-32.ifp.uiuc.edu/ganglia/ Cluster Status] | ** [http://ifp-32.ifp.uiuc.edu/ganglia/ Cluster Status] | ||
+ | ** [https://portal.teragrid.org/gridsphere/gridsphere Teragrid Portal] | ||
* Data: | * Data: |
Revision as of 03:23, 16 June 2008
LVCSR at Illinois Computer Resources
- Compute Facilities:
- Data:
- We distribute the AVICAR corpus via sftp to interested researchers
- We distribute the UASPEECH corpus via sftp to interested researchers
- We are members of LDC. Most LDC data is organized as described in the Data Organization README. Some useful slices of LDC data that have not been moved to ifp-32-2 include:
- /workspace/fluffy1/12hour - 12 hours extracted from Switchboard 1, with SPHERE and WAV audio, MFCCs, transcriptions.
- /workspace/fluffy1/{train-ws96,train-ws97,misc-ws97} - The ICSI phonetically transcribed Switchboard-1 extracts
- /workspace/fletcher1/bdc - The Boston Directions Corpus, two speakers have prosodic transcriptions, others don't
- /workspace/nibbler0/data/ylzheng/WS04/DATA - Tsinghua Wu-accented Mandarin (MFCC and FMT only, no waveforms)
- Time-aligned Switchboard Disfluency corpus
- mickey0/sw_disTime-0.9.9 - merged from the original Switchboard time transcription and the Treebank-3 disfluency transcription (TextGrid included)
- mickey0/sw_disTime-1.0.0 (TextGrid NOT included)
Applications
- Software created at SST@UIUC
- Acoustic model training:
- HTK hidden Markov modeling toolkit: ifp-32-1/hasegawa/programs/htk-3.4
- GMTK Dynamic Bayesian Nets/Graphical Models: nibbler0/speech_apps/GMTK
- Sphinx speech recognizer
- LIUM speech tools, including speaker segmentation
- Language model training:
- SRILM Big N-gram counts and backoff, lattices: fluffy0/programs/srilm
- AT&T FSM Library: fluffy0/programs/fsm-4.0
- OpenFST: fluffy0/programs/OpenFst/
- Scoring
- NIST Speech Tools: ifp-32-1/hasegawa/programs
- SVMs, NNs, Boosting and such
- libSVM: fluffy0/programs/svmlib
- svm_light: fluffy0/programs/svm_light
- quicknet: mickey0/quicknet
- Boostexter
- Spectrograms and Waveform Viewing
- XKL (MIT): nibbler0/speech_apps/xkl-2.3.1
- ESPS (Entropic Systems, now Microsoft)
- Praat
Backups
If you have personal working directories that should be regularly backed up, outside of your own home directory, list them here.
- Art
- mickey0/akantor
- rizzo1/akantor is itself a backup of svn because it cannot be backed up in the normal way.
- Sarah
- nibbler0/data
- rizzo0/sborys
- spot1/sborys
- tico0/sborys
- Xiaodan
- spot1/xzhuang2/newbaseline
- spot1/xzhuang2/workshop
- c1-15/hasegawa/xzhuang2*
- /workspace/tico0/AED/