Computer Resources

From SpeechWiki

(Difference between revisions)

Revision as of 00:53, 16 June 2008

LVCSR at Illinois Computer Resources

Compute Facilities:
- Cluster Status

Data:
- We distribute the AVICAR corpus via sftp to interested researchers
- We distribute the UASPEECH corpus via sftp to interested researchers
- We are members of LDC. Most LDC data is organized as described in the Data Organization README. Some useful slices of LDC data that have not been moved to ifp-32-2 include:
  - /workspace/fluffy1/12hour - 12 hours extracted from Switchboard-1, with SPHERE and WAV audio, MFCCs, and transcriptions.
  - /workspace/fluffy1/{train-ws96,train-ws97,misc-ws97} - The ICSI phonetically transcribed Switchboard-1 extracts
  - /workspace/fletcher1/bdc - The Boston Directions Corpus; two speakers have prosodic transcriptions, the others do not
  - /workspace/nibbler0/data/ylzheng/WS04/DATA - Tsinghua Wu-accented Mandarin (MFCC and FMT only, no waveforms)

Time-aligned Switchboard Disfluency corpus
- mickey0/sw_disTime-0.9.9 - merged from the original Switchboard time transcription and the Treebank-3 disfluency transcription (TextGrid included)
- mickey0/sw_disTime-1.0.0 (TextGrid NOT included)

Applications

Acoustic model training:
- HTK hidden Markov modeling toolkit: ifp-32-1/hasegawa/programs/htk-3.4
- GMTK Dynamic Bayesian Nets/Graphical Models: nibbler0/speech_apps/GMTK
- Sphinx speech recognizer

Decoding:
- Julius LVCSR decoder
- AT&T DCD LVCSR decoder - nibbler0/speech_apps/dcd-2.0

Language model training:
- SRILM toolkit for large N-gram counts, backoff models, and lattices: fluffy0/programs/srilm
- AT&T FSM Library: fluffy0/programs/fsm-4.0
- OpenFST: fluffy0/programs/OpenFst/

Scoring:
- NIST Speech Tools: ifp-32-1/hasegawa/programs

Support Vector Machines and Neural Networks:
- LIBSVM: fluffy0/programs/svmlib
- svm_light: fluffy0/programs/svm_light
- quicknet: mickey0/quicknet

Spectrograms and Waveform Viewing:
- XKL (MIT): nibbler0/speech_apps/xkl-2.3.1
- ESPS (Entropic Systems, now Microsoft)
- Praat

Backups

If you have personal working directories outside your home directory that should be backed up regularly, list them here.

Art
- mickey0/akantor
- rizzo1/akantor (itself a backup of the svn repository, which cannot be backed up in the normal way)

Sarah
- nibbler0/data
- rizzo0/sborys
- spot1/sborys
- tico0/sborys

Bowon
- mickey1/AVICAR_AUDIO
- mickey1/AVICAR_DATA
- mickey1/AVICAR_DIST
- mickey1/AVICAR_DIST_OLD
- rizzo1/bowonlee
- mickey0/bowonlee

Mital
- mickey0/magandhi

Xiaodan
- spot1/xzhuang2/newbaseline
- spot1/xzhuang2/workshop
- c1-15/hasegawa/xzhuang2*
- /workspace/tico0/AED/

Rajiv
- scratch/rreddy

@@ Line 18: / Line 18: @@
 ==Applications==
-* Hidden Markov Models:
-** HTK (Cambridge): fluffy0/programs/htk-3.3
+* Acoustic model training:
-** DCD (ATT): nibbler0/speech_apps/dcd-2.0
+** [http://www.htk.eng.cam.ac.uk HTK] hidden Markov modeling toolkit: ifp-32-1/hasegawa/programs/htk-3.4
-* Dynamic Bayesian Nets/Graphical Models: nibbler0/speech_apps/GMTK
+** [http://ssli.ee.washington.edu/~bilmes/gmtk/ GMTK] Dynamic Bayesian Nets/Graphical Models: nibbler0/speech_apps/GMTK
-* Language Models: fluffy0/programs/srilm
+** [http://cmusphinx.sourceforge.net Sphinx] speech recognizer
-* Finite State Machines:
-** FSM (ATT): fluffy0/programs/fsm-4.0
+* Decoding:
-** FST (MIT): fluffy0/programs/fst-1.0-RC1 (MIT)
+** [http://julius.sourceforge.jp/en_index.php Julius] LVCSR decoder
-** OpenFST : fluffy0/programs/OpenFst/
+** [http://www.research.att.com/~fsmtools/dcd/ AT&T DCD] LVCSR decoder - nibbler0/speech_apps/dcd-2.0
-* Support Vector Machines:
-** SVMLIB (NJTU): fluffy0/programs/svmlib
+* Language model training:
-** svm_light (Joachims): fluffy0/programs/svm_light
+** [http://www.speech.sri.com/projects/srilm/ SRILM] Big N-gram counts and backoff, lattices: fluffy0/programs/srilm
-** PVTK (UIUC): mickey0/SVM/PVTK
+** [http://www.research.att.com/~fsmtools/fsm/ AT&T FSM Library]: fluffy0/programs/fsm-4.0
-* Neural Nets
+** [http://www.openfst.org OpenFST]: fluffy0/programs/OpenFst/
-** quicknet (ICSI): mickey0/quicknet
+* Scoring
+** [http://www.nist.gov/speech/tools/ NIST Speech Tools]: ifp-32-1/hasegawa/programs
+* Support Vector Machines and Neural Networks
+** [http://www.csie.ntu.edu.tw/~cjlin/libsvm/ libSVMLIB]: fluffy0/programs/svmlib
+** [http://www.cs.cornell.edu/people/tj/svm_light/ svm_light]: fluffy0/programs/svm_light
+** [http://www.icsi.berkeley.edu/Speech/icsi-speech-tools.html quicknet]: mickey0/quicknet
 * Spectrograms and Waveform Viewing
 ** XKL (MIT): nibbler0/speech_apps/xkl-2.3.1
 ** ESPS (Entropic Systems, now Microsoft)
 ** Praat
-* Speech Data File Formats:
-** SPHERE (NIST): fluffy0/programs/sphere
-** sox (linux): /usr/bin/sox
-** HCopy (Cambridge): see HTK
 ==Backups==
