Computer Resources

From SpeechWiki

Revision as of 20:27, 3 September 2006 by Mark Hasegawa-Johnson (Talk | contribs)

LVCSR at Illinois Computer Resources

This page has two main purposes: (1) to help local users keep track of databases, since disk management is decentralized and therefore messy; and (2) to specify which directories are archival and which are temporary.

Directories listed below are accessible only to ISLE/IFP registered users.

Databases

These directories have been backed up at least once. They will not necessarily be backed up again unless Paritosh is specifically asked to do so. All paths are relative to the /workspace mount point. The data below are sorted first by channel (telephone, broadband, audiovisual, biomedical), and second by language.

  • English Telephone Speech
    • {helmholtz1/Switchboard,fluffy1/switchboard-1} - Switchboard 1 audio, orthographic transcriptions, and e-speech prosody predictions.
    • fluffy1/12hour - 12 hours extracted from Switchboard 1, with SPHERE and WAV audio, MFCCs, transcriptions.
    • fluffy1/{train-ws96,train-ws97,misc-ws97} - The ICSI phonetically transcribed Switchboard-1 extracts
    • spot0/switchboard-2 - Switchboard-2, similar to Switchboard-1 but larger and with better U.S. dialect coverage
    • fluffy1/{hub5,hub5_eval} - The NIST hub5 training and test data
    • mickey0/english_callhome - Conversations among family and friends
    • fluffy1/ntimit-{train,test} - The NTIMIT read speech corpus, passed through a telephone channel
  • Chinese Telephone Speech
    • mickey0/Mandarin_callhome
  • Spanish Telephone Speech
    • nibbler0/speech_data/Spanish_callhome
  • Hindi Telephone Speech
    • nibbler0/speech_data/Hindi_callfriend
  • English Broadband Speech
    • mickey1/BN97 - Broadcast News
    • mickey1/HUB5E_98 - HUB5 NIST competition Broadcast News data
    • tidigits - Isolated Digits and Phone Numbers
    • fletcher1/kidspeech - Recordings of Children 8, 9, 10 years old
    • Radio_Speech_Corpus - The Boston University Radio Speech Corpus, seven speakers with prosodic transcription
    • fletcher1/bdc - The Boston Directions Corpus; only two of the speakers have prosodic transcriptions
  • Chinese Broadband Speech
    • nibbler1/MBN - Mandarin Broadcast News
    • nibbler0/data/ylzheng/WS04/DATA - Tsinghua Wu-accented Mandarin (MFCC and FMT only, no waveforms)
  • Audiovisual and Multimicrophone Data
    • fluffy0/data/{icsi_mr,isl_meeting}_transcr - Transcriptions of meeting room data from ISL, ICSI. Audio and (remote camera) video available but not online
    • mickey1/AVICAR_DIST - 4-camera, 8-microphone recording of read sentences, phone numbers, and isolated letters in a moving car
    • rizzo1/speech_hearing - 7-microphone recordings of isolated words produced by talkers with dysarthria
  • Biomedical Image Data
    • mickey0/UW_XRAY_MICROBEAM - point tracking data, 100Hz sampling, obtained using X-ray microbeam
    • {speech_web/mri,http://www.isle.uiuc.edu/mri} - a large amount of 3D MRI of vowels, a small amount of fast 2D MRI of the alphabet, and a small amount of high-resolution 3D MRI of excised tongues
  • Text
    • rizzo0/treebank - The Penn Treebank syntactically parsed corpus
  • Time-aligned Switchboard Disfluency corpus
    • mickey0/sw_disTime-0.9.9 - merged from the original Switchboard time transcription and the Treebank-3 disfluency transcription (TextGrid included)
    • mickey0/sw_disTime-1.0.0 (TextGrid NOT included)
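
Several entries above use shell-style brace notation, such as fluffy1/{hub5,hub5_eval}, to name more than one directory at once. The following is a minimal sketch of how such an entry could be turned into absolute paths under /workspace. The function names expand and resolve are hypothetical helpers, not part of any tool listed on this page, and the sketch assumes braces are never nested.

```python
import re
from posixpath import join

WORKSPACE = "/workspace"  # mount point stated on this page

# Matches one innermost {a,b,...} group (assumption: no nested braces).
_BRACES = re.compile(r"\{([^{}]*)\}")


def expand(entry):
    """Expand every {a,b} group in a listing entry into separate strings."""
    match = _BRACES.search(entry)
    if match is None:
        return [entry]
    head, tail = entry[:match.start()], entry[match.end():]
    results = []
    for alternative in match.group(1).split(","):
        # Recurse so that any later brace groups in the tail are expanded too.
        results.extend(expand(head + alternative + tail))
    return results


def resolve(entry, root=WORKSPACE):
    """Return absolute paths for an entry; URLs are passed through unchanged."""
    return [p if "://" in p else join(root, p) for p in expand(entry)]


if __name__ == "__main__":
    for path in resolve("fluffy0/data/{icsi_mr,isl_meeting}_transcr"):
        print(path)
```

Entries that mix a directory with a URL, such as the MRI entry, keep the URL untouched and prefix only the directory.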

Applications

  • Hidden Markov Models:
    • HTK (Cambridge): fluffy0/programs/htk-3.3
    • DCD (ATT): nibbler0/speech_apps/dcd-2.0
  • Dynamic Bayesian Nets/Graphical Models: nibbler0/speech_apps/GMTK
  • Language Models: fluffy0/programs/srilm
  • Finite State Machines:
    • FSM (ATT): fluffy0/programs/fsm-4.0
    • FST (MIT): fluffy0/programs/fst-1.0-RC1
  • Support Vector Machines:
    • SVMLIB (NJTU): fluffy0/programs/svmlib
    • svm_light (Joachims): fluffy0/programs/svm_light
    • PVTK (UIUC): mickey0/SVM/PVTK
  • Spectrograms and Waveform Viewing
    • XKL (MIT): nibbler0/speech_apps/xkl-2.3.1
    • ESPS (Entropic Systems, now Microsoft)
    • Praat
  • Speech Data File Formats:
    • SPHERE (NIST): fluffy0/programs/sphere
    • sox (linux): /usr/bin/sox
    • HCopy (Cambridge): see HTK
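
Before converting a SPHERE file with one of the tools above, it can help to inspect its header to see the sample rate and coding. The sketch below assumes the usual NIST SPHERE layout: a first line reading NIST_1A, a second line giving the header size in bytes, then lines of the form "name -type value" up to an end_head line. The function name parse_sphere_header is hypothetical, and the sketch does not decode any audio.

```python
def parse_sphere_header(raw):
    """Parse the text header of a SPHERE file given its leading bytes.

    Assumed layout: 'NIST_1A', then the header size in bytes, then
    'name -type value' lines, terminated by 'end_head'.
    """
    first_lines = raw.split(b"\n", 2)
    if len(first_lines) < 3 or first_lines[0].strip() != b"NIST_1A":
        raise ValueError("data does not start with a NIST_1A header")
    header_size = int(first_lines[1].decode("ascii").strip())
    header = raw[:header_size].decode("ascii", errors="replace")

    fields = {}
    for line in header.split("\n")[2:]:
        line = line.strip()
        if line == "end_head":
            break
        parts = line.split(None, 2)
        # Skip anything that does not look like 'name -type value'.
        if len(parts) != 3 or not parts[1].startswith("-"):
            continue
        name, kind, value = parts
        if kind == "-i":
            fields[name] = int(value)
        elif kind == "-r":
            fields[name] = float(value)
        else:  # '-sN' string fields are kept as text
            fields[name] = value
    return fields


if __name__ == "__main__":
    import sys

    # Usage: python sphere_header.py some_file.sph
    with open(sys.argv[1], "rb") as handle:
        # Assumption: headers are small, so the first 4096 bytes suffice.
        for key, val in parse_sphere_header(handle.read(4096)).items():
            print(key, val)
```

The parser works on bytes rather than a file name so that it can be checked without touching the corpus directories.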

Backups

If you have personal working directories outside your own home directory that should be backed up regularly, list them here.

  • Art
    • mickey0/akantor
    • rizzo1/akantor - itself a backup of svn, because svn cannot be backed up in the normal way.
  • Sarah
    • nibbler0/data
    • rizzo0/sborys
    • spot1/sborys
    • tico0/sborys
  • Bowon
    • mickey1/AVICAR_AUDIO
    • mickey1/AVICAR_DATA
    • mickey1/AVICAR_DIST
    • mickey1/AVICAR_DIST_OLD
    • rizzo1/bowonlee
    • mickey0/bowonlee
  • Mital
    • mickey0/magandhi
  • Xiaodan
    • spot1/xzhuang2
  • Rajiv
    • scratch/rreddy