Software
From SpeechWiki
(→Statistical Speech Technology Group Software) |
|||
Line 9: | Line 9: | ||
<table border=2><tr> | <table border=2><tr> | ||
+ | <tr><td>Scripts</td></tr> | ||
+ | <tr><td>Scripts</td><td>miscellaneous perl, python, bash, and ruby<br> | ||
+ | [svn://mickey.ifp.uiuc.edu/scripts SVN archive], | ||
+ | [Scripts_Documentation Documentation]</td></tr> | ||
<tr><td>Learning</td></tr> | <tr><td>Learning</td></tr> | ||
<tr><td>Pronounce</td><td>Letters to phones using an HMM<br> | <tr><td>Pronounce</td><td>Letters to phones using an HMM<br> |
Revision as of 21:41, 12 January 2009
Statistical Speech Technology Group Software
Our policy: everything we write is free on the web. This wiki page is intended to be definitive, because anybody in the group can edit it to add their own software. [1] provides a spider-indexable backup copy.
All of our software is available via <a href="http://subversion.tigris.org/">subversion</a>, using login name "anon" with no password (hit "enter" when a password is requested).
- If you are using Windows, download TortoiseSVN
- If you are using linux, use the svn command interface, e.g., svn co svn://mickey.ifp.uiuc.edu/speechfileformats
Scripts | |
Scripts | miscellaneous perl, python, bash, and ruby [Scripts_Documentation Documentation] |
Learning | |
Pronounce | Letters to phones using an HMM SVN archive (Arthur Kantor, 2007) |
HDK | HTK-based Explicit-duration HMM Description, TGZ archive, SVN repository (Ken Chen, 2003) |
HTKtrain | Scripts for training HMMs using HTK SVN repository (Sarah Borys and Mark Hasegawa-Johnson, 2008) |
Signal Processing | |
PVTK | Extract HTK features as training vecs for libSVM, apply trained SVMs directly to feature files TGZ archive, SVN repository (Sarah Borys and MH 2005-8) |
VAD | Voice activity detector w/improved noise model lee_vad.m, SVN repository (Bowon Lee, 2007) |
Computation | |
GMTK Parallel | Split GMTK commands into batch jobs for a cluster SVN repository (Arthur Kantor, 2008) |
HTK Parallel |
Split an HTK command into batch jobs for a cluster (Bowon Lee, 2006) |
Data | |
dtmfseg | Segment audio files at DTMF tones SVN repository (Bowon Lee, 2006) |
transcription tools | Convert transcription formats TGZ archive, SVN repository (Mark Hasegawa-Johnson, 2005) |
speechfileformats | Read and write HTK files in matlab TGZ archive, SVN repository (Mark Hasegawa-Johnson, 2004) |
CTMRedit | Manually and automatically segment CT and MR image stacks Description, SVN repository (Jul Cha and MH 1999) |
improved MVA | Perform mean and variance normalization and ARMA filtering It's essentially the version found here, with these improvements:
source: svn://mickey.ifp.uiuc.edu/corporaNormalizationScripts/fisher/MVA.cc binary: http://mickey.ifp.uiuc.edu/speech/akantor/fisher/programs/bin.Linux/MVA |
Phonetic Transcription Tool
This tool gives the American English phonetic transcription of any string. It uses an HMM model to either generate a most likely phonetic transcription, or if a phonetic transcription is provided, it can perform forced alignment. So, it gives a reasonable pronounciation for out-of-dictionary words, or partially pronounced words.
Phonetic Transcription Tool. You can try out the demo here.
Perl scripts for parallel processing of HTK commands
Perl scripts for parallel processing of four HTK commands (HCopy, HERest, HVite, and HResults) are available.
1. HCopy.pl
2. HERest.pl
3. HVite.pl
4. HResults.pl
Those scripts use the SGE (Sun Grid Engine) for job queuing.
Detailed information about the above Perl scripts can be found here.
Brief introduction about the SGE can be found here
Written by Bowon Lee, 02/24/2006