Software

From SpeechWiki

(Difference between revisions)

Revision as of 04:12, 19 March 2010

Learning

Pronounce (Arthur Kantor, 2007): A tool that maps orthographic strings to phonetic strings. It computes American English phonetic transcriptions from plain text. Its HMM either generates the most likely phonetic transcription or, if a phonetic transcription is provided, performs forced alignment, so it gives a reasonable pronunciation for both out-of-dictionary words and partially pronounced words; Description, Demo; SVN archive

HTK-based Explicit-duration HMM (Ken Chen, 2003): Description, TGZ archive; SVN repository

HTKtrain: Scripts for training HMMs using HTK (Sarah Borys and Mark Hasegawa-Johnson, 2008); SVN repository

Signal Processing

PVTK (Sarah Borys and MH 2005-8): Extract HTK features as training vectors for libSVM, apply trained SVMs directly to feature files; TGZ archive; SVN repository

VAD (Bowon Lee, 2007): Voice activity detector with improved noise model; Description, lee_vad.m; SVN repository

Nested STFTs (Dave Cohen, Camille Goudeseune, Mark Hasegawa-Johnson 2009): Efficient Simultaneous Multi-Scale Computation of FFTs; Description, stft.c

Improved Mistral (Qingsong Liu 2009)

State-of-the-art text-independent speaker verification system, especially for NIST SRE

Based on Mistral Open Source package

Improved and New Features:

Adds full factor analysis (eigenchannel and eigenvoice) instead of simple factor analysis (eigenchannel)
Adds multithreading on Windows as well as Linux
Supports reading HTK-format features and models
Adds an effective algorithm for a fast implementation of FA
Optimizes the FA code
Fixes some bugs

Source: /ws/ifp-32-2/hasegawa/pineking/programs/Improved_Mistral

Computation

GMTK Parallel (Arthur Kantor, 2008): Split GMTK commands into batch jobs for a cluster; Description; SVN repository

HTK Parallel (Bowon Lee, 2006): Perl scripts that split an HTK command for parallel execution on an SGE cluster; Description, HCopy.pl, HVite.pl, HERest.pl, HResults.pl; SVN repository
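The splitting idea behind such scripts can be sketched as follows. This is a minimal Python illustration, not the Perl code from the repository: it assumes the work list is an HTK script file of paths, divides it into contiguous chunks, and builds one HCopy command per chunk using HTK's standard `-C` (config) and `-S` (script file) options. The function names and the `part<N>.scp` and `config.cfg` file names are hypothetical, and job submission to the cluster is left out.

```python
def split_evenly(items, n_jobs):
    """Split items into at most n_jobs contiguous chunks of near-equal size."""
    if n_jobs < 1:
        raise ValueError("n_jobs must be at least 1")
    base, extra = divmod(len(items), n_jobs)
    chunks, start = [], 0
    for j in range(n_jobs):
        # The first `extra` chunks get one additional item.
        size = base + (1 if j < extra else 0)
        if size:
            chunks.append(items[start:start + size])
        start += size
    return chunks


def hcopy_commands(n_chunks, config="config.cfg"):
    """Build one HCopy command line per chunk (file names are hypothetical)."""
    return ["HCopy -C %s -S part%d.scp" % (config, i) for i in range(n_chunks)]


if __name__ == "__main__":
    chunks = split_evenly(["a.wav a.mfc", "b.wav b.mfc", "c.wav c.mfc"], 2)
    for command in hcopy_commands(len(chunks)):
        print(command)
```

Contiguous chunks keep the original file order within each job, which makes it easier to concatenate per-job outputs afterwards.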

Data

dtmfseg (Bowon Lee, 2006): Segment audio files at DTMF tones; SVN repository
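How dtmfseg detects tones is not described on this page. As an illustration only, one common way to spot a DTMF key in a short frame is the Goertzel algorithm, sketched below in Python. The function names and the `min_power` threshold are assumptions made for this sketch; a segmenter could then cut the audio wherever consecutive frames report a key.

```python
import math

# Standard DTMF row and column frequencies in Hz, and the keypad layout.
ROWS = (697, 770, 852, 941)
COLS = (1209, 1336, 1477, 1633)
KEYS = ("123A", "456B", "789C", "*0#D")


def goertzel_power(samples, freq, rate):
    """Return the squared magnitude of `samples` at `freq` (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / rate)
    s1 = s2 = 0.0
    for x in samples:
        s1, s2 = x + coeff * s1 - s2, s1
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def detect_key(samples, rate, min_power=1.0):
    """Return the DTMF key in this frame, or None (min_power is an arbitrary threshold)."""
    row_p = [goertzel_power(samples, f, rate) for f in ROWS]
    col_p = [goertzel_power(samples, f, rate) for f in COLS]
    r = max(range(4), key=row_p.__getitem__)
    c = max(range(4), key=col_p.__getitem__)
    if row_p[r] < min_power or col_p[c] < min_power:
        return None
    return KEYS[r][c]
```

A real detector would also compare the strongest row and column against the remaining frequencies to reject speech and noise; that check is omitted here.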

transcription tools (Mark Hasegawa-Johnson, 2005): Convert transcription formats; TGZ archive; SVN repository

speechfileformats (Mark Hasegawa-Johnson, 2004): Read and write HTK files in MATLAB; TGZ archive; SVN repository
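For readers unfamiliar with the format, an HTK parameter file starts with a 12-byte big-endian header (number of frames, sample period in 100 ns units, bytes per frame, parameter kind code) followed by the frame data. The sketch below is Python, not the MATLAB code from the archive, and handles only uncompressed 32-bit float features; the function names are made up for this example, and compressed or checksummed files are not supported.

```python
import struct

# nSamples (int32), sampPeriod (int32), sampSize (int16), parmKind (int16)
HEADER = struct.Struct(">iihh")


def write_htk(path, frames, samp_period=100000, parm_kind=9):
    """Write float frames; 100000 x 100 ns = 10 ms, and 9 is HTK's USER kind."""
    dim = len(frames[0])
    with open(path, "wb") as f:
        f.write(HEADER.pack(len(frames), samp_period, 4 * dim, parm_kind))
        for frame in frames:
            f.write(struct.pack(">%df" % dim, *frame))


def read_htk(path):
    """Return (frames, samp_period, parm_kind) from an uncompressed float file."""
    with open(path, "rb") as f:
        n, period, size, kind = HEADER.unpack(f.read(HEADER.size))
        dim = size // 4  # four bytes per float32 coefficient
        frames = [list(struct.unpack(">%df" % dim, f.read(size)))
                  for _ in range(n)]
    return frames, period, kind
```

Values pass through 32-bit floats on disk, so a round trip is exact only for numbers representable in single precision.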

CTMRedit (Jul Cha and MH 1999): Manually and automatically segment CT and MR image stacks; Description; SVN repository

improved MVA (Arthur Kantor 2008)

Perform mean and variance normalization and ARMA filtering

It's essentially this version but with

better error reporting (e.g. failing to open file tells you so instead of core dumping)
more accurate mean and variance estimation (doubles instead of floats in strategic places)
faster computation in the case of MV (ARMA order 0)

binary; SVN repository
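The processing chain can be sketched in a few lines of Python. This is not the MVA.cc code; it assumes the commonly published form of the ARMA smoothing step, in which each interior frame is replaced by the average of the previous `order` outputs and the current and next `order` inputs, while edge frames are left unsmoothed. The function name `mva` and its `order` parameter are made up for this sketch.

```python
import math


def mva(frames, order=2):
    """Mean/variance normalize each feature dimension, then ARMA-smooth it.

    frames: list of equal-length feature vectors (lists of floats).
    order:  ARMA order; 0 means mean and variance normalization only.
    """
    if not frames:
        return []
    n, dim = len(frames), len(frames[0])
    out = [[0.0] * dim for _ in range(n)]
    for d in range(dim):
        col = [frame[d] for frame in frames]
        # fsum accumulates in double precision without drift.
        mean = math.fsum(col) / n
        var = math.fsum((v - mean) ** 2 for v in col) / n
        std = math.sqrt(var) if var > 0 else 1.0  # constant column: avoid 0/0
        x = [(v - mean) / std for v in col]
        y = list(x)  # edge frames keep their normalized values
        for t in range(order, n - order):
            if order == 0:
                break  # MV only: nothing to smooth
            past = math.fsum(y[t - order:t])         # previous outputs
            future = math.fsum(x[t:t + order + 1])   # current and next inputs
            y[t] = (past + future) / (2 * order + 1)
        for t in range(n):
            out[t][d] = y[t]
    return out
```

With `order=0` the smoothing loop exits immediately, which corresponds to the MV-only case mentioned in the list above.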

Miscellaneous

Other scripts written in Perl, Python, Bash, and Ruby can be found in the SVN archive.

There is also auto-generated documentation for them.

@@ Line 1: / Line 1: @@
 ===Statistical Speech Technology Group Software===
 Our policy: everything we write is free on the web. This wiki is intended to be definitive, because anybody in the group can edit it to add their own software. A spider-indexable backup is at http://www.isle.uiuc.edu/software .
@@ Line 7: / Line 6: @@
 * On Windows, download [http://tortoisesvn.tigris.org/ TortoiseSVN]
 * On Linux, use the svn command, e.g., svn co svn://mickey.ifp.uiuc.edu/speechfileformats
 ==Learning==
 ; Pronounce: An orthographic string to phonetic string mapping tool. (Arthur Kantor, 2007)
 :This tool computes American English phonetic transcriptions from plaintext.  Its HMM either generates a most likely phonetic transcription, or forces alignment if a phonetic transcription is provided.  So, it gives a reasonable pronunciation for both out-of-dictionary words and partially pronounced words.
@@ Line 39: / Line 38: @@
 : [http://fodava.gatech.edu/files/reports/GT-FODAVA-09-01.pdf Description], [http://www.isle.uiuc.edu/software/stft.c stft.c]
+;Improved Mistral (Qingsong Liu 2009)
-:Improved Mistral (Qingsong Liu 2009)
 :State of the Art Text-Independent Speaker Verification System,especially for NIST SRE
 :Based on [http://mistral.univ-avignon.fr/wiki/index.php/Main_Page Mistral Open Source package]
@@ Line 51: / Line 49: @@
 :* fixed some bugs
 :Source: /ws/ifp-32-2/hasegawa/pineking/programs/Improved_Mistral
 ==Computation==
@@ Line 95: / Line 94: @@
 :* faster computation in the case of MV (ARMA order 0)
 :[http://mickey.ifp.uiuc.edu/speech/akantor/fisher/programs/bin.Linux/MVA binary] [svn://mickey.ifp.uiuc.edu/corporaNormalizationScripts/fisher/MVA.cc SVN repository]
 ==Miscellaneous==
