Software

From SpeechWiki

(Difference between revisions)
Jump to: navigation, search
Line 33: Line 33:
;Nested STFTs (Dave Cohen, Camille Goudeseune, Mark Hasegawa-Johnson 2009)
;Nested STFTs (Dave Cohen, Camille Goudeseune, Mark Hasegawa-Johnson 2009)
-
:Efficient Simultaneous Multi-Scale Computation of FFTs
 
: [http://fodava.gatech.edu/files/reports/GT-FODAVA-09-01.pdf Description], [http://www.isle.uiuc.edu/software/stft.c stft.c]  
: [http://fodava.gatech.edu/files/reports/GT-FODAVA-09-01.pdf Description], [http://www.isle.uiuc.edu/software/stft.c stft.c]  
 +
:Efficient Simultaneous Multi-Scale Computation of FFTs
;Improved Mistral (Qingsong Liu 2009)
;Improved Mistral (Qingsong Liu 2009)
Line 51: Line 51:
==Computation==
==Computation==
;GMTK Parallel (Arthur Kantor, 2008)
;GMTK Parallel (Arthur Kantor, 2008)
-
:Split GMTK commands into batch jobs for a cluster
+
:[[GMTK parallel tools|Description]]
-
:[[GMTK parallel tools|Description]],
+
:The scripts are written and working but not packaged yet.  Email [[User:Arthur|Arthur]] for code.
-
:[svn://mickey.ifp.uiuc.edu/gmtkScripts/ SVN repository]  
+
:Run GMTK commands in parallel on a compute cluster
;HTK Parallel (Bowon Lee, 2006)
;HTK Parallel (Bowon Lee, 2006)
-
:These Perl scripts ([http://www.ifp.uiuc.edu/~bowonlee/research/cluster/HTK_parallel.htm description]) Split an HTK command for parallel excution on a [http://www.ifp.uiuc.edu/~bowonlee/research/cluster/linux_cluster.htm SGE] cluster.  
+
:[http://www.ifp.uiuc.edu/~bowonlee/research/cluster/HTK_parallel.htm description], [{{SoftwarePath}}/HTK_parallel source], [{{SoftwarePath}}/HTK_parallel.tgz tgz]
-
:[http://www.ifp.uiuc.edu/~bowonlee/research/cluster/HTK_parallel.htm Description],
+
:These Perl scripts Split an HTK command for parallel excution on a [http://www.ifp.uiuc.edu/~bowonlee/research/cluster/linux_cluster.htm SGE] cluster.  
-
:[http://www.ifp.uiuc.edu/~bowonlee/research/htk-pl/HCopy.pl HCopy.pl],
+
-
[http://www.ifp.uiuc.edu/~bowonlee/research/htk-pl/HVite.pl HVite.pl],
+
-
[http://www.ifp.uiuc.edu/~bowonlee/research/htk-pl/HERest.pl HERest.pl],
+
-
[http://www.ifp.uiuc.edu/~bowonlee/research/htk-pl/HResults.pl HResults.pl]
+
-
:[svn://mickey.ifp.uiuc.edu/HTK_parallel/ SVN repository]
+
==Data==
==Data==
;dtmfseg (Bowon Lee, 2006)
;dtmfseg (Bowon Lee, 2006)
 +
:[{{SoftwarePath}}/dtmfseg source], [{{SoftwarePath}}/dtmfseg.tgz tgz]
:Segment audio files at DTMF tones
:Segment audio files at DTMF tones
-
:[svn://mickey.ifp.uiuc.edu/dtmfseg/ SVN repository]
 
;transcription tools (Mark Hasegawa-Johnson, 2005)
;transcription tools (Mark Hasegawa-Johnson, 2005)
 +
:[{{SoftwarePath}}/transcription_tools source], [{{SoftwarePath}}/transcription_tools.tgz tgz]
:Convert transcription formats
:Convert transcription formats
-
:[http://www.isle.uiuc.edu/software/transcription_tools2005May.tgz TGZ archive]
 
-
:[svn://mickey.ifp.uiuc.edu/transcription_tools/ SVN repository]
 
;speechfileformats (Mark Hasegawa-Johnson, 2004)
;speechfileformats (Mark Hasegawa-Johnson, 2004)
 +
:[{{SoftwarePath}}/speechfileformats source], [{{SoftwarePath}}/speechfileformats.tgz tgz]
:Read and write HTK files in matlab
:Read and write HTK files in matlab
-
:[http://www.isle.uiuc.edu/software/speechfileformats.tgz TGZ archive]
 
-
:[svn://mickey.ifp.uiuc.edu/speechfileformats/ SVN repository]
 
;CTMRedit (Jul Cha and MH 1999)
;CTMRedit (Jul Cha and MH 1999)
 +
:[http://www.isle.uiuc.edu/pubs/1990s/hasegawa-johnson99embs.pdf Description], [{{SoftwarePath}}/CTMRedit source], [{{SoftwarePath}}/CTMRedit.tgz tgz]
:Manually and automatically segment CT and MR image stacks
:Manually and automatically segment CT and MR image stacks
-
:[http://www.isle.uiuc.edu/pubs/1990s/hasegawa-johnson99embs.pdf Description]
 
-
:[svn://mickey.ifp.uiuc.edu/CTMRedit SVN repository]
 
;improved MVA (Arthur Kantor 2008)
;improved MVA (Arthur Kantor 2008)
 +
:[http://mickey.ifp.uiuc.edu/speech/akantor/fisher/programs/bin.Linux/MVA Linux binary]
:Perform mean and variance normalization and ARMA filtering
:Perform mean and variance normalization and ARMA filtering
:It's essentially [http://ssli.ee.washington.edu/people/chiaping/mva.html this] version but with
:It's essentially [http://ssli.ee.washington.edu/people/chiaping/mva.html this] version but with
Line 91: Line 84:
:* more accurate mean and variance estimation (doubles instead of floats in strategic places)
:* more accurate mean and variance estimation (doubles instead of floats in strategic places)
:* faster computation in the case of MV (ARMA order 0)
:* faster computation in the case of MV (ARMA order 0)
-
:[http://mickey.ifp.uiuc.edu/speech/akantor/fisher/programs/bin.Linux/MVA binary] [svn://mickey.ifp.uiuc.edu/corporaNormalizationScripts/fisher/MVA.cc SVN repository]
+
:svn location is svn://mickey.ifp.uiuc.edu/corporaNormalizationScripts/fisher/MVA.cc  
-
 
+
==Miscellaneous==
==Miscellaneous==

Revision as of 18:31, 27 March 2010

Contents

Statistical Speech Technology Group Software

Our policy: everything we write is free on the web. This wiki is intended to be definitive, because anybody in the group can edit it to add their own software. A spider-indexable backup is at http://www.isle.uiuc.edu/software .

You can access each project by browsing an SVN snapshot online or downloading at tgz file by using one of the links below.

You can also check it out of our subversion server using login name "anon" with no password (hit "enter" when a password is requested).


Learning

Pronounce (Arthur Kantor, 2007)
Description, Demo, source, tgz
An orthographic string to phonetic string mapping tool.
This tool computes American English phonetic transcriptions from plaintext. Its HMM either generates a most likely phonetic transcription, or forces alignment if a phonetic transcription is provided. So, it gives a reasonable pronunciation for both out-of-dictionary words and partially pronounced words.
HTK-based Explicit-duration HMM (Ken Chen, 2003)
Description, source, tgz
HTKtrain (Sarah Borys and Mark Hasegawa-Johnson, 2008)
source, tgz
Scripts for training HMMs using HTK


Signal Processing

PVTK (Sarah Borys and MH 2005-8)
source, tgz
Extract HTK features as training vectors for libSVM, apply trained SVMs directly to feature files
VAD (Bowon Lee, 2007)
Description, source, tgz
Voice activity detector with improved noise model
Nested STFTs (Dave Cohen, Camille Goudeseune, Mark Hasegawa-Johnson 2009)
Description, stft.c
Efficient Simultaneous Multi-Scale Computation of FFTs
Improved Mistral (Qingsong Liu 2009)
State of the Art Text-Independent Speaker Verification System,especially for NIST SRE
Based on Mistral Open Source package
Improved and New Features:
  • add full factor analysis(eigenchannel and eigenvoice), instead of simple factor analysis(eigenchannel)
  • add multi-threads for Windows as well as Linux
  • support read HTK format feature/model
  • add an effective Algorithm for fast implementation of FA.
  • code optimization(for FA)
  • fixed some bugs
Source: /ws/ifp-32-2/hasegawa/pineking/programs/Improved_Mistral


Computation

GMTK Parallel (Arthur Kantor, 2008)
Description
The scripts are written and working but not packaged yet. Email Arthur for code.
Run GMTK commands in parallel on a compute cluster
HTK Parallel (Bowon Lee, 2006)
description, source, tgz
These Perl scripts Split an HTK command for parallel excution on a SGE cluster.


Data

dtmfseg (Bowon Lee, 2006)
source, tgz
Segment audio files at DTMF tones
transcription tools (Mark Hasegawa-Johnson, 2005)
source, tgz
Convert transcription formats
speechfileformats (Mark Hasegawa-Johnson, 2004)
source, tgz
Read and write HTK files in matlab
CTMRedit (Jul Cha and MH 1999)
Description, source, tgz
Manually and automatically segment CT and MR image stacks
improved MVA (Arthur Kantor 2008)
Linux binary
Perform mean and variance normalization and ARMA filtering
It's essentially this version but with
  • better error reporting (e.g. failing to open file tells you so instead of core dumping)
  • more accurate mean and variance estimation (doubles instead of floats in strategic places)
  • faster computation in the case of MV (ARMA order 0)
svn location is svn://mickey.ifp.uiuc.edu/corporaNormalizationScripts/fisher/MVA.cc

Miscellaneous

Other scripts written in perl, python, bash, and ruby can be found in SVN archive.

There is also auto-generated documentation for them.

Personal tools