Timeliner
From SpeechWiki
(Difference between revisions)
(Created page with ' == ruby codes /workspace/ifp-32-2/hasegawa/data/multimodal/nonspeech/FODAVA/timeliner (the feature file opened is currently hardcoded , update that line before using) == set u...') |
(more dependencies) |
||
(23 intermediate revisions not shown) | |||
Line 1: | Line 1: | ||
+ | ==Ruby codes== | ||
- | |||
/workspace/ifp-32-2/hasegawa/data/multimodal/nonspeech/FODAVA/timeliner | /workspace/ifp-32-2/hasegawa/data/multimodal/nonspeech/FODAVA/timeliner | ||
- | |||
- | + | Feature files and directory are hardcoded in the .rb file. Update those lines before running. | |
- | + | ||
- | + | ||
- | + | ||
- | + | ||
- | + | ||
- | + | ||
- | + | ||
- | + | == Dependencies == | |
- | + | ||
- | + | ||
- | + | Ubuntu. Verified with Xiaodan's 9.04, Camille's 64bit 8.10, Mark's 9.04. 9.1 in progress. | |
- | + | ||
- | + | Get security updates. Install packages with aptitude (or apt-get) and gem (some back and forth to discover dependencies): | |
- | + | * aptitude update | |
- | + | * aptitude install sox audacity libaudiofile-dev ruby ruby1.8-dev rubygems1.8 mesa-common-dev libglu1-mesa-dev freeglut3-dev imagemagick libmagickcore-dev libmagickwand-dev ''(may need to change order; may replace *magick* with libmagick++9-dev)'' | |
+ | * From http://rubygems.org/ , install gem from the .tgz file. Follow its instructions. | ||
+ | * gem update | ||
+ | * gem install rake mkrf ZenTest RubyInline rspec rice | ||
+ | (consider sticking to ruby 1.8 for the sake of http://rmagick.rubyforge.org/install-faq.html) | ||
+ | * gem install rmagick --no-ri --no-rdoc (fails on 8.10, imagemagick is too old?) | ||
+ | * gem install ruby-opengl (fails on ubuntu 9.1 and 10.04): http://rubyforge.org/tracker/index.php?func=detail&aid=27386&group_id=2103&atid=8185 ) | ||
+ | * Install HCopy and HList from HTK-3.4.tar.gz (register first). | ||
+ | |||
+ | ==== Notes ==== | ||
+ | * locate ''finds a file on local disk'' | ||
+ | |||
+ | * apt-file find ruby.h ''lists packages that generate ruby.h'' | ||
+ | * alsamixer ''adjusts volume'' | ||
+ | |||
+ | ==Feature files in HTK format== | ||
+ | |||
+ | /workspace/ifp-32-2/hasegawa/xzhuang2/AED2009/tmp/forCamille | ||
+ | |||
+ | ====Making==== | ||
+ | HCopy -C $ConfigFile AIT_20061020_AmarkIII_1.ch4.wav $ResultFile | ||
+ | |||
+ | ====Viewing==== | ||
+ | HList -h $ResultFile | ||
+ | |||
+ | ====Examples==== | ||
+ | {| {{prettytable}} | ||
+ | ! ConfigFile | ||
+ | ! ResultFile | ||
+ | ! Description | ||
+ | ! Dims | ||
+ | ! Window | ||
+ | ! Step | ||
+ | |- | ||
+ | | HCopy_MFCC.cfg | ||
+ | | AIT_20061020_AmarkIII_1.ch4.feaMFCC | ||
+ | | MFCC | ||
+ | | 78 | ||
+ | | 25ms | ||
+ | | 10ms | ||
+ | |- | ||
+ | | HCopy_FB.cfg | ||
+ | | AIT_20061020_AmarkIII_1.ch4.feaFB | ||
+ | | Narrow-band filterbank | ||
+ | | 78 | ||
+ | | 25ms | ||
+ | | 10ms | ||
+ | |- | ||
+ | | HCopy_FB_w6ms_o2ms.cfg | ||
+ | | AIT_20061020_AmarkIII_1.ch4.feaFB_w6ms_o2ms | ||
+ | | Wide-band filterbank | ||
+ | | 78 | ||
+ | | 6ms | ||
+ | | 2ms | ||
+ | |} | ||
+ | |||
+ | ====Notes==== | ||
+ | *Audio: | ||
+ | AIT_20061020_AmarkIII_1.ch4.wav | ||
+ | |||
+ | *92 dims (14 dim decorrelated event-classifier neural network output + 78 dim filter bank parameters): | ||
+ | AIT_20061020_AmarkIII_1.ch4.annhtk | ||
+ | |||
+ | *For each 78 dim filter bank parameters (or MFCCs), the first 26 dims are original, the second and third 26 dims are respectively first- and second-order regression coefficients derived from the first 26 dims. |
Latest revision as of 22:55, 17 August 2010
Contents |
Ruby codes
/workspace/ifp-32-2/hasegawa/data/multimodal/nonspeech/FODAVA/timeliner
Feature files and directory are hardcoded in the .rb file. Update those lines before running.
Dependencies
Ubuntu. Verified with Xiaodan's 9.04, Camille's 64bit 8.10, Mark's 9.04. 9.1 in progress.
Get security updates. Install packages with aptitude (or apt-get) and gem (some back and forth to discover dependencies):
- aptitude update
- aptitude install sox audacity libaudiofile-dev ruby ruby1.8-dev rubygems1.8 mesa-common-dev libglu1-mesa-dev freeglut3-dev imagemagick libmagickcore-dev libmagickwand-dev (may need to change order; may replace *magick* with libmagick++9-dev)
- From http://rubygems.org/ , install gem from the .tgz file. Follow its instructions.
- gem update
- gem install rake mkrf ZenTest RubyInline rspec rice
(consider sticking to ruby 1.8 for the sake of http://rmagick.rubyforge.org/install-faq.html)
- gem install rmagick --no-ri --no-rdoc (fails on 8.10, imagemagick is too old?)
- gem install ruby-opengl (fails on ubuntu 9.1 and 10.04): http://rubyforge.org/tracker/index.php?func=detail&aid=27386&group_id=2103&atid=8185 )
- Install HCopy and HList from HTK-3.4.tar.gz (register first).
Notes
- locate finds a file on local disk
- apt-file find ruby.h lists packages that generate ruby.h
- alsamixer adjusts volume
Feature files in HTK format
/workspace/ifp-32-2/hasegawa/xzhuang2/AED2009/tmp/forCamille
Making
HCopy -C $ConfigFile AIT_20061020_AmarkIII_1.ch4.wav $ResultFile
Viewing
HList -h $ResultFile
Examples
ConfigFile | ResultFile | Description | Dims | Window | Step |
---|---|---|---|---|---|
HCopy_MFCC.cfg | AIT_20061020_AmarkIII_1.ch4.feaMFCC | MFCC | 78 | 25ms | 10ms |
HCopy_FB.cfg | AIT_20061020_AmarkIII_1.ch4.feaFB | Narrow-band filterbank | 78 | 25ms | 10ms |
HCopy_FB_w6ms_o2ms.cfg | AIT_20061020_AmarkIII_1.ch4.feaFB_w6ms_o2ms | Wide-band filterbank | 78 | 6ms | 2ms |
Notes
- Audio:
AIT_20061020_AmarkIII_1.ch4.wav
- 92 dims (14 dim decorrelated event-classifier neural network output + 78 dim filter bank parameters):
AIT_20061020_AmarkIII_1.ch4.annhtk
- For each 78 dim filter bank parameters (or MFCCs), the first 26 dims are original, the second and third 26 dims are respectively first- and second-order regression coefficients derived from the first 26 dims.