Timeshrinking

From SpeechWiki

(Difference between revisions)

Revision as of 20:33, 18 September 2009

Fisher experiments

number of frames dropped on Fisher corpus
<math>\tau</math>	frames dropped
1	0%
.9	~5%
.6	~35%

Timeshrinking results on fisher
train <math>\tau</math>	test <math>\tau</math>	dev 2k utt WER	comments
1	1	51.6%	old baseline
1	1	TR	new baseline
.6	.6	69.3%
.9	.9	56.3%
1	.9	53.9%
.9	.9	57.2	using the non-timeshrinking str file for test
.9	1	54.6

I have to check for bugs. It could be that the threshold is too low or it could be something else too. We should probably rerun baseline too, just to make sure I didn't optimize it unfairly.

Things to try

Test svitchboard with fisher-trained model to see if we still get good results
Train and test on plp+mlp, like svitchboard timeshrinking was done.
Do baseline train+test to see if something changed in going from baseline to timeshrink structure files.

Future Directions

Can be viewed as a two-mode special case of best-first viterbi search. So make a real best-first lattice search. Mark mentioned some attempts in the 80'ies to do this.

@@ Line 1: / Line 1: @@
 ==Fisher experiments==
 {| class="wikitable"
-! train <math>\tau</math> !! test <math>\tau</math> !! dev 2k utt WER !! frames dropped !! comments
+|+  number of frames dropped on Fisher corpus
+! <math>\tau</math> !! frames dropped
 |-
-| 1 || 1 || [{{SERVER}}/speech/akantor/fisher/exp/triphone/test2kUttOnConvGaus.noUnits/config73/LATEST.log 51.6%] || 0% || old baseline
+| 1 || 0%
 |-
-| 1 || 1 || [{{SERVER}}/speech/akantor/fisher/exp/.../LATEST.log TR] || 0% || new baseline
+| .9 || ~5%
 |-
-| .6 || .6 || 69.3% || ~35%
+| .6 || ~35%
+|}
+{| class="wikitable"
+|+ Timeshrinking results on fisher
+! train <math>\tau</math> !! test <math>\tau</math> !! dev 2k utt WER  !! comments
 |-
-| .9 || .9 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/test/unit.tri.timeshrink.point9/LATEST.log 56.3%] || ~5%
+| 1 || 1 || [{{SERVER}}/speech/akantor/fisher/exp/triphone/test2kUttOnConvGaus.noUnits/config73/LATEST.log 51.6%] || old baseline
 |-
-| 1 || .9 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/testOnBaseline/unit.tri.timeshrink.point9/LATEST.log 53.9%]|| ~5%
+| 1 || 1 || [{{SERVER}}/speech/akantor/fisher/exp/.../LATEST.log TR] || new baseline
 |-
-| .9 || .9 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/test/unit.tri.timeshrink.point9.noTsStr/LATEST.log 57.2] || ~5% || using the non-timeshrinking str file for test
+| .6 || .6 || 69.3%
 |-
-| .9 || 1 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/test/unit.tri.timeshrink.1.noTsStr/LATEST.log 54.6] || ~5%
+| .9 || .9 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/test/unit.tri.timeshrink.point9/LATEST.log 56.3%]
+|-
+| 1 || .9 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/testOnBaseline/unit.tri.timeshrink.point9/LATEST.log 53.9%]
+|-
+| .9 || .9 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/test/unit.tri.timeshrink.point9.noTsStr/LATEST.log 57.2] || using the non-timeshrinking str file for test
+|-
+| .9 || 1 || [{{SERVER}}/speech/akantor/fisher/exp/timeshrink/test/unit.tri.timeshrink.1.noTsStr/LATEST.log 54.6]
 |}

Timeshrinking

From SpeechWiki

Revision as of 20:33, 18 September 2009

Fisher experiments

Things to try

Future Directions

Views

Personal tools

Navigation

Toolbox

Search