Landmark-Based and Prosody-Dependent Speech Recognition

From SpeechWiki

(Difference between revisions)

Jump to: navigation, search

Revision as of 21:40, 15 February 2010

Landmark-Based and Prosody-Dependent Speech Recognition, 2009-2010

May 2010

Tuesday May 11,: 8:00-6:30, 2169 BI; Speech Prosody

Tuesday May 4,: 12:30-2:00, 2169 BI; Jui-Ting Huang, Jennifer Cole; Speech Prosody Practice Talks

April 2010

Tuesday April 27,: 12:30-2:00, 2169 BI; Yoonsook Mo, David Harwath; Speech Prosody Practice Talks

Tuesday April 20,: 12:30-2:00, 2169 BI; Skip meeting because of ASA?

Tuesday April 13,: 12:30-2:00, 2169 BI; open

Tuesday April 6,: 12:30-2:00, 2169 BI; open

March 2010

Tuesday March 30,: 12:30-2:00, 2169 BI; open

Tuesday March 23,: 12:30-2:00, 2169 BI; Spring Break

Tuesday March 16,: 12:30-2:00, 2169 BI; Skip meeting because of ICASSP?

Tuesday March 9,: 12:30-2:00, 2169 BI; open

Tuesday March 2,: 12:30-2:00, 2169 BI; open

February 2010

Tuesday February 23,: 12:30-2:00, 2169 BI; Chi Hu; Gesture-based lexicon for speech recognition

Tuesday February 16, 2010,: 12:30-2:00, 2169 BI; open

Tuesday February 9,: 12:30-2:00, 2169 BI; Xiaodan Zhuang; Audiovisual speech synthesis

Tuesday February 2,: 12:30-2:00, 2169 BI; Dayna; Phonetic correlates of focus scope

January 2010

Tuesday January 26, 2010,: 12:30-2:00, 2169 BI; Open discussion; What are the TADA gestures? Gestural scores; Some sketch of Canonical Gesture Scores in TADA: "before", "about", "brush", "companions",

Tuesday January 19, 2010,: 12:30-2:00, 2169 BI; Planning meeting for spring semester

Fall 2009

Tuesday December 8, 2009,: 12:30-2:00, 2169 BI; Tim Mahrt; Automatic P-score and B-score labeling using HMMs

Tuesday December 1, 2009,: 12:30-2:00, 2169 BI; Yoonsook Mo; Speaker-dependent vs. speaker-independent models of prosody; Boundary detection with vs without pause

Tuesday November 11, 2009,: 12:30-2:00, 2169 BI; Jui-Ting Huang and Po-Sen Huang; Variable-parameter HMM indexed by P-score (prominence score)

Tuesday October 20, 2009,: 12:30-2:00, 2169 BI; Chi Hu; Finite State ASR Dictionary using Gesture Pattern Vectors as Units

Tuesday October 13, 2009,: 12:30-2:00, 2169 BI; Alina Khasanova; Stop Consonant Reduction Phenomena

Tuesday October 6, 2009,: 12:30-2:00, 2169 BI; Jennifer Cole; presents Daniel Hirst's tutorial, Prosody Modeling and Synthesis, from Interspeech

Tuesday September 30, 2009,: 12:30-2:00, 2169 BI; Mark Hasegawa-Johnson; presents Tokuda & Zen tutorial, HMM-Based Speech Synthesis, from Interspeech

Summer 2009

The landmark-based speech recognition group will meet during Summer 2009 on roughly alternate Thursdays, from 10:00-11:30 AM, in Beckman 2369.

August 6, 2009: Sarah will present her work with auditory modeling.

July 23, 2009: Chi will lead a discussion of three papers on finite state transducers to model pronunciation variation:; Timothy J. Hazen, I. Lee Hetherington, Han Shu, and Karen Livescu, 2002. PRONUNCIATION MODELING USING (Priority); A FINITE-STATE TRANSDUCER REPRESENTATION. http://groups.csail.mit.edu/sls//publications/2002/hazen-pmla.pdf; Han Shu and I. Lee Hetherington, 2002. EM TRAINING OF FINITE-STATE TRANSDUCERS; AND ITS APPLICATION TO PRONUNCIATION MODELING. http://groups.csail.mit.edu/sls//publications/2002/shu-icslp.pdf; I. Lee Hetherington, 2001. An Efficient Implementation of Phonological Rules; using Finite-State Transducers. http://groups.csail.mit.edu/sls/publications/2001/ilh-preprint.pdf; Chi will present her work with Xiaodan on word recognition from tract variables using Vikram's data(If time is enough)

July 16, 2009: Alina will discuss her current work on the frequency of vowel co-occurrence patterns in the English CELEX lexicon.; Dave will lead the discussion of Tilsen & Johnson's JASA paper. The 2008 CLS paper covers the same material but is lighter on technical detail and directed to a linguistics reader. The 2009 CogSci paper will not be discussed but is shared here.

Tilsen, S. & Johnson, K. (2008). Low-frequency Fourier analysis of speech rhythm. Journal of the Acoustical Society of America, 124:2, pp. EL34-39.

Tilsen, S. (2008). Relations between speech rhythm and segmental deletion. Paper presented at the 44th annual meeting of the Chicago Linguistic Society.

Tilsen, S. (2009). Multitimescale dynamical interactions between speech rhythm and gesture. Cognitive Science, 33, 839-879.

These articles can be found at http://linguistics.berkeley.edu/~stilsen/CV.html

July 2, 2009: Alina discussed the design of her EMA study on plosive release

June 18, 2009: Discuss plans for summer

Spring 2009

May 7-8, 2009: Multi-University Landmark-Based Speech Recognition Group Meeting; University of Maryland

April 30: Practice talks for Illinois Speech Day, ASA; Yoonsook Mo, Arthur Kantor, Chi Hu, Jui-Ting Huang, Sarah Borys

April 23: A nice intro to kernel methods is Kernel Methods and their potential use in signal processing, F. Perez-Cruz, O. Bousquet, IEEE SIGNAL PROCESSING MAGAZINE MAY 2004 --Arthur
April 16: Discussion of Interspeech Papers

April 9

April 2

March 26: Spring break

March 19: Five-minute presentations of student research; Bob McMurray will be here

March 12: Practice of the Universal Access Open House demo; Heejin Kim, Mark Hasegawa-Johnson, Sarah Borys, Sujeeth Bhardwoy

March 5, 2009: Language Processing in the Natural World, Michael T. Tanenhaus and Sarah Brown-Schmidt

February 26, 2009: Cross-Lingual Recognition and Sound Pattern Retrieval, Jui-Ting Huang and Xiaodan Zhuang

February 12, 2009: Automatic Burst Location, Alina Khasanova

February 19, 2009: Discussion of Kuperman et al. 2008 (JASA v. 124.6) and Margaret Fleck's attempts to replicate results with Buckeye

February 5, 2009: F0 Peak and Formant Values as Cues for Prominence, Yoonsook Mo

January 29, 2009: Landmark-Based Speech Recognition Using SVM/HMM Hybrids, Sarah Borys

January 22, 2009: Planning meeting

Fall 2008

Faculty and students from the University of Maryland, Boston University, the University of Illinois, UCLA, and USC met in Urbana on September 12, 2008 to present new results in landmark-based speech recognition.

Insights Into Pronunciation Modeling and ASR Using Mixed Unit Pronunciation Models: Arthur Kantor

Landmark-Based and Prosody-Dependent Speech Recognition

From SpeechWiki

Revision as of 21:40, 15 February 2010

Contents

Landmark-Based and Prosody-Dependent Speech Recognition, 2009-2010

May 2010

April 2010

March 2010

February 2010

January 2010

Fall 2009

Summer 2009

Spring 2009

Fall 2008

Views

Personal tools

Navigation

Toolbox

Search

@@ Line 1: / Line 1: @@
 ==Landmark-Based and Prosody-Dependent Speech Recognition, 2009-2010==
-===September===
+===May 2010===
-; Tuesday September 30, 2009,
+; Tuesday May 11,
-: 12:30-2:00, 2169 BI
+: 8:00-6:30, 2169 BI
-: Mark Hasegawa-Johnson
+: [http://speechprosody2010.illinois.edu Speech Prosody]
-: presents Tokuda & Zen tutorial, [http://interspeech2009.org/tutorials/t-7.php HMM-Based Speech Synthesis], from Interspeech
-===October===
+; Tuesday May 4,
-; Tuesday October 6, 2009,
 : 12:30-2:00, 2169 BI
-: Jennifer Cole
+: Jui-Ting Huang, Jennifer Cole
-: presents Daniel Hirst's tutorial, [http://interspeech2009.org/tutorials/t-1.php Prosody Modeling and Synthesis], from Interspeech
+: Speech Prosody Practice Talks
-; Tuesday October 13, 2009,
+===April 2010===
-: 12:30-2:00, 2169 BI
-: Alina Khasanova
-: Stop Consonant Reduction Phenomena
-; Tuesday October 20, 2009,
+; Tuesday April 27,
 : 12:30-2:00, 2169 BI
-: Chi Hu
+: Yoonsook Mo, David Harwath
-: [[Media:GMword_FSM_score_OCT.pdf|Finite State ASR Dictionary using Gesture Pattern Vectors as Units]]
+: Speech Prosody Practice Talks
-===November===
+; Tuesday April 20,
-; Tuesday November 11, 2009,
 : 12:30-2:00, 2169 BI
-: Jui-Ting Huang and Po-Sen Huang
+: Skip meeting because of [http://asa.aip.org/baltimore/baltimore.html ASA]?
-: Variable-parameter HMM indexed by P-score (prominence score)
-===December===
+; Tuesday April 13,
-; Tuesday December 1, 2009,
 : 12:30-2:00, 2169 BI
-: Yoonsook Mo
+: open
-: Speaker-dependent vs. speaker-independent models of prosody
-: Boundary detection with vs without pause
-; Tuesday December 8, 2009,
+; Tuesday April 6,
 : 12:30-2:00, 2169 BI
-: Tim Mahrt
+: open
-: Automatic P-score and B-score labeling using HMMs
-===January===
+===March 2010===
-; Tuesday January 19, 2010,
+; Tuesday March 30,
 : 12:30-2:00, 2169 BI
-: Planning meeting for spring semester
+: open
-; Tuesday January 26, 2010,
+; Tuesday March 23,
 : 12:30-2:00, 2169 BI
-: Open discussion
+: Spring Break
-: What are the [http://macserver.haskins.yale.edu/tada_download/index.html TADA] gestures?  Gestural scores
-: Some sketch of Canonical Gesture Scores in TADA:  [[Media:before_gs.jpg|"before"]], [[Media:about_gs.jpg|"about"]], [[Media:brush_gs.jpg|"brush"]], [[Media:companions_gs.jpg|"companions"]],
-===February===
+; Tuesday March 16,
-; Tuesday February 2,
 : 12:30-2:00, 2169 BI
-: Dayna
+: Skip meeting because of [http://www.icassp2010.com ICASSP]?
-: Phonetic correlates of focus scope
-; Tuesday February 9,
+; Tuesday March 9,
 : 12:30-2:00, 2169 BI
-: Xiaodan Zhuang
+: open
-: Audiovisual speech synthesis
-; Tuesday February 16, 2010,
+; Tuesday March 2,
 : 12:30-2:00, 2169 BI
 : open
+===February 2010===
 ; Tuesday February 23,
@@ Line 78: / Line 60: @@
 : Gesture-based lexicon for speech recognition
-===March===
+; Tuesday February 16, 2010,
-; Tuesday March 2,
 : 12:30-2:00, 2169 BI
 : open
-; Tuesday March 9,
+; Tuesday February 9,
 : 12:30-2:00, 2169 BI
-: open
+: Xiaodan Zhuang
+: Audiovisual speech synthesis
-; Tuesday March 16,
+; Tuesday February 2,
 : 12:30-2:00, 2169 BI
-: Skip meeting because of [http://www.icassp2010.com ICASSP]?
+: Dayna
+: Phonetic correlates of focus scope
-; Tuesday March 23,
+===January 2010===
+; Tuesday January 26, 2010,
 : 12:30-2:00, 2169 BI
-: Spring Break
+: Open discussion
+: What are the [http://macserver.haskins.yale.edu/tada_download/index.html TADA] gestures?  Gestural scores
+: Some sketch of Canonical Gesture Scores in TADA:  [[Media:before_gs.jpg|"before"]], [[Media:about_gs.jpg|"about"]], [[Media:brush_gs.jpg|"brush"]], [[Media:companions_gs.jpg|"companions"]],
-; Tuesday March 30,
+; Tuesday January 19, 2010,
 : 12:30-2:00, 2169 BI
-: open
+: Planning meeting for spring semester
-===April===
+===Fall 2009===
-; Tuesday April 6,
+; Tuesday December 8, 2009,
 : 12:30-2:00, 2169 BI
-: open
+: Tim Mahrt
+: Automatic P-score and B-score labeling using HMMs
-; Tuesday April 13,
+; Tuesday December 1, 2009,
 : 12:30-2:00, 2169 BI
-: open
+: Yoonsook Mo
+: Speaker-dependent vs. speaker-independent models of prosody
+: Boundary detection with vs without pause
-; Tuesday April 20,
+; Tuesday November 11, 2009,
 : 12:30-2:00, 2169 BI
-: Skip meeting because of [http://asa.aip.org/baltimore/baltimore.html ASA]?
+: Jui-Ting Huang and Po-Sen Huang
+: Variable-parameter HMM indexed by P-score (prominence score)
-; Tuesday April 27,
+; Tuesday October 20, 2009,
 : 12:30-2:00, 2169 BI
-: Yoonsook Mo, David Harwath
+: Chi Hu
-: Speech Prosody Practice Talks
+: [[Media:GMword_FSM_score_OCT.pdf|Finite State ASR Dictionary using Gesture Pattern Vectors as Units]]
-===May===
+; Tuesday October 13, 2009,
+: 12:30-2:00, 2169 BI
+: Alina Khasanova
+: Stop Consonant Reduction Phenomena
-; Tuesday May 4,
+; Tuesday October 6, 2009,
 : 12:30-2:00, 2169 BI
-: Jui-Ting Huang, Jennifer Cole
+: Jennifer Cole
-: Speech Prosody Practice Talks
+: presents Daniel Hirst's tutorial, [http://interspeech2009.org/tutorials/t-1.php Prosody Modeling and Synthesis], from Interspeech
-; Tuesday May 11,
+; Tuesday September 30, 2009,
-: 8:00-6:30, 2169 BI
+: 12:30-2:00, 2169 BI
-: [http://speechprosody2010.illinois.edu Speech Prosody]
+: Mark Hasegawa-Johnson
+: presents Tokuda & Zen tutorial, [http://interspeech2009.org/tutorials/t-7.php HMM-Based Speech Synthesis], from Interspeech
+===Summer 2009===
+The landmark-based speech recognition group will meet during Summer 2009 on roughly alternate Thursdays, from 10:00-11:30 AM, in Beckman 2369.
+; August 6, 2009
+: Sarah will present her work with auditory modeling.
+; July 23, 2009
+: Chi will lead a discussion of three papers on finite state transducers to model pronunciation variation:
+: Timothy J. Hazen, I. Lee Hetherington, Han Shu, and Karen Livescu, 2002. PRONUNCIATION MODELING USING (Priority)
+: A FINITE-STATE TRANSDUCER REPRESENTATION. http://groups.csail.mit.edu/sls//publications/2002/hazen-pmla.pdf
+: Han Shu and I. Lee Hetherington, 2002. EM TRAINING OF FINITE-STATE TRANSDUCERS
+: AND ITS APPLICATION TO PRONUNCIATION MODELING. http://groups.csail.mit.edu/sls//publications/2002/shu-icslp.pdf
+: I. Lee Hetherington, 2001. An Efficient Implementation of Phonological Rules
+: using Finite-State Transducers. http://groups.csail.mit.edu/sls/publications/2001/ilh-preprint.pdf
+: Chi will present her work with Xiaodan on word recognition from tract variables using Vikram's data(If time is enough)
+; July 16, 2009
+: Alina will discuss her current work on the frequency of vowel co-occurrence patterns in the English CELEX lexicon.
+: Dave will lead the discussion of Tilsen & Johnson's JASA paper. The 2008 CLS paper covers the same material but is lighter on technical detail and directed to a linguistics reader. The 2009 CogSci paper will not be discussed but is shared here.
+: Tilsen, S. & Johnson, K. (2008). Low-frequency Fourier analysis of speech rhythm.  Journal of the Acoustical Society of America, 124:2, pp. EL34-39.
+: Tilsen, S. (2008). Relations between speech rhythm and segmental deletion. Paper presented at the 44th annual meeting of the Chicago Linguistic Society.
+: Tilsen, S. (2009). Multitimescale dynamical interactions between speech rhythm and gesture. Cognitive Science, 33, 839-879.
+: These articles can be found at http://linguistics.berkeley.edu/~stilsen/CV.html
+; July 2, 2009
+: Alina discussed the design of her EMA study on plosive release
+; June 18, 2009
+: Discuss plans for summer
+===Spring 2009===
+; May 7-8, 2009
+: Multi-University Landmark-Based Speech Recognition Group Meeting
+: University of Maryland
+; April 30
+: Practice talks for Illinois Speech Day, ASA
+: Yoonsook Mo, Arthur Kantor, Chi Hu, Jui-Ting Huang, Sarah Borys
+; April 23
+: A nice intro to kernel methods is [http://mickey.ifp.uiuc.edu/speech/akantor/ece513/papers/P%e9rez-Cruz2004Kernel%20methods%20and%20their%20potential%20use%20in%20signal%20processing.pdf Kernel Methods and their potential use in signal processing, F. Perez-Cruz, O. Bousquet, IEEE SIGNAL PROCESSING MAGAZINE MAY 2004] --[[User:Arthur|Arthur]]
+; April 16
+: Discussion of Interspeech Papers
+; April 9
+; April 2
+; March 26
+: Spring break
+; March 19
+: Five-minute presentations of student research; Bob McMurray will be here
+; March 12
+: Practice of the Universal Access Open House demo
+: Heejin Kim, Mark Hasegawa-Johnson, Sarah Borys, Sujeeth Bhardwoy
+; March 5, 2009
+: [http://www.isle.uiuc.edu/papers/Tanenhaus08.pdf Language Processing in the Natural World], Michael T. Tanenhaus and Sarah Brown-Schmidt
+; February 26, 2009
+: [http://www.isle.uiuc.edu/slides/2009/MULTIIR-0226.pdf Cross-Lingual Recognition and Sound Pattern Retrieval], Jui-Ting Huang and Xiaodan Zhuang
+; February 12, 2009
+: [http://www.isle.uiuc.edu/slides/2009/Khasanova2009Feb12.ppt Automatic Burst Location], Alina Khasanova
+; February 19, 2009
+: Discussion of Kuperman et al. 2008 (JASA v. 124.6) and Margaret Fleck's attempts to replicate results with Buckeye
+; February 5, 2009
+: [http://www.isle.uiuc.edu/slides/2009/Mo2009Feb05.pdf F0 Peak and Formant Values as Cues for Prominence], Yoonsook Mo
+; January 29, 2009
+: [http://www.isle.uiuc.edu/slides/2009/Borys2009Jan29.pdf Landmark-Based Speech Recognition Using SVM/HMM Hybrids], Sarah Borys
+; January 22, 2009: Planning meeting
+==Fall 2008==
+Faculty and students from the University of Maryland, Boston University, the University of Illinois, UCLA, and USC met in Urbana on September 12, 2008 to present new results in landmark-based speech recognition.
+; [http://www.isle.uiuc.edu/slides/2008/Kantor2008Sep12.pdf Insights Into Pronunciation Modeling and ASR Using Mixed Unit Pronunciation Models]
+: Arthur Kantor
+[[Category:Events]]