Projects

From SpeechWiki

(Difference between revisions)

Jump to: navigation, search

Latest revision as of 23:06, 11 March 2013

Here are some projects that SST People are working on. For another view, see our Publications.

@@ Line 1: / Line 1: @@
 Here are some projects that [[SST People]] are working on.  For another view, see our [http://www.isle.uiuc.edu/pubs Publications].
-===Phonetics, Phonology, Semantics===
+===SST Group Meetings===
+* [[SST Group Meetings]]
+===Phonetics, Phonology, Semantics===
 ; Prosody and Phonology in Automatic Speech Recognition (Landmark-Based Speech Recognition)
@@ Line 20: / Line 24: @@
 ; GroupScope --- Dynamics of Medium-Sized Groups
-: [[groupscope09| Group Meeting Schedules and Slides]]
+: [[GroupScope]]
 ===Language Acquisition, Language Contact, Variability, and Disability===
@@ Line 56: / Line 60: @@
 : [http://www.isle.uiuc.edu/AVICAR/ AVICAR Database]
+; Smaragdis collaboration
+: [[Image:Smaragdis-130218.jpg]]
+: [[Image:Smaragdis-130311.jpg]]
+Pseudocode spec for the sound input class
+(and also output later, but not read-and-write):
+class input_t{
+// Definition of stream characteristics
+class specs_t{
+size_t channels;
+double sample_rate;
+enum sample_format;
+};
+//
+// Constructors
+//
+input_t( ??? stream, bool in_or_out, size_t ch, double sr, enum frm)
+{
+switch( stream){
+case "file"
+use ffmpeg
+case "socket"
+use homebrew code?
+case "url"
+use VLC?
+case "adc"
+Use portaudio
+case "dac"
+Use portaudio
+}
+}
+input_t( ??? stream, input_t example); // copy stream attributes
+input_t( ??? stream, input_t::specs_t example); // copy stream attributes
+Assignment/copy operators
+//
+// Destructor
+//
+~input_t() // bookkeeping with closing file/net/etc.
+//
+// Utilities
+//
+double sample_rate();
+size_t channels();
+enum sample_format();
+bool eof();
+bool();
+//
+// Seeking
+//
+seek( size_t s); // move to sample frame s
+seek( double t); // move to second t
+//
+// Reading
+// output should be channels by sample frames
+array<T> &read( size_t n, size_t offset, int channel_mask); // sample frames
+array<T> &read( double n, double offset, int channel_mask); // seconds
+//
+// Writing
+//
+write( array<T> &x, size_t offset, int channel_mask); // sample frames
+write( array<T> &x, double offset, int channel_mask); // seconds
+write_add( array<T> &x, size_t offset, int channel_mask); // sample frames
+write_add( array<T> &x, double offset, int channel_mask); // seconds
+};
+We are going for a blocking interface instead of cumbersome callbacks for now.  The stream parameters when reading can be used to perform
++on the fly resampling and channel remapping.  I'm attaching the board doodling in case I missed something.
+We are currently working on the getting code to work for the simple case:
+main()
+{
+input_t in( ...);
+while( in){
+x = in.read( ...);
+y = feature( x);
+plot( y);
+}
+}
+I'm working on the feature object, Camille is working on the input object.
 ==See also==
-[http://www.isle.uiuc.edu/pubs SST Publications] | [http://www.isle.uiuc.edu/sst.html SST Group]
+* [http://www.isle.illinois.edu/sst/pubs/ SST publications]
+* [http://www.isle.illinois.edu/sst/ SST group web page]
+* [[Special:Upload]]

Projects

From SpeechWiki

Latest revision as of 23:06, 11 March 2013

Contents

SST Group Meetings

Phonetics, Phonology, Semantics

Group dynamics and Discourse

Language Acquisition, Language Contact, Variability, and Disability

Multimodal Fusion, Speech and Non-Speech

See also

Views

Personal tools

Navigation

Toolbox

Search