[Adapt] Abstract of Tomorrow's Talk

Tue Jun 11 23:07:55 CST 2013

Dear all, 

I'm going to give a talk about my work on Activity Inference from Audio Signal tomorrow. Below is the abstract. 

ABSTRACT 
Audio activity inference, or context recognition, means classifying daily environments from ambient audio clips. In previous work, the acoustic events that serve as basic units in the training clips were labeled manually. This thesis presents a novel method that recognizes the contexts of audio clips without manual annotation of the training dataset. We first build an audible concept vocabulary, which defines the audio events we are concerned with, with the help of online sound taxonomies, WordNet, and Probase. Short audio clips for these events are then obtained through sound search engines (SSEs) and labeled automatically with their query words. In the training stage, each context is modeled as a set of events that frequently co-occur with it in a descriptive corpus. In the testing stage, Mel-frequency cepstral coefficients (MFCCs) are extracted from unknown clips, and individual sound events are then detected using a network of Hidden Markov Model (HMM) classifiers with Gaussian mixture models (GMMs). Context recognition is performed by computing the exact similarity between this event set and that of each predefined context. An average classification accuracy of 56% is obtained across 10 everyday contexts, rising to 72.5% on contexts for which more than 18 important sound events were collected. In terms of event detection, the system recognizes almost half of the events, although their temporal positioning needs further alignment. 
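
For readers who want a concrete picture of the final matching step, here is a minimal sketch. The abstract does not say which similarity measure is used, so Jaccard similarity is an assumption made purely for illustration. The function names, context names, and event labels are also hypothetical and are not taken from the thesis.

```python
def jaccard(a, b):
    """Jaccard similarity between two event sets (an assumed measure)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0  # define the similarity of two empty sets as 0
    return len(a & b) / len(a | b)


def recognize_context(detected_events, context_models):
    """Return the context whose event set is most similar to the detected events."""
    return max(
        context_models,
        key=lambda name: jaccard(detected_events, context_models[name]),
    )


# Hypothetical context models: each context maps to the events that
# frequently co-occur with it (made-up labels, for illustration only).
CONTEXT_MODELS = {
    "street": {"car horn", "engine", "footsteps", "siren"},
    "restaurant": {"dishes", "speech", "laughter", "footsteps"},
    "office": {"keyboard", "speech", "phone ring", "printer"},
}

# Events that the detector is assumed to have found in an unknown clip.
detected = {"speech", "dishes", "laughter"}
best = recognize_context(detected, CONTEXT_MODELS)
```

In this toy setup the detected events overlap most with the "restaurant" set, so that context is selected. A real system would also need to handle ties and the detector's false positives.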
Hope you enjoy it :) 
Regards, 
Menglu 
