• Best Paper Award @ ACM Multimedia 2015 (Brisbane) October 31st, 2015 Xavier Alameda-Pineda, Yan Yan, Elisa Ricci, Oswald Lanz and Nicu Sebe I happily announce that we received the Best Paper Award for Analyzing free-standing conversational groups: a multimodal approach at the ACM International Conference on Multimedia 2015 (Brisbane, Australia). Abstract: During natural social gatherings, humans tend to organize themselves in so-called free-standing conversational groups. In ...
  • Best Student Paper Award @ IEEE WASPAA 2015 (New P… October 25th, 2015 I am glad to announce that we got the Best Student Paper Award for A Variational EM Algorithm for the Separation of Moving Sound Sources at the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics 2015 (New Paltz, USA). See the topic’s page.
  • Geometric Sound Source Localization @ EUSIPCO, WASP… October 27th, 2013 Geometric Sound Source Localization @ EUSIPCO, WASPAA & TASLP Xavier Alameda-Pineda and Radu Horaud Keywords: multivariate optimization, hyperbolic constraints, log-barrier interior point, branch & bound. We address the problem of sound-source localization from time-delay estimates using arbitrarily-shaped non-coplanar microphone arrays. A novel geometric formulation is proposed, together with a thorough algebraic analysis and a global optimization solver. The proposed model is thoroughly described and evaluated. ...
  • AV Robot Command recognition @ ICMI & ICASSP May 17th, 2013 Xavier Alameda-Pineda, Jordi Sanchez and Radu Horaud Keywords: audio-visual fusion, discriminative multimodal methods, robot command recognition. We investigated the problem of choosing a classifier for audio-visual command recognition. Because such commands are culture- and user-dependent, methods need to learn new commands from a few examples. We benchmark three state-of-the-art discriminative classifiers based on bag of words ...
  • Audio-visual speaker detection and localization April 27th, 2013 Joint work with a lot of people Keywords: EM algorithm, Model selection, Weighted-data, Audio-visual fusion. Natural human–robot interaction (HRI) in complex and unpredictable environments is important and has many potential applications. While vision-based HRI has been thoroughly investigated, robot hearing and audio-based HRI are emerging research topics in robotics. In typical real-world scenarios, humans are ...
  • MQ Entropy Coder (in Matlab) January 28th, 2013 Matlab implementation of the MQ-coder, the entropy coder used in the image compression standard JPEG2000.
  • The Ravel Dataset December 20th, 2012 Xavier Alameda-Pineda, Jordi Sanchez-Riera, Johannes Wienke, Vojtech Franc, Jan Cech, Kaustubh Kulkarni, Antoine Deleforge, and Radu Horaud Keywords: audio-visual data set, human-robot interaction, natural indoor scene. We introduce Ravel (Robots with Audio-visual Abilities), a publicly available data set which covers examples of Human Robot Interaction (HRI) scenarios. These scenarios are recorded using the audio-visual robot head POPEYE, ...