- The SALSA dataset December 12th, 2015
Xavier Alameda-Pineda, Jacopo Staiano, Ramanathan Subramanian, Ligia Batrinca, Elisa Ricci, Bruno Lepri, Oswald Lanz and Nicu Sebe
Keywords: Multimodal group behavior analysis, Free-standing conversational groups, multimodal social data sets, Tracking, Head and body pose estimation, Personality traits.
Synergetic sociAL Scene Analysis (SALSA) contains uninterrupted recordings of an indoor social event involving 18 subjects over 60 minutes. It serves as a rich ...
- Variational EM and non-convex optimization for mul… December 4th, 2015
Next december 22nd I will be giving a seminar at the Robot-Action-Perception team of LAAS/CNRS, whose abstract reads below.
In this talk I describe the mathematical foundations we used in the recent past to address four different multi–sensor scene analysis tasks, namely: audio-visual speaker detection and localization, separation of moving sound sources, geometric sound source localization ...
- Best Paper Award @ ACM Multimedia 2015 (Brisbane) October 31st, 2015
Xavier Alameda-Pineda, Yan Yan, Elisa Ricci, Oswald Lanz and Nicu Sebe
I happily announce that we received the Best Paper Award for
Analyzing free-standing conversational groups: a multimodal approach
at ACM International Conference in Multimedia 2015 (Brisbane, Australia) .
Abstract During natural social gatherings, humans tend to organize themselves in the so-called free-standing conversational groups. In ...
- Best Student Paper Award @ IEEE WASPAA 2015 (New P… October 25th, 2015
I am glad to announce that we got the Best Student Paper Award for
A Variational EM Algorithm for the Separation of Moving Sound Sources
at IEEE Workshop on Applications of Signal Processing to Audio and Acoustics 2015 (New Paltz, USA). See the topic’s page and .
- Geomtric Sound Source Localization @ EUSIPCO, WASP… October 27th, 2013
Xavier Alameda-Pineda and Radu Horaud
Keywords: multivariate optimization, hyperbolic constraints, log-barrier interior point, branch & bound.
We address the problem of sound-source localization from time-delay estimates using arbitrarily-shaped non-coplanar microphone arrays. A novel geometric formulation is proposed, together with a thorough algebraic analysis and a global optimization solver. The proposed model is thoroughly described and evaluated. ...
- AV Robot Command recognition @ ICMI & ICASSP May 17th, 2013
Xavier Alameda-Pineda, Jordi Sanchez and Radu Horaud
Keywords: audio-visual fusion, discriminative multimodal methods, robot command recognition.
We investigated the problem of choosing a classifier for audio-visual command recognition. Because such commands are culture- and user-dependant, methods need to learn new commands from a few examples. We benchmark three state-of-the-art discriminative classifiers based on bag of words ...
- Audio-visual speaker detection and localization April 27th, 2013
Joint work with a lot of people
Keywords: EM algorithm, Model selection, Weighted-data, Audio-visual fusion.
Natural human–robot interaction (HRI) in complex and unpredictable environments is important with many potential applications. While vision-based HRI has been thoroughly investigated, robot hearing and audio-based HRI are emerging research topics in robotics. In typical real-world scenarios, humans are ...
- MQ Entropy Coder (in Matlab) January 28th, 2013
Matlab implementation of the MQ-coder, the entropy coder used in the image compression standard JPEG2000.
- The Ravel Dataset December 20th, 2012
Xavier Alameda-Pineda, Jordi Sanchez-Riera, Johannes Wienke, Vojtech Franc, Jan Cech, Kaustubh Kulkarni, Antoine Deleforge, and Radu Horaud
Keywords: audio-visual data set, human-robot interaction, natural indoor scene.
We introduce Ravel (Robots with Audio-visual Abilities), a publicly available data set which covers examples of Human Robot Interaction (HRI) scenarios. These scenarios are recorded using the audio-visual robot head POPEYE, ...