2025
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder Proceedings Article
In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2025.
MEGA: Masked Generative Autoencoder for Human Mesh Recovery Proceedings Article
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025.
Diffusion-based Unsupervised Audio-visual Speech Enhancement Proceedings Article
In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2025.
2024
Autoregressive GAN for Semantic Unconditional Head Motion Generation Journal Article
In: ACM Transactions on Multimedia Computing, Communications, and Applications, 2024.
Socially Pertinent Robots in Gerontological Healthcare Unpublished
2024, (Under Review at International Journal of Social Robotics).
Robust Audio-Visual Contrastive Learning for Proposal-based Self-supervised Sound Source Localization in Videos Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 46, pp. 4896–4907, 2024.
Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking Proceedings Article
In: European Conference on Computer Vision, 2024.
A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning Journal Article
In: Neural Networks, 2024.
Unsupervised performance analysis of 3D face alignment with a statistically robust confidence test Journal Article
In: Neurocomputing, vol. 564, 2024, (https://team.inria.fr/robotlearn/upa3dfa/).
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation Unpublished
2024.
Navigating the Practical Pitfalls of Reinforcement Learning for Social Robot Navigation Proceedings Article
In: Robotics: Science and Systems (RSS) Workshop on Unsolved Problems in Social Robot Navigation, 2024.
VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space Proceedings Article
In: European Conference on Computer Vision, 2024.
A weighted-variance variational autoencoder model for speech enhancement Proceedings Article
In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2024.
2023
Variational Meta Reinforcement Learning for Social Robotics Journal Article
In: Applied Intelligence, vol. 53, pp. 27249-27268, 2023.
Learning and controlling the source-filter representation of speech with a variational autoencoder Journal Article
In: Speech Communication, vol. 148, pp. 53-65, 2023, (https://samsad35.github.io/site-sfvae/).
Successor Feature Representations Journal Article
In: Transactions on Machine Learning Research, 2023.
Univariate Radial Basis Function Layers: Brain-inspired Deep Neural Layers for Low-Dimensional Inputs Miscellaneous
2023.
On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers Proceedings Article
In: International Conference on Computer Vision Workshops, 2023.
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation Journal Article
In: Transactions on Machine Learning Research, 2023.
Unsupervised speech enhancement with deep dynamical generative speech and noise models Proceedings Article
In: Interspeech, pp. 5102-5106, 2023.
Motion-DVAE: Unsupervised learning for fast human motion denoising Proceedings Article
In: ACM SIGGRAPH Conference on Motion, Interaction and Games, 2023.
Semi-supervised learning made simple with self-supervised clustering Proceedings Article
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3187-3197, 2023.
Speech Modeling with a Hierarchical Transformer Dynamical VAE Proceedings Article
In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2023.
Expression-preserving face frontalization improves visually assisted speech processing Journal Article
In: International Journal of Computer Vision, vol. 131, iss. 5, pp. 1122-1140, 2023.
Back to MLP: A Simple Baseline for Human Motion Prediction Proceedings Article
In: IEEE Winter Conference on Applications of Computer Vision, pp. 4809-4819, 2023.
2022
SocialInteractionGAN: Multi-person Interaction Sequence Generation Journal Article
In: IEEE Transactions on Affective Computing, 2022.
Uncertainty-aware Contrastive Distillation for Incremental Semantic Segmentation Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
Continual Attentive Fusion for Incremental Learning in Semantic Segmentation Journal Article
In: IEEE Transactions on Multimedia, vol. 25, pp. 3841-3854, 2022.
TransCenter: Transformers with Dense Queries for Multiple-Object Tracking Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
M4MM'22: 1st International Workshop on Methodologies for Multimedia Proceedings Article
In: ACM International Conference on Multimedia, Lisbon, Portugal, 2022.
Active Contrastive Set Mining for Robust Audio-Visual Instance Discrimination Proceedings Article
In: International Joint Conference on Artificial Intelligence, 2022.
A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos Proceedings Article
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE Unpublished
2022.
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders Journal Article
In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022.
Self-supervised models are continual learners Proceedings Article
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9621–9630, 2022.
Dynamical Variational Autoencoders: A Comprehensive Review Journal Article
In: Foundations and Trends in Machine Learning, vol. 15, no. 1-2, 2022.
Les auto-encodeurs variationnels dynamiques et leur application à la modélisation de spectrogrammes de parole Proceedings Article
In: XXXIVe Journées d'Études sur la Parole, 2022.
Multi-Person Extreme Motion Prediction with Cross-Interaction Attention Proceedings Article
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
The impact of removing head movements on audio-visual speech enhancement Proceedings Article
In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 7302–7306, 2022.
2021
Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement Proceedings Article
In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2021.
Variational Structured Attention Networks for Deep Visual Representation Learning Unpublished
2021.
Deep Variational Generative Models for Audio-visual Speech Separation Proceedings Article
In: IEEE Workshop on Machine Learning for Signal Processing, Queensland, Australia, 2021.
PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation Proceedings Article
In: IEEE Winter Conference on Applications of Computer Vision, 2021.
Successor Feature Neural Episodic Control Proceedings Article
In: Fifth Workshop on Meta-Learning at the Conference on Neural Information Processing Systems, 2021.
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling Proceedings Article
In: ISCA Interspeech, 2021.
Variational Inference and Learning of Piecewise-linear Dynamical Systems Journal Article
In: IEEE Transactions on Neural Networks and Learning Systems, 2021.
2020
CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentification Proceedings Article
In: IEEE International Conference on Pattern Recognition, 2020.
Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 5, pp. 1761-1776, 2020.
Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders Proceedings Article
In: IEEE International Conference on Acoustics, Speech and Signal Processing, Barcelona, Spain, 2020.
Probabilistic Graph Attention Network with Conditional Kernels for Pixel-Wise Prediction Journal Article
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020.