Publications

2024

Shashi Kumar, Srikanth Madikeri, Iuliia Nigmatulina, Esaú Villatoro-Tello, Petr Motlicek, Karthik Pandia D S, S. Pavankumar Dubagunta, and Aravind Ganapathiraju, “Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers,” in Proceedings of ICASSP. [DOI]

2023

Lokesh Bansal, S. Pavankumar Dubagunta, Malolan Chetlur, Pushpak Jagtap, and Aravind Ganapathiraju, “On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition,” in Proceedings of Interspeech. [PDF]

Tilak Purohit, Sarthak Yadav, Bogdan Vlasenko, S. Pavankumar Dubagunta, and Mathew Magimai Doss, “Towards Learning Emotion Information from Short Segments of Speech,” in Proceedings of ICASSP. [PDF]

2022

S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos, and Mathew Magimai Doss, “Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings,” in Companion proceedings of the International Conference on Multimodal Interaction (ICMI). [DOI]

S. Pavankumar Dubagunta, Rob J. J. H. van Son, and Mathew Magimai Doss, “Adjustable deterministic pseudonymization of speech,” in Computer Speech and Language. [DOI]

2021

S. Pavankumar Dubagunta, “Novel Methods for Incorporating Prior Knowledge for Automatic Speech Assessment,” PhD thesis, École polytechnique fédérale de Lausanne (EPFL). [PDF] [Talk]

Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek, and Mathew Magimai Doss, “Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition,” in Proceedings of Interspeech. [PDF]

2020

S. Pavankumar Dubagunta, Rob J. J. H. van Son, and Mathew Magimai Doss, “Adjustable deterministic pseudonymization of speech: Idiap-NKI’s submission to VoicePrivacy 2020 challenge,” peer-reviewed at the 2020 VoicePrivacy challenge. [PDF] [Talk]

Julian Fritsch, S. Pavankumar Dubagunta, and Mathew Magimai Doss, “Estimating the degree of sleepiness by integrating articulatory feature knowledge in raw waveform based CNNs,” in Proceedings of ICASSP. [PDF]

Alejandro Gomez-Alanis, Jose A Gonzalez-Lopez, S. Pavankumar Dubagunta, Antonio M Peinado, and Mathew Magimai Doss, “On joint optimization of automatic speaker verification and antispoofing in the embedding space,” in IEEE Transactions on Information Forensics and Security. [PDF]

2019

S. Pavankumar Dubagunta and Mathew Magimai Doss, “Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN speech recognition,” in Proceedings of ICASSP. [PDF]

S. Pavankumar Dubagunta, Selen Hande Kabil, and Mathew Magimai Doss, “Improving children speech recognition through feature learning from raw speech signal,” in Proceedings of ICASSP. [PDF]

S. Pavankumar Dubagunta, Bogdan Vlasenko, and Mathew Magimai Doss, “Learning voice source related information for depression detection,” in Proceedings of ICASSP. [PDF]

S. Pavankumar Dubagunta and Mathew Magimai Doss, “Using speech production knowledge for raw waveform modelling based Styrian dialect identification,” in Proceedings of Interspeech. [PDF]

Vinayak Abrol, S. Pavankumar Dubagunta, and Mathew Magimai Doss, “Understanding raw waveform based CNN through low-rank spectro-temporal decoupling,” technical report peer-reviewed and presented at the Swiss Machine Learning Day. [PDF]

2018

Bogdan Vlasenko, Jilt Sebastian, S. Pavankumar Dubagunta, and Mathew Magimai Doss, “Implementing fusion techniques for the classification of paralinguistic information,” in Proceedings of Interspeech. [PDF]

Jilt Sebastian, Manoj Kumar, S. Pavankumar Dubagunta, Mathew Magimai Doss, Hema A Murthy, and Shrikanth Narayanan, “Denoising and raw-waveform networks for weakly-supervised gender identification on noisy speech,” in Proceedings of Interspeech. [PDF]

2016

Tejas Godambe, Naresh Kumar, S. Pavankumar Dubagunta, Veera Raghavendra, and Aravind Ganapathiraju, “ININ Submission to Zero Cost ASR Task at MediaEval 2016,” in Proceedings of MediaEval. [PDF]

2013

S. Pavankumar Dubagunta, “Feature normalisation for robust speech recognition,” Master's thesis, Indian Institute of Technology Madras. [PDF]

S. Pavankumar Dubagunta, N. Vishnu Prasad, Vikas Joshi, and Umesh Srinivasan, “Modified SPLICE and its extension to non-stereo data for noise robust speech recognition,” in Proceedings of ASRU. [PDF] [DOI]

S. Pavankumar Dubagunta, Raghavendra R. Bilgi, and Umesh Srinivasan, “Non-negative subspace projection during conventional MFCC feature extraction for noise robust speech recognition,” in Proceedings of NCC. [DOI]