Uniphore • Staff AI Scientist[2021 - 2025] Technical and team leadership, focusing on end-to-end speech recognition, personalization, and related areas for conversational AI.
Google • Research Intern[2020]
Worked on a research project with the audio processing team.
Idiap Research Institute • Research Assistant[2017 - 2021]
Developed a PhD thesis on automatic speech assessment and recognition.
Interactive Intelligence • Senior Speech Engineer[2015 - 2017]
Built production-grade ASR models in multiple languages, enhancing acoustic modelling to improve efficacy, while reducing systematic ASR errors.
Samsung R&D Institute India • Lead Engineer[2013 - 2015]
Developed robust feature extraction techniques and implemented data selection methods, and improved acoustic models across multiple languages.
Shashi Kumar, Srikanth Madikeri, Nigmatulina Iuliia, Esaú Villatoro-Tello, Petr Motlicek, Karthik Pandia D S, S. Pavankumar Dubagunta and Aravind Ganapathiraju, “Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers,” in Proceedings of ICASSP. [DOI]
2023
Lokesh Bansal, S. Pavankumar Dubagunta, Malolan Chetlur, Pushpak Jagtap, and Aravind Ganapathiraju, “On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition,” in Proceedings of Interspeech. [PDF]
Tilak Purohit, Sarthak Yadav, Bogdan Vlasenko, S. Pavankumar Dubagunta, and Mathew Magimai Doss, “Towards Learning Emotion Information from Short Segments of Speech,” in Proceedings of ICASSP. [PDF]
2022
S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos, and Mathew Magimai Doss, “Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings,” in Companion proceedings of the International Conference on Multimodal Interaction (ICMI). [DOI]
S. Pavankumar Dubagunta, Rob J. J. H. van Son, and Mathew Magimai Doss, “Adjustable deterministic pseudonymization of speech,” in Computer Speech and Language. [DOI]
2021
S. Pavankumar Dubagunta, “Novel Methods for Incorporating Prior Knowledge for Automatic Speech Assessment,” PhD thesis, École polytechnique fédérale de Lausanne (EPFL). [PDF] [Talk]