Follow
Aparna Khare
Title
Cited by
Cited by
Year
Multi-task learning and weighted cross-entropy for DNN-based keyword spotting
S Panchapagesan, M Sun, A Khare, S Matsoukas, A Mandal, ...
1692016
Using system command utterances to generate a speaker profile
V Krishnamoorthy, S Srinivasan, S Matsoukas, A Khare, A Mandal, ...
US Patent 10,490,195, 2019
492019
Self-supervised learning with cross-modal transformers for emotion recognition
A Khare, S Parthasarathy, S Sundaram
2021 IEEE Spoken Language Technology Workshop (SLT), 381-388, 2021
482021
Multiresolution and multimodal speech recognition with transformers
G Paraskevopoulos, S Parthasarathy, A Khare, S Sundaram
arXiv preprint arXiv:2004.14840, 2020
432020
Keyword spotting using multi-task configuration
S Panchapagesan, B Hoffmeister, A Mandal, A Khare, SNP Vitaladevuni, ...
US Patent 10,304,440, 2019
292019
Speech based user recognition
S Matsoukas, A Khare, V Krishnamoorthy, S Somashekar, A Mandal
US Patent 10,522,134, 2019
212019
Multi-modal embeddings using multi-task learning for emotion recognition
A Khare, S Parthasarathy, S Sundaram
Interspeech 2020, 384-388, 2020
192020
Asr-aware end-to-end neural diarization
A Khare, E Han, Y Yang, A Stolcke
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
142022
Method and apparatus for discovering and labeling speakers in a large and growing collection of videos with minimal user effort
S Kajarekar, A Sankar, S Gannu, A Khare
US Patent App. 13/312,800, 2013
92013
Automatic collection of speaker name pronunciations
A Khare, N Agrawal, SS Kajarekar, M Paulik
US Patent 9,240,181, 2016
72016
Audiovisual highlight detection in videos
K Mundnich, A Fenster, A Khare, S Sundaram
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
52021
Voice profile updating
S Srinivasan, A Mandal, K Subramanian, S Matsoukas, A Khare, ...
US Patent 11,004,454, 2021
52021
Voice profile updating
S Srinivasan, A Mandal, K Subramanian, S Matsoukas, A Khare, ...
US Patent 11,200,884, 2021
42021
Speech based user recognition
S Matsoukas, A Khare, V Krishnamoorthy, S Somashekar, A Mandal
US Patent 11,270,685, 2022
32022
Multi-channel acoustic modeling using mixed bitrate Opus compression
A Khare, S Sundaram, M Wu
arXiv preprint arXiv:2002.00122, 2020
32020
Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
S Wager, A Khare, M Wu, K Kumatani, S Sundaram
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020
22020
Multi-stage multi-modal pre-training for automatic speech recognition
Y Jain, D Chan, P Dheram, A Khare, O Shonibare, V Ravichandran, ...
arXiv preprint arXiv:2403.19822, 2024
2024
Speech based user recognition
SS Kopuri, J Moore, S Srinivasan, A Khare, A Mandal, S Matsoukas, ...
US Patent 11,893,999, 2024
2024
Turn-taking and backchannel prediction with acoustic and large language model fusion
J Wang, L Chen, A Khare, A Raju, P Dheram, D He, M Wu, A Stolcke, ...
arXiv preprint arXiv:2401.14717, 2024
2024
Two-pass endpoint detection for speech recognition
A Raju, A Khare, D He, I Sklyar, L Chen, S Alptekin, VA Trinh, Z Zhang, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–20