Showing 1 - 4 of 4 for search: '"Buddi, Sai Srujana"'
Authors:
Kumar, Satyam, Buddi, Sai Srujana, Sarawgi, Utkarsh Oggy, Garg, Vineet, Ranjan, Shivesh, Rudovic, Ognjen, Abdelaziz, Ahmed Hussen, Adya, Saurabh
Voice activity detection (VAD) is a critical component in various applications such as speech recognition, speech enhancement, and hands-free communication systems. With the increasing demand for personalized and context-aware technologies, the need
External link:
http://arxiv.org/abs/2406.09443
Authors:
Sarawgi, Utkarsh Oggy, Berkowitz, John, Garg, Vineet, Kundu, Arnav, Cho, Minsik, Buddi, Sai Srujana, Adya, Saurabh, Tewfik, Ahmed
Published in:
In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6110-6114). IEEE
Streaming neural network models for fast frame-wise responses to various speech and sensory signals are widely adopted on resource-constrained platforms. Hence, increasing the learning capacity of such streaming models (i.e., by adding more parameter
External link:
http://arxiv.org/abs/2310.05886
Authors:
Buddi, Sai Srujana, Sarawgi, Utkarsh Oggy, Heeramun, Tashweena, Sawnhey, Karan, Yanosik, Ed, Rathinam, Saravana, Adya, Saurabh
The adoption of multimodal interactions by Voice Assistants (VAs) is growing rapidly to enhance human-computer interactions. Smartwatches have now incorporated trigger-less methods of invoking VAs, such as Raise To Speak (RTS), where the user raises
External link:
http://arxiv.org/abs/2305.12063
Authors:
Lee, Isabelle G., Zu, Vera, Buddi, Sai Srujana, Liang, Dennis, Kulkarni, Purva, Fitzgerald, Jack G. M.
Virtual Assistants can be quite literal at times. If the user says "tell Bob I love him," most virtual assistants will extract the message "I love him" and send it to the user's contact named Bob, rather than properly converting the message to "I lov
External link:
http://arxiv.org/abs/2010.02600