Zobrazeno 1 - 10
of 1 578
pro vyhledávání: '"HUSSAIN, Amir"'
Autor:
Jain, Arnav, Sanjotra, Jasmer Singh, Choudhary, Harshvardhan, Agrawal, Krish, Shah, Rupal, Jha, Rohan, Sajid, M., Hussain, Amir, Tanveer, M.
Publikováno v:
INTERSPEECH 2024
In this paper, we propose long short term memory speech enhancement network (LSTMSE-Net), an audio-visual speech enhancement (AVSE) method. This innovative method leverages the complementary nature of visual and audio information to boost the quality
Externí odkaz:
http://arxiv.org/abs/2409.02266
Foundational Large Language Models (LLMs) have changed the way we perceive technology. They have been shown to excel in tasks ranging from poem writing and coding to essay generation and puzzle solving. With the incorporation of image generation capa
Externí odkaz:
http://arxiv.org/abs/2409.00105
Autor:
Cresswell, Kathrin, Tahir, Ahsen, Sheikh, Zakariya, Hussain, Zain, Domínguez Hernández, Andrés, Harrison, Ewen, Williams, Robin, Sheikh, Aziz, Hussain, Amir
Publikováno v:
Journal of Medical Internet Research, Vol 23, Iss 5, p e26618 (2021)
BackgroundThe emergence of SARS-CoV-2 in late 2019 and its subsequent spread worldwide continues to be a global health crisis. Many governments consider contact tracing of citizens through apps installed on mobile phones as a key mechanism to contain
Externí odkaz:
https://doaj.org/article/a1a274e806f643779e4ba385c54790ed
Autor:
Hussain, Amir, Tahir, Ahsen, Hussain, Zain, Sheikh, Zakariya, Gogate, Mandar, Dashtipour, Kia, Ali, Azhar, Sheikh, Aziz
Publikováno v:
Journal of Medical Internet Research, Vol 23, Iss 4, p e26627 (2021)
BackgroundGlobal efforts toward the development and deployment of a vaccine for COVID-19 are rapidly advancing. To achieve herd immunity, widespread administration of vaccines is required, which necessitates significant cooperation from the general p
Externí odkaz:
https://doaj.org/article/e876c27ce10f4b48acd56d2a678281b9
In real-world environments, background noise significantly degrades the intelligibility and clarity of human speech. Audio-visual speech enhancement (AVSE) attempts to restore speech quality, but existing methods often fall short, particularly in dyn
Externí odkaz:
http://arxiv.org/abs/2402.16394
Autor:
Kirton-Wingate, Jasper, Ahmed, Shafique, Hussain, Adeel, Gogate, Mandar, Dashtipour, Kia, Hou, Jen-Cheng, Hussain, Tassadaq, Tsao, Yu, Hussain, Amir
Since the advent of Deep Learning (DL), Speech Enhancement (SE) models have performed well under a variety of noise conditions. However, such systems may still introduce sonic artefacts, sound unnatural, and restrict the ability for a user to hear am
Externí odkaz:
http://arxiv.org/abs/2402.16757
Autor:
Zhao, Weiguang, Yang, Guanyu, Zhang, Rui, Jiang, Chenru, Yang, Chaolong, Yan, Yuyao, Hussain, Amir, Huang, Kaizhu
With the explosive 3D data growth, the urgency of utilizing zero-shot learning to facilitate data labeling becomes evident. Recently, methods transferring language or language-image pre-training models like Contrastive Language-Image Pre-training (CL
Externí odkaz:
http://arxiv.org/abs/2312.07039
Autor:
Ahmed, Shafique, Chen, Chia-Wei, Ren, Wenze, Li, Chin-Jou, Chu, Ernie, Chen, Jun-Cheng, Hussain, Amir, Wang, Hsin-Min, Tsao, Yu, Hou, Jen-Cheng
Recent studies have increasingly acknowledged the advantages of incorporating visual data into speech enhancement (SE) systems. In this paper, we introduce a novel audio-visual SE approach, termed DCUC-Net (deep complex U-Net with conformer network).
Externí odkaz:
http://arxiv.org/abs/2309.11059
Autor:
Chamola, Vinay, Bansal, Gaurang, Das, Tridib Kumar, Hassija, Vikas, Reddy, Naga Siva Sai, Wang, Jiacheng, Zeadally, Sherali, Hussain, Amir, Yu, F. Richard, Guizani, Mohsen, Niyato, Dusit
Imagine stepping into a virtual world that's as rich, dynamic, and interactive as our physical one. This is the promise of the Metaverse, and it's being brought to life by the transformative power of Generative Artificial Intelligence (AI). This pape
Externí odkaz:
http://arxiv.org/abs/2308.06272
Individuals with hearing impairments face challenges in their ability to comprehend speech, particularly in noisy environments. The aim of this study is to explore the effectiveness of audio-visual speech enhancement (AVSE) in enhancing the intelligi
Externí odkaz:
http://arxiv.org/abs/2307.07748