Výsledky vyhledávání - "HUSSAIN, Amir"

Report

LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement

Autor: Jain, Arnav, Sanjotra, Jasmer Singh, Choudhary, Harshvardhan, Agrawal, Krish, Shah, Rupal, Jha, Rohan, Sajid, M., Hussain, Amir, Tanveer, M.

Publikováno v: INTERSPEECH 2024

In this paper, we propose long short term memory speech enhancement network (LSTMSE-Net), an audio-visual speech enhancement (AVSE) method. This innovative method leverages the complementary nature of visual and audio information to boost the quality

Externí odkaz: http://arxiv.org/abs/2409.02266

Zobrazit plný text záznamu

Report

Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation

Autor: Nadeem, Mohammad, Sohail, Shahab Saquib, Cambria, Erik, Schuller, Björn W., Hussain, Amir

Foundational Large Language Models (LLMs) have changed the way we perceive technology. They have been shown to excel in tasks ranging from poem writing and coding to essay generation and puzzle solving. With the incorporation of image generation capa

Externí odkaz: http://arxiv.org/abs/2409.00105

Zobrazit plný text záznamu

Akademický článek

Understanding Public Perceptions of COVID-19 Contact Tracing Apps: Artificial Intelligence–Enabled Social Media Analysis

Autor: Cresswell, Kathrin, Tahir, Ahsen, Sheikh, Zakariya, Hussain, Zain, Domínguez Hernández, Andrés, Harrison, Ewen, Williams, Robin, Sheikh, Aziz, Hussain, Amir

Publikováno v: Journal of Medical Internet Research, Vol 23, Iss 5, p e26618 (2021)

BackgroundThe emergence of SARS-CoV-2 in late 2019 and its subsequent spread worldwide continues to be a global health crisis. Many governments consider contact tracing of citizens through apps installed on mobile phones as a key mechanism to contain

Externí odkaz: https://doaj.org/article/a1a274e806f643779e4ba385c54790ed

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Artificial Intelligence–Enabled Analysis of Public Attitudes on Facebook and Twitter Toward COVID-19 Vaccines in the United Kingdom and the United States: Observational Study

Autor: Hussain, Amir, Tahir, Ahsen, Hussain, Zain, Sheikh, Zakariya, Gogate, Mandar, Dashtipour, Kia, Ali, Azhar, Sheikh, Aziz

Publikováno v: Journal of Medical Internet Research, Vol 23, Iss 4, p e26627 (2021)

BackgroundGlobal efforts toward the development and deployment of a vaccine for COVID-19 are rapidly advancing. To achieve herd immunity, widespread administration of vaccines is required, which necessitates significant cooperation from the general p

Externí odkaz: https://doaj.org/article/e876c27ce10f4b48acd56d2a678281b9

Zobrazit plný text záznamu

Plný text ve formátu HTML

Report

Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues

Autor: Hussain, Tassadaq, Dashtipour, Kia, Tsao, Yu, Hussain, Amir

In real-world environments, background noise significantly degrades the intelligibility and clarity of human speech. Audio-visual speech enhancement (AVSE) attempts to restore speech quality, but existing methods often fall short, particularly in dyn

Externí odkaz: http://arxiv.org/abs/2402.16394

Zobrazit plný text záznamu

Report

Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids

Autor: Kirton-Wingate, Jasper, Ahmed, Shafique, Hussain, Adeel, Gogate, Mandar, Dashtipour, Kia, Hou, Jen-Cheng, Hussain, Tassadaq, Tsao, Yu, Hussain, Amir

Since the advent of Deep Learning (DL), Speech Enhancement (SE) models have performed well under a variety of noise conditions. However, such systems may still introduce sonic artefacts, sound unnatural, and restrict the ability for a user to hear am

Externí odkaz: http://arxiv.org/abs/2402.16757

Zobrazit plný text záznamu

Report

Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges

Autor: Zhao, Weiguang, Yang, Guanyu, Zhang, Rui, Jiang, Chenru, Yang, Chaolong, Yan, Yuyao, Hussain, Amir, Huang, Kaizhu

With the explosive 3D data growth, the urgency of utilizing zero-shot learning to facilitate data labeling becomes evident. Recently, methods transferring language or language-image pre-training models like Contrastive Language-Image Pre-training (CL

Externí odkaz: http://arxiv.org/abs/2312.07039

Zobrazit plný text záznamu

Report

Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement

Autor: Ahmed, Shafique, Chen, Chia-Wei, Ren, Wenze, Li, Chin-Jou, Chu, Ernie, Chen, Jun-Cheng, Hussain, Amir, Wang, Hsin-Min, Tsao, Yu, Hou, Jen-Cheng

Recent studies have increasingly acknowledged the advantages of incorporating visual data into speech enhancement (SE) systems. In this paper, we introduce a novel audio-visual SE approach, termed DCUC-Net (deep complex U-Net with conformer network).

Externí odkaz: http://arxiv.org/abs/2309.11059

Zobrazit plný text záznamu

Report

Beyond Reality: The Pivotal Role of Generative AI in the Metaverse

Autor: Chamola, Vinay, Bansal, Gaurang, Das, Tridib Kumar, Hassija, Vikas, Reddy, Naga Siva Sai, Wang, Jiacheng, Zeadally, Sherali, Hussain, Amir, Yu, F. Richard, Guizani, Mohsen, Niyato, Dusit

Imagine stepping into a virtual world that's as rich, dynamic, and interactive as our physical one. This is the promise of the Metaverse, and it's being brought to life by the transformative power of Generative Artificial Intelligence (AI). This pape

Externí odkaz: http://arxiv.org/abs/2308.06272

Zobrazit plný text záznamu

Report

Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations

Autor: Lai, Richard Lee, Hou, Jen-Cheng, Gogate, Mandar, Dashtipour, Kia, Hussain, Amir, Tsao, Yu

Individuals with hearing impairments face challenges in their ability to comprehend speech, particularly in noisy environments. The aim of this study is to explore the effectiveness of audio-visual speech enhancement (AVSE) in enhancing the intelligi

Externí odkaz: http://arxiv.org/abs/2307.07748

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání