Showing 1 - 10 of 29 for search: '"Abdulatif, Sherif"'
The rapid evolution of deep learning and its integration with autonomous driving systems have led to substantial advancements in 3D perception using multimodal sensors. Notably, radar sensors show greater robustness compared to cameras and lidar unde
External link:
http://arxiv.org/abs/2408.06772
Published in:
IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 2477-2493, 2024
In this work, we further develop the conformer-based metric generative adversarial network (CMGAN) model for speech enhancement (SE) in the time-frequency (TF) domain. This paper builds on our previous work but takes a more in-depth look by conductin
External link:
http://arxiv.org/abs/2209.11112
Published in:
Proceedings of INTERSPEECH, 2022, pp. 936-940
Recently, convolution-augmented transformer (Conformer) has achieved promising performance in automatic speech recognition (ASR) and time-domain speech enhancement (SE), as it can capture both local and global dependencies in the speech signal. In th
External link:
http://arxiv.org/abs/2203.15149
Radar for deep learning-based human identification has become a research area of increasing interest. It has been shown that micro-Doppler ($\mu$-D) can reflect the walking behavior through capturing the periodic limbs' micro-motions. One of the main
External link:
http://arxiv.org/abs/2110.08595
Age is an essential factor in modern diagnostic procedures. However, assessment of the true biological age (BA) remains a daunting task due to the lack of reference ground-truth labels. Current BA estimation approaches are either restricted to skelet
External link:
http://arxiv.org/abs/2103.08491
Recent years have seen a surge in the number of available frameworks for speech enhancement (SE) and recognition. Whether model-based or constructed via deep learning, these frameworks often rely in isolation on either time-domain signals or time-fre
External link:
http://arxiv.org/abs/2010.10468
Author:
Armanious, Karim, Abdulatif, Sherif, Shi, Wenbin, Salian, Shashank, Küstner, Thomas, Weiskopf, Daniel, Hepp, Tobias, Gatidis, Sergios, Yang, Bin
The concept of biological age (BA), although important in clinical practice, is hard to grasp mainly due to the lack of a clearly defined reference standard. For specific applications, especially in pediatrics, medical image data are used for BA esti
External link:
http://arxiv.org/abs/2009.10765
The understanding of the surrounding environment plays a critical role in autonomous robotic systems, such as self-driving cars. Extensive research has been carried out concerning visual perception. Yet, to obtain a more complete perception of the en
External link:
http://arxiv.org/abs/2003.01609
Author:
Armanious, Karim, Kumar, Vijeth, Abdulatif, Sherif, Hepp, Tobias, Gatidis, Sergios, Yang, Bin
Local deformations in medical modalities are common phenomena due to a multitude of factors such as metallic implants or limited field of views in magnetic resonance imaging (MRI). Completion of the missing or distorted regions is of special interest
External link:
http://arxiv.org/abs/1910.09230
Automatic speech recognition (ASR) systems are of vital importance nowadays in commonplace tasks such as speech-to-text processing and language translation. This created the need for an ASR system that can operate in realistic crowded environments. T
External link:
http://arxiv.org/abs/1910.12620