Search results - "Recognition system"

Report

Design and Development of Laughter Recognition System Based on Multimodal Fusion and Deep Learning

Authors: Zhao, Fuzheng; Bai, Yu

This study aims to design and implement a laughter recognition system based on multimodal fusion and deep learning, leveraging image and audio processing technologies to achieve accurate laughter recognition and emotion analysis. First, the system lo

External link: http://arxiv.org/abs/2407.21391

Report

Novel Human Machine Interface via Robust Hand Gesture Recognition System using Channel Pruned YOLOv5s Model

Authors: Sen, Abir; Mishra, Tapas Kumar; Dash, Ratnakar

Hand gesture recognition (HGR) is a vital component in enhancing the human-computer interaction experience, particularly in multimedia applications, such as virtual reality, gaming, smart home automation systems, etc. Users can control and navigate t

External link: http://arxiv.org/abs/2407.02585

Report

Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System

Authors: Meng, Lingwei; Kang, Jiawen; Wang, Yuejiao; Jin, Zengrui; Wu, Xixin; Liu, Xunying; Meng, Helen

Multi-talker speech recognition and target-talker speech recognition, both of which involve transcription in multi-talker contexts, remain significant challenges. However, existing methods rarely attempt to simultaneously address both tasks. In this study, we

External link: http://arxiv.org/abs/2407.09817

Report

A Streaming Multi-Channel End-to-End Speech Recognition System with Realistic Evaluations

Authors: Kong, Xiangzhu; Ning, Tianqi; Huang, Hao; Ou, Zhijian

Recently multi-channel end-to-end (ME2E) ASR systems have emerged. While streaming single-channel end-to-end ASR has been extensively studied, streaming ME2E ASR is limited in exploration. Additionally, recent studies call attention to the gap betwee

External link: http://arxiv.org/abs/2407.09807

Report

GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text Recognition System

Authors: Kumari, Lalita; Singh, Sukhdeep; Rathore, Vaibhav Varish Singh; Sharma, Anuj

The Handwritten Text Recognition problem has been a challenge for researchers for the last few decades, especially in the domain of computer vision, a subdomain of pattern recognition. Variability of texts amongst writers, cursiveness, and different

External link: http://arxiv.org/abs/2404.14062

Report

Voice-Assisted Real-Time Traffic Sign Recognition System Using Convolutional Neural Network

Authors: Manawadu, Mayura; Wijenayake, Udaya

Traffic signs are important in communicating information to drivers. Thus, comprehension of traffic signs is essential for road safety and ignorance may result in road accidents. Traffic sign detection has been a research spotlight over the past few

External link: http://arxiv.org/abs/2404.07807

Academic article

Optical Recognition System of Non-Dotted Arabic Letters.

Authors: Kadhim, Ahlam M. (ahlammjeed@yahoo.com); Jawad, Huda M.; Kadhum, Farah Jawad; Al-Zuky, Ali A.

Published in: Iraqi Journal of Science. 2024, Vol. 65, Issue 5, pp. 2749-2760. 12 pp.

Report

Unveiling Social Media Comments with a Novel Named Entity Recognition System for Identity Groups

Authors: Carvallo, Andrés; Quiroga, Tamara; Aspillaga, Carlos; Mendoza, Marcelo

While civilized users employ social media to stay informed and discuss daily occurrences, haters perceive these platforms as fertile ground for attacking groups and individuals. The prevailing approach to counter this phenomenon involves detecting su

External link: http://arxiv.org/abs/2405.13011

Report

The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge

Authors: Tian, Jingguang; Ye, Shuaishuai; Chen, Shunfei; Xiang, Yang; Yin, Zhaohui; Hu, Xinhui; Xu, Xinkang

This paper presents our system submission for the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) Challenge, which focuses on speaker diarization and speech recognition in complex multi-speaker scenarios. To address these challenges, we

External link: http://arxiv.org/abs/2405.05498

Report

Automatic Speech Recognition System-Independent Word Error Rate Estimation

Authors: Park, Chanho; Chen, Mingjie; Hain, Thomas

Word error rate (WER) is a metric used to evaluate the quality of transcriptions produced by Automatic Speech Recognition (ASR) systems. In many applications, it is of interest to estimate WER given a pair of a speech utterance and a transcript. Prev

External link: http://arxiv.org/abs/2404.16743
