Zobrazeno 1 - 10
of 1 896
pro vyhledávání: '"PAN, Gang"'
Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model
Spiking Neural Networks (SNNs), renowned for their low power consumption, brain-inspired architecture, and spatio-temporal representation capabilities, have garnered considerable attention in recent years. Similar to Artificial Neural Networks (ANNs)
Externí odkaz:
http://arxiv.org/abs/2410.15689
Existing large pre-trained models typically map text input to text output in an end-to-end manner, such as ChatGPT, or map a segment of text input to a hierarchy of action decisions, such as OpenVLA. However, humans can simultaneously generate text a
Externí odkaz:
http://arxiv.org/abs/2410.15885
Autor:
Jiang, Yi, Shen, Qingyang, Lai, Shuzhong, Qi, Shunyu, Zheng, Qian, Yao, Lin, Wang, Yueming, Pan, Gang
Autism spectrum disorder(ASD) is a pervasive developmental disorder that significantly impacts the daily functioning and social participation of individuals. Despite the abundance of research focused on supporting the clinical diagnosis of ASD, there
Externí odkaz:
http://arxiv.org/abs/2410.05684
3D Gaussian Splatting is capable of reconstructing 3D scenes in minutes. Despite recent advances in improving surface reconstruction accuracy, the reconstructed results still exhibit bias and suffer from inefficiency in storage and training. This pap
Externí odkaz:
http://arxiv.org/abs/2410.07266
Humans naturally perform audiovisual speech recognition (AVSR), enhancing the accuracy and robustness by integrating auditory and visual information. Spiking neural networks (SNNs), which mimic the brain's information-processing mechanisms, are well-
Externí odkaz:
http://arxiv.org/abs/2408.16564
Deep learning has revolutionized artificial intelligence (AI), achieving remarkable progress in fields such as computer vision, speech recognition, and natural language processing. Moreover, the recent success of large language models (LLMs) has fuel
Externí odkaz:
http://arxiv.org/abs/2409.02111
Autor:
Chen, Zhuo, Ma, De, Jin, Xiaofei, Xing, Qinghui, Jin, Ouwen, Du, Xin, He, Shuibing, Pan, Gang
Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy o
Externí odkaz:
http://arxiv.org/abs/2407.20947
The Quick-view (QV) technique serves as a primary method for detecting defects within sewerage systems. However, the effectiveness of QV is impeded by the limited visual range of its hardware, resulting in suboptimal image quality for distant portion
Externí odkaz:
http://arxiv.org/abs/2407.19271
In sewer pipe Closed-Circuit Television (CCTV) inspection, accurate temporal defect localization is essential for effective defect classification, detection, segmentation and quantification. Industry standards typically do not require time-interval a
Externí odkaz:
http://arxiv.org/abs/2407.15170
Grounded Multimodal Named Entity Recognition (GMNER) task aims to identify named entities, entity types and their corresponding visual regions. GMNER task exhibits two challenging attributes: 1) The tenuous correlation between images and text on soci
Externí odkaz:
http://arxiv.org/abs/2406.07268