Showing 1 - 10 of 46 for search: '"Runwei, Ding"'
Published in:
CAAI Transactions on Intelligence Technology, Vol 9, Iss 5, Pp 1078-1091 (2024)
Abstract Underwater object detection (UOD) is crucial for marine economic development, environmental protection, and the planet's sustainable development. The main challenges of this task arise from low‐contrast, small objects, and mimicry of aquat
External link:
https://doaj.org/article/3c5bb101874a4cf2a2878d7c0aeeb634
Published in:
Cyborg and Bionic Systems, Vol 5 (2024)
Three-dimensional skeleton-based action recognition (3D SAR) has gained important attention within the computer vision community, owing to the inherent advantages offered by skeleton data. As a result, a plethora of impressive works, including those
External link:
https://doaj.org/article/e5173812d9b04116a5265366b963e446
Published in:
CAAI Transactions on Intelligence Technology, Vol 7, Iss 3, Pp 446-454 (2022)
Abstract This article proposes a deep neural network (DNN)‐based direct‐path relative transfer function (DP‐RTF) enhancement method for robust direction of arrival (DOA) estimation in noisy and reverberant environments. The DP‐RTF refers to t
External link:
https://doaj.org/article/2131e26277fa4547bf556b4f7244c429
Published in:
ACM Transactions on Multimedia Computing, Communications & Applications; Apr2024, Vol. 20 Issue 4, p1-20, 20p
Published in:
IEEE Access, Vol 7, Pp 5597-5608 (2019)
This paper presents a new framework for human action recognition by fusing human motion with skeletal joints. First, adaptive hierarchical depth motion maps (AH-DMMs) are proposed to capture the shape and motion cues of action sequences. Specifically
External link:
https://doaj.org/article/679eb4e519b54ff5a5602108497f5391
Published in:
Neurocomputing. 528:20-34
Published in:
Complexity, Vol 2020 (2020)
For speaker tracking, integrating multimodal information from audio and video provides an effective and promising solution. The current challenges are focused on the construction of a stable observation model. To this end, we propose a 3D audio-visua
External link:
https://doaj.org/article/a213f219ba264b2484ef3965aa64ee9c
Published in:
Complexity, Vol 2020 (2020)
Most binaural speech source localization models perform poorly in unprecedentedly noisy and reverberant situations. Here, this issue is approached by modelling a multiscale dilated convolutional neural network (CNN). The time-related crosscorrelation
External link:
https://doaj.org/article/af47ab4117c944e6924c7b40adb3feeb
Published in:
CAAI Transactions on Intelligence Technology (2019)
Tiny defect detection (TDD), which aims to perform the quality control of printed circuit boards (PCBs), is a basic and essential task in the production of most electronic products. Though significant progress has been made in PCB defect detection, tra
External link:
https://doaj.org/article/fda952c6f5be42dd8f8b4afcc5350998
3D human mesh recovery from a 2D pose plays an important role in various applications. However, it is hard for existing methods to simultaneously capture the multiple relations during the evolution from skeleton to mesh, including joint-joint, joint-
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b927b308b7e38bdb1d8cdde5b2362e53
http://arxiv.org/abs/2303.05652