Showing 1 - 10 of 4,389 for search: '"LEI, Ting"'
Author:
Wu, Ruitao, Fang, Juncheng, Pan, Rui, Lin, Rongyi, Li, Kaiyuan, Lei, Ting, Du, Luping, Yuan, Xiaocong
Inspired by neural network algorithms in deep learning, diffractive optical networks have arisen as new platforms for manipulating light-matter interactions. Inherited from the deep learning black box nature, clear physical meanings have never been g
External link:
http://arxiv.org/abs/2410.12233
Zero-shot Human-Object Interaction (HOI) detection has emerged as a frontier topic due to its capability to detect HOIs beyond a predefined set of categories. This task entails not only identifying the interactiveness of human-object pairs and locali
External link:
http://arxiv.org/abs/2408.02484
Open-vocabulary human-object interaction (HOI) detection, which is concerned with the problem of detecting novel HOIs guided by natural language, is crucial for understanding human-centric scenes. However, prior zero-shot HOI detectors often employ t
External link:
http://arxiv.org/abs/2404.06194
Published in:
Pigment & Resin Technology, 2023, Vol. 53, Issue 6, pp. 1064-1073.
External link:
http://www.emeraldinsight.com/doi/10.1108/PRT-05-2023-0043
Human Object Interaction (HOI) detection aims to localize and infer the relationships between a human and an object. Arguably, training supervised models for this task from scratch presents challenges due to the performance drop over rare classes and
External link:
http://arxiv.org/abs/2309.03696
Understanding human tasks through video observations is an essential capability of intelligent agents. The challenges of such capability lie in the difficulty of generating a detailed understanding of situated actions, their effects on object states
External link:
http://arxiv.org/abs/2210.03929
Published in:
In Alexandria Engineering Journal January 2025 111:521-529
Author:
Qin, Qingqing, Hu, Yingmo, Sun, Ning, Lei, Ting, Qin, Shuhao, Yang, Yuanyuan, Wu, Xiao, Cui, Zhenyu, An, Mingze
Published in:
In Carbon January 2025 231
In this technical report, we briefly introduce the solutions of our team `PKU-WICT-MIPL' for the PIC Makeup Temporal Video Grounding (MTVG) Challenge in ACM-MM 2022. Given an untrimmed makeup video and a step query, the MTVG aims to localize a tempor
External link:
http://arxiv.org/abs/2207.02687