Zobrazeno 1 - 10
of 157
pro vyhledávání: '"Wang, Hongsong"'
Zero-shot action recognition, which addresses the issue of scalability and generalization in action recognition and allows the models to adapt to new and unseen actions dynamically, is an important research topic in computer vision communities. The k
Externí odkaz:
http://arxiv.org/abs/2409.14336
Federated learning is an efficient framework designed to facilitate collaborative model training across multiple distributed devices while preserving user data privacy. A significant challenge of federated learning is data-level heterogeneity, i.e.,
Externí odkaz:
http://arxiv.org/abs/2408.07966
Human action understanding is a fundamental and challenging task in computer vision. Although there exists tremendous research on this area, most works focus on action recognition, while action retrieval has received less attention. In this paper, we
Externí odkaz:
http://arxiv.org/abs/2407.09924
3D face alignment is a very challenging and fundamental problem in computer vision. Existing deep learning-based methods manually design different networks to regress either parameters of a 3D face model or 3D positions of face vertices. However, des
Externí odkaz:
http://arxiv.org/abs/2406.07873
Dance plays an important role as an artistic form and expression in human culture, yet the creation of dance remains a challenging task. Most dance generation methods primarily rely solely on music, seldom taking into consideration intrinsic attribut
Externí odkaz:
http://arxiv.org/abs/2406.07871
Federated learning shows promise as a privacy-preserving collaborative learning technique. Existing heterogeneous federated learning mainly focuses on skewing the label distribution across clients. However, most approaches suffer from catastrophic fo
Externí odkaz:
http://arxiv.org/abs/2312.09881
Autor:
Wang, Hongsong, Zhang, Yuqi
Patent retrieval has been attracting tremendous interest from researchers in intellectual property and information retrieval communities in the past decades. However, most existing approaches rely on textual and metadata information of the patent, an
Externí odkaz:
http://arxiv.org/abs/2308.13749
Visual retrieval tasks such as image retrieval and person re-identification (Re-ID) aim at effectively and thoroughly searching images with similar content or the same identity. After obtaining retrieved examples, re-ranking is a widely adopted post-
Externí odkaz:
http://arxiv.org/abs/2306.08792
Autor:
Liu, Chong, Zhang, Yuqi, Wang, Hongsong, Chen, Weihua, Wang, Fan, Huang, Yan, Shen, Yi-Dong, Wang, Liang
Image-text retrieval is a central problem for understanding the semantic relationship between vision and language, and serves as the basis for various visual and language tasks. Most previous works either simply learn coarse-grained representations o
Externí odkaz:
http://arxiv.org/abs/2306.08789
Publikováno v:
In Pattern Recognition September 2024 153