Showing 1 - 10 of 59 for search: '"Chen, Tianlang"'
Molecular modeling, a central topic in quantum mechanics, aims to accurately calculate the properties and simulate the behaviors of molecular systems. The molecular model is governed by physical laws, which impose geometric constraints such as invari
External link:
http://arxiv.org/abs/2406.16853
Unlike vision and language data which usually has a unique format, molecules can naturally be characterized using different chemical formulations. One can view a molecule as a 2D graph or define it as a collection of atoms located in a 3D space. For
External link:
http://arxiv.org/abs/2210.01765
Author:
Deng, Jiajun, Yang, Zhengyuan, Liu, Daqing, Chen, Tianlang, Zhou, Wengang, Zhang, Yanyong, Li, Houqiang, Ouyang, Wanli
In this work, we explore neat yet effective Transformer-based frameworks for visual grounding. The previous methods generally address the core problem of visual grounding, i.e., multi-modal fusion and reasoning, with manually-designed mechanisms. Suc
External link:
http://arxiv.org/abs/2206.06619
Author:
Chen, Yuxiao, Yuan, Jianbo, Zhao, Long, Chen, Tianlang, Luo, Rui, Davis, Larry, Metaxas, Dimitris N.
Cross-modal attention mechanisms have been widely applied to the image-text matching task and have achieved remarkable improvements thanks to their capability of learning fine-grained relevance across different modalities. However, the cross-modal atte
External link:
http://arxiv.org/abs/2105.09597
In this paper, we present a neat yet effective transformer-based framework for visual grounding, namely TransVG, to address the task of grounding a language query to the corresponding region onto an image. The state-of-the-art methods, including two-
External link:
http://arxiv.org/abs/2104.08541
Author:
Xue, Peixuan, Chen, Tianlang, Huang, Xiehan, Hu, Qiang, Hu, Junhao, Zhang, Han, Yang, Haiping, Chen, Hanping
Published in:
International Journal of Hydrogen Energy, 2 January 2024, 49 (Part A): 356-370
We improve one-stage visual grounding by addressing current limitations on grounding long and complex queries. Existing one-stage methods encode the entire language query as a single sentence embedding vector, e.g., taking the embedding from BERT or
External link:
http://arxiv.org/abs/2008.01059
Transferring the sentiment of an image is an unexplored research topic in the area of computer vision. This work proposes a novel framework consisting of a reference image retrieval step and a global sentiment transfer step to transfer sentiments of
External link:
http://arxiv.org/abs/2006.11989
In this work, we introduce an important but still unexplored research task -- image sentiment transfer. Compared with other related tasks that have been well-studied, such as image-to-image translation and image style transfer, transferring the senti
External link:
http://arxiv.org/abs/2006.11337
Example-guided image synthesis has recently been attempted to synthesize an image from a semantic label map and an exemplary image. In the task, the additional exemplar image provides the style guidance that controls the appearance of the synthesized
External link:
http://arxiv.org/abs/2004.10024