Zobrazeno 1 - 10
of 3 391
pro vyhledávání: '"Yin, Fei"'
Geometry problem solving (GPS) requires capacities of multi-modal understanding, multi-hop reasoning and theorem knowledge application. In this paper, we propose a neural-symbolic model for plane geometry problem solving (PGPS), named PGPSNet-v2, wit
Externí odkaz:
http://arxiv.org/abs/2407.07327
Autor:
Li, Zhong-Zhi, Zhang, Ming-Liang, Yin, Fei, Ji, Zhi-Long, Bai, Jin-Feng, Pan, Zhen-Ru, Zeng, Fan-Hu, Xu, Jian, Zhang, Jia-Xin, Liu, Cheng-Lin
Due to the rapid advancements in multimodal large language models, evaluating their multimodal mathematical capabilities continues to receive wide attention. Despite the datasets like MathVista proposed benchmarks for assessing mathematical capabilit
Externí odkaz:
http://arxiv.org/abs/2407.12023
Graph matching is a commonly used technique in computer vision and pattern recognition. Recent data-driven approaches have improved the graph matching accuracy remarkably, whereas some traditional algorithm-based methods are more robust to feature no
Externí odkaz:
http://arxiv.org/abs/2403.06457
Recent advancements in large language models (LLMs) and multi-modal models (MMs) have demonstrated their remarkable capabilities in problem-solving. Yet, their proficiency in tackling geometry math problems, which necessitates an integrated understan
Externí odkaz:
http://arxiv.org/abs/2402.10104
Geometry problem solving (GPS) is a challenging mathematical reasoning task requiring multi-modal understanding, fusion, and reasoning. Existing neural solvers take GPS as a vision-language task but are short in the representation of geometry diagram
Externí odkaz:
http://arxiv.org/abs/2311.16476
Autor:
Mon-Chien Lee, Chin-Shan Ho, Yi-Ju Hsu, Nai-Wen Kan, Chen-Yin Fei, Hung-Jen Yang, Chi-Chang Huang
Publikováno v:
Clocks & Sleep, Vol 6, Iss 3, Pp 499-516 (2024)
The application of far-infrared blankets has shown certain benefits in health promotion and therapy, such as improving blood circulation and alleviating muscle pain. However, the effects of such blankets on increasing deep sleep, reducing blood press
Externí odkaz:
https://doaj.org/article/e2ff23725c3342f094a4d3557c66e91c
Autor:
Gong, Yuan, Zhang, Yong, Cun, Xiaodong, Yin, Fei, Fan, Yanbo, Wang, Xuan, Wu, Baoyuan, Yang, Yujiu
We target cross-domain face reenactment in this paper, i.e., driving a cartoon image with the video of a real person and vice versa. Recently, many works have focused on one-shot talking face generation to drive a portrait with a real video, i.e., wi
Externí odkaz:
http://arxiv.org/abs/2308.12866
Autor:
Yu, Wangbo, Fan, Yanbo, Zhang, Yong, Wang, Xuan, Yin, Fei, Bai, Yunpeng, Cao, Yan-Pei, Shan, Ying, Wu, Yang, Sun, Zhongqian, Wu, Baoyuan
3D facial avatar reconstruction has been a significant research topic in computer graphics and computer vision, where photo-realistic rendering and flexible controls over poses and expressions are necessary for many related applications. Recently, it
Externí odkaz:
http://arxiv.org/abs/2307.03441
Autor:
Cheng, Yiji, Yin, Fei, Huang, Xiaoke, Yu, Xintong, Liu, Jiaxiang, Feng, Shikun, Yang, Yujiu, Tang, Yansong
Text-to-3D is an emerging task that allows users to create 3D content with infinite possibilities. Existing works tackle the problem by optimizing a 3D representation with guidance from pre-trained diffusion models. An apparent drawback is that they
Externí odkaz:
http://arxiv.org/abs/2306.02083
Diffusion models have recently demonstrated an impressive ability to address inverse problems in an unsupervised manner. While existing methods primarily focus on modifying the posterior sampling process, the potential of the forward process remains
Externí odkaz:
http://arxiv.org/abs/2305.16965