Výsledky vyhledávání

Report

Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram

Autor: Zhang, Ming-Liang, Li, Zhong-Zhi, Yin, Fei, Lin, Liang, Liu, Cheng-Lin

Geometry problem solving (GPS) requires capacities of multi-modal understanding, multi-hop reasoning and theorem knowledge application. In this paper, we propose a neural-symbolic model for plane geometry problem solving (PGPS), named PGPSNet-v2, wit

Externí odkaz: http://arxiv.org/abs/2407.07327

Zobrazit plný text záznamu

Report

CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models

Autor: Li, Zhong-Zhi, Zhang, Ming-Liang, Yin, Fei, Ji, Zhi-Long, Bai, Jin-Feng, Pan, Zhen-Ru, Zeng, Fan-Hu, Xu, Jian, Zhang, Jia-Xin, Liu, Cheng-Lin

Due to the rapid advancements in multimodal large language models, evaluating their multimodal mathematical capabilities continues to receive wide attention. Despite the datasets like MathVista proposed benchmarks for assessing mathematical capabilit

Externí odkaz: http://arxiv.org/abs/2407.12023

Zobrazit plný text záznamu

Report

Ensemble Quadratic Assignment Network for Graph Matching

Autor: Tan, Haoru, Wang, Chuang, Wu, Sitong, Zhang, Xu-Yao, Yin, Fei, Liu, Cheng-Lin

Graph matching is a commonly used technique in computer vision and pattern recognition. Recent data-driven approaches have improved the graph matching accuracy remarkably, whereas some traditional algorithm-based methods are more robust to feature no

Externí odkaz: http://arxiv.org/abs/2403.06457

Zobrazit plný text záznamu

Report

GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving

Autor: Zhang, Jiaxin, Li, Zhongzhi, Zhang, Mingliang, Yin, Fei, Liu, Chenglin, Moshfeghi, Yashar

Recent advancements in large language models (LLMs) and multi-modal models (MMs) have demonstrated their remarkable capabilities in problem-solving. Yet, their proficiency in tackling geometry math problems, which necessitates an integrated understan

Externí odkaz: http://arxiv.org/abs/2402.10104

Zobrazit plný text záznamu

Report

LANS: A Layout-Aware Neural Solver for Plane Geometry Problem

Autor: Li, Zhong-Zhi, Zhang, Ming-Liang, Yin, Fei, Liu, Cheng-Lin

Geometry problem solving (GPS) is a challenging mathematical reasoning task requiring multi-modal understanding, fusion, and reasoning. Existing neural solvers take GPS as a vision-language task but are short in the representation of geometry diagram

Externí odkaz: http://arxiv.org/abs/2311.16476

Zobrazit plný text záznamu

Akademický článek

The Impact of DAZZEON αSleep® Far-Infrared Blanket on Sleep, Blood Pressure, Vascular Health, Muscle Function, Inflammation, and Fatigue

Autor: Mon-Chien Lee, Chin-Shan Ho, Yi-Ju Hsu, Nai-Wen Kan, Chen-Yin Fei, Hung-Jen Yang, Chi-Chang Huang

Publikováno v: Clocks & Sleep, Vol 6, Iss 3, Pp 499-516 (2024)

The application of far-infrared blankets has shown certain benefits in health promotion and therapy, such as improving blood circulation and alleviating muscle pain. However, the effects of such blankets on increasing deep sleep, reducing blood press

Externí odkaz: https://doaj.org/article/e2ff23725c3342f094a4d3557c66e91c

Zobrazit plný text záznamu

Plný text ve formátu HTML

Report

ToonTalker: Cross-Domain Face Reenactment

Autor: Gong, Yuan, Zhang, Yong, Cun, Xiaodong, Yin, Fei, Fan, Yanbo, Wang, Xuan, Wu, Baoyuan, Yang, Yujiu

We target cross-domain face reenactment in this paper, i.e., driving a cartoon image with the video of a real person and vice versa. Recently, many works have focused on one-shot talking face generation to drive a portrait with a real video, i.e., wi

Externí odkaz: http://arxiv.org/abs/2308.12866

Zobrazit plný text záznamu

Report

NOFA: NeRF-based One-shot Facial Avatar Reconstruction

Autor: Yu, Wangbo, Fan, Yanbo, Zhang, Yong, Wang, Xuan, Yin, Fei, Bai, Yunpeng, Cao, Yan-Pei, Shan, Ying, Wu, Yang, Sun, Zhongqian, Wu, Baoyuan

3D facial avatar reconstruction has been a significant research topic in computer graphics and computer vision, where photo-realistic rendering and flexible controls over poses and expressions are necessary for many related applications. Recently, it

Externí odkaz: http://arxiv.org/abs/2307.03441

Zobrazit plný text záznamu

Report

Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution

Autor: Cheng, Yiji, Yin, Fei, Huang, Xiaoke, Yu, Xintong, Liu, Jiaxiang, Feng, Shikun, Yang, Yujiu, Tang, Yansong

Text-to-3D is an emerging task that allows users to create 3D content with infinite possibilities. Existing works tackle the problem by optimizing a 3D representation with guidance from pre-trained diffusion models. An apparent drawback is that they

Externí odkaz: http://arxiv.org/abs/2306.02083

Zobrazit plný text záznamu

Report

Accelerating Diffusion Models for Inverse Problems through Shortcut Sampling

Autor: Liu, Gongye, Sun, Haoze, Li, Jiayi, Yin, Fei, Yang, Yujiu

Diffusion models have recently demonstrated an impressive ability to address inverse problems in an unsupervised manner. While existing methods primarily focus on modifying the posterior sampling process, the potential of the forward process remains

Externí odkaz: http://arxiv.org/abs/2305.16965

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání