Výsledky vyhledávání

Report

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Autor: Zhu, Chenming, Wang, Tai, Zhang, Wenwei, Pang, Jiangmiao, Liu, Xihui

Recent advancements in Large Multimodal Models (LMMs) have greatly enhanced their proficiency in 2D visual understanding tasks, enabling them to effectively process and understand images and videos. However, the development of LMMs with 3D-awareness

Externí odkaz: http://arxiv.org/abs/2409.18125

Zobrazit plný text záznamu

Report

SLAM assisted 3D tracking system for laparoscopic surgery

Autor: Song, Jingwei, Zhang, Ray, Zhang, Wenwei, Zhou, Hao, Ghaffari, Maani

A major limitation of minimally invasive surgery is the difficulty in accurately locating the internal anatomical structures of the target organ due to the lack of tactile feedback and transparency. Augmented reality (AR) offers a promising solution

Externí odkaz: http://arxiv.org/abs/2409.11688

Zobrazit plný text záznamu

Report

Temporal Reversed Training for Spiking Neural Networks with Generalized Spatio-Temporal Representation

Autor: Zuo, Lin, Ding, Yongqi, Luo, Wenwei, Jing, Mengmeng, Tian, Xianlong, Yang, Kunshan

Spiking neural networks (SNNs) have received widespread attention as an ultra-low energy computing paradigm. Recent studies have focused on improving the feature extraction capability of SNNs, but they suffer from inefficient inference and suboptimal

Externí odkaz: http://arxiv.org/abs/2408.09108

Zobrazit plný text záznamu

Report

A Mean Field Ansatz for Zero-Shot Weight Transfer

Autor: Chen, Xingyuan, Kuang, Wenwei, Deng, Lei, Han, Wei, Bai, Bo, Reis, Goncalo dos

The pre-training cost of large language models (LLMs) is prohibitive. One cutting-edge approach to reduce the cost is zero-shot weight transfer, also known as model growth for some cases, which magically transfers the weights trained in a small model

Externí odkaz: http://arxiv.org/abs/2408.08681

Zobrazit plný text záznamu

Report

Automated Defects Detection and Fix in Logging Statement

Autor: Zhong, Renyi, Li, Yichen, Kuang, Jinxi, Gu, Wenwei, Huo, Yintong, Lyu, Michael R.

Developers use logging statements to monitor software, but misleading logs can complicate maintenance by obscuring actual activities. Existing research on logging quality issues is limited, mainly focusing on single defects and manual fixes. To addre

Externí odkaz: http://arxiv.org/abs/2408.03101

Zobrazit plný text záznamu

Report

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Autor: Chen, Zehui, Liu, Kuikun, Wang, Qiuchen, Liu, Jiangning, Zhang, Wenwei, Chen, Kai, Zhao, Feng

Information seeking and integration is a complex cognitive task that consumes enormous time and effort. Inspired by the remarkable progress of Large Language Models, recent works attempt to solve this task by combining LLMs and search engines. Howeve

Externí odkaz: http://arxiv.org/abs/2407.20183

Zobrazit plný text záznamu

Report

CIBench: Evaluating Your LLMs with a Code Interpreter Plugin

Autor: Zhang, Songyang, Zhang, Chuyu, Hu, Yingfan, Shen, Haowen, Liu, Kuikun, Ma, Zerun, Zhou, Fengzhe, Zhang, Wenwei, He, Xuming, Lin, Dahua, Chen, Kai

While LLM-Based agents, which use external tools to solve complex problems, have made significant progress, benchmarking their ability is challenging, thereby hindering a clear understanding of their limitations. In this paper, we propose an interact

Externí odkaz: http://arxiv.org/abs/2407.10499

Zobrazit plný text záznamu

Report

4D Contrastive Superflows are Dense 3D Representation Learners

Autor: Xu, Xiang, Kong, Lingdong, Shuai, Hui, Zhang, Wenwei, Pan, Liang, Chen, Kai, Liu, Ziwei, Liu, Qingshan

In the realm of autonomous driving, accurate 3D perception is the foundation. However, developing such models relies on extensive human annotations -- a process that is both costly and labor-intensive. To address this challenge from a data representa

Externí odkaz: http://arxiv.org/abs/2407.06190

Zobrazit plný text záznamu

Report

STMR: Spiral Transformer for Hand Mesh Reconstruction

Autor: Xie, Huilong, Song, Wenwei, Kang, Wenxiong, Lin, Yihong

Recent advancements in both transformer-based methods and spiral neighbor sampling techniques have greatly enhanced hand mesh reconstruction. Transformers excel in capturing complex vertex relationships, and spiral neighbor sampling is vital for util

Externí odkaz: http://arxiv.org/abs/2407.05967

Zobrazit plný text záznamu

Report

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

Autor: Gu, Yuzhe, Ji, Ziwei, Zhang, Wenwei, Lyu, Chengqi, Lin, Dahua, Chen, Kai

Large language models (LLMs) exhibit hallucinations in long-form question-answering tasks across various domains and wide applications. Current hallucination detection and mitigation datasets are limited in domains and sizes, which struggle to scale

Externí odkaz: http://arxiv.org/abs/2407.04693

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání