Search results - "Zhang, Juexiao"

Report

Authors: Zhang, Juexiao; Zhu, Gao; Li, Sihang; Liu, Xinhao; Song, Haorui; Tang, Xinran; Feng, Chen

A proper scene representation is central to the pursuit of spatial intelligence where agents can robustly reconstruct and efficiently understand 3D scenes. A scene representation is either metric, such as landmark maps in 3D reconstruction, 3D boundi

External link: http://arxiv.org/abs/2410.11187

Report

VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model

Authors: Wang, Beichen; Zhang, Juexiao; Dong, Shuwen; Fang, Irving; Feng, Chen

Vision Language Models (VLMs) have recently been adopted in robotics for their capability in common sense reasoning and generalizability. Existing work has applied VLMs to generate task and motion planning from natural language instructions and simul

External link: http://arxiv.org/abs/2410.08792

Report

Tell Me Where You Are: Multimodal LLMs Meet Place Recognition

Authors: Lyu, Zonglin; Zhang, Juexiao; Lu, Mingxuan; Li, Yiming; Feng, Chen

Large language models (LLMs) exhibit a variety of promising capabilities in robotics, including long-horizon planning and commonsense reasoning. However, their performance in place recognition is still underexplored. In this work, we introduce multim

External link: http://arxiv.org/abs/2406.17520

Report

LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images

Authors: Zhang, Jing; Fang, Irving; Zhang, Juexiao; Wu, Hao; Kaushik, Akshat; Rodriguez, Alice; Zhao, Hanwen; Zheng, Zhuo; Iovita, Radu; Feng, Chen

Lithic Use-Wear Analysis (LUWA) using microscopic images is an underexplored vision-for-science research area. It seeks to distinguish the worked material, which is critical for understanding archaeological artifacts, material interactions, tool func

External link: http://arxiv.org/abs/2403.13171

Report

ActFormer: Scalable Collaborative Perception via Active Queries

Authors: Huang, Suozhi; Zhang, Juexiao; Li, Yiming; Feng, Chen

Collaborative perception leverages rich visual observations from multiple robots to extend a single robot's perception ability beyond its field of view. Many prior works receive messages broadcast from all collaborators, leading to a scalability chal

External link: http://arxiv.org/abs/2403.04968

Report

URLOST: Unsupervised Representation Learning without Stationarity or Topology

Authors: Yun, Zeyu; Zhang, Juexiao; Olshausen, Bruno; LeCun, Yann; Chen, Yubei

Unsupervised representation learning has seen tremendous progress but is constrained by its reliance on data modality-specific stationarity and topology, a limitation not found in biological intelligence systems. For instance, human vision processes

External link: http://arxiv.org/abs/2310.04496

Report

Word Embedding Visualization Via Dictionary Learning

Authors: Zhang, Juexiao; Chen, Yubei; Cheung, Brian; Olshausen, Bruno A

Co-occurrence statistics based word embedding techniques have proved to be very useful in extracting the semantic and syntactic representation of words as low dimensional continuous vectors. In this work, we discovered that dictionary learning can op

External link: http://arxiv.org/abs/1910.03833
