Search results - "Hou, Chenshu"

Report

PD-APE: A Parallel Decoding Framework with Adaptive Position Encoding for 3D Visual Grounding

Authors: Hou, Chenshu; Peng, Liang; Wu, Xiaopei; He, Xiaofei; Wang, Wenxiao

3D visual grounding aims to identify objects in 3D point cloud scenes that match specific natural language descriptions. This requires the model to not only focus on the target object itself but also to consider the surrounding environment to determi

External link: http://arxiv.org/abs/2407.14491
