Výsledky vyhledávání - "Zhao, Jianbo"

Report

MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving

Autor: Wang, Xiyang, Qi, Shouzheng, Zhao, Jieyou, Zhou, Hangning, Zhang, Siyu, Wang, Guoan, Tu, Kai, Guo, Songlin, Zhao, Jianbo, Li, Jian, Yang, Mu

This paper introduces MCTrack, a new 3D multi-object tracking method that achieves state-of-the-art (SOTA) performance across KITTI, nuScenes, and Waymo datasets. Addressing the gap in existing tracking paradigms, which often perform well on specific

Externí odkaz: http://arxiv.org/abs/2409.16149

Zobrazit plný text záznamu

Report

Force Sensing Guided Artery-Vein Segmentation via Sequential Ultrasound Images

Autor: Geng, Yimeng, Meng, Gaofeng, Chen, Mingcong, Cao, Guanglin, Zhao, Mingyang, Zhao, Jianbo, Liu, Hongbin

Accurate identification of arteries and veins in ultrasound images is crucial for vascular examinations and interventions in robotics-assisted surgeries. However, current methods for ultrasound vessel segmentation face challenges in distinguishing be

Externí odkaz: http://arxiv.org/abs/2407.21394

Zobrazit plný text záznamu

Report

KiGRAS: Kinematic-Driven Generative Model for Realistic Agent Simulation

Autor: Zhao, Jianbo, Zhuang, Jiaheng, Zhou, Qibin, Ban, Taiyu, Xu, Ziyao, Zhou, Hangning, Wang, Junhe, Wang, Guoan, Li, Zhiheng, Li, Bin

Trajectory generation is a pivotal task in autonomous driving. Recent studies have introduced the autoregressive paradigm, leveraging the state transition model to approximate future trajectory distributions. This paradigm closely mirrors the real-wo

Externí odkaz: http://arxiv.org/abs/2407.12940

Zobrazit plný text záznamu

Report

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Autor: Tian, Yuzhang, Zhao, Jianbo, Dong, Haoyu, Xiong, Junyu, Xia, Shiyu, Zhou, Mengyu, Lin, Yun, Cambronero, José, He, Yeye, Han, Shi, Zhang, Dongmei

Spreadsheets, with their extensive two-dimensional grids, various layouts, and diverse formatting options, present notable challenges for large language models (LLMs). In response, we introduce SpreadsheetLLM, pioneering an efficient encoding method

Externí odkaz: http://arxiv.org/abs/2407.09025

Zobrazit plný text záznamu

Report

Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities

Autor: Xia, Shiyu, Xiong, Junyu, Dong, Haoyu, Zhao, Jianbo, Tian, Yuzhang, Zhou, Mengyu, He, Yeye, Han, Shi, Zhang, Dongmei

Publikováno v: Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR), Pages 116-128, August 2024

This paper explores capabilities of Vision Language Models on spreadsheet comprehension. We propose three self-supervised challenges with corresponding evaluation metrics to comprehensively evaluate VLMs on Optical Character Recognition (OCR), spatia

Externí odkaz: http://arxiv.org/abs/2405.16234

Zobrazit plný text záznamu

Report

Automated Multi-level Preference for MLLMs

Autor: Zhang, Mengxi, Wu, Wenhao, Lu, Yu, Song, Yuxin, Rong, Kang, Yao, Huanjin, Zhao, Jianbo, Liu, Fanglong, Sun, Yifan, Feng, Haocheng, Wang, Jingdong

Current multimodal Large Language Models (MLLMs) suffer from ``hallucination'', occasionally generating responses that are not grounded in the input images. To tackle this challenge, one promising path is to utilize reinforcement learning from human

Externí odkaz: http://arxiv.org/abs/2405.11165

Zobrazit plný text záznamu

Report

SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving

Autor: Zhang, Diankun, Wang, Guoan, Zhu, Runwen, Zhao, Jianbo, Chen, Xiwu, Zhang, Siyu, Gong, Jiahao, Zhou, Qibin, Zhang, Wenyuan, Wang, Ningzi, Tan, Feiyang, Zhou, Hangning, Xu, Ziyao, Yao, Haotian, Zhang, Chi, Liu, Xiaojun, Di, Xiaoguang, Li, Bin

End-to-End paradigms use a unified framework to implement multi-tasks in an autonomous driving system. Despite simplicity and clarity, the performance of end-to-end autonomous driving methods on sub-tasks is still far behind the single-task methods.

Externí odkaz: http://arxiv.org/abs/2404.06892

Zobrazit plný text záznamu

Report

BronchoCopilot: Towards Autonomous Robotic Bronchoscopy via Multimodal Reinforcement Learning

Autor: Zhao, Jianbo, Chen, Hao, Tian, Qingyao, Chen, Jian, Yang, Bingyu, Liu, Hongbin

Bronchoscopy plays a significant role in the early diagnosis and treatment of lung diseases. This process demands physicians to maneuver the flexible endoscope for reaching distal lesions, particularly requiring substantial expertise when examining t

Externí odkaz: http://arxiv.org/abs/2403.01483

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání