Showing 1 - 10 of 718 for search: '"Zhao, Jianbo"'
Author:
Wang, Xiyang, Qi, Shouzheng, Zhao, Jieyou, Zhou, Hangning, Zhang, Siyu, Wang, Guoan, Tu, Kai, Guo, Songlin, Zhao, Jianbo, Li, Jian, Yang, Mu
This paper introduces MCTrack, a new 3D multi-object tracking method that achieves state-of-the-art (SOTA) performance across KITTI, nuScenes, and Waymo datasets. Addressing the gap in existing tracking paradigms, which often perform well on specific
External link:
http://arxiv.org/abs/2409.16149
Author:
Geng, Yimeng, Meng, Gaofeng, Chen, Mingcong, Cao, Guanglin, Zhao, Mingyang, Zhao, Jianbo, Liu, Hongbin
Accurate identification of arteries and veins in ultrasound images is crucial for vascular examinations and interventions in robotics-assisted surgeries. However, current methods for ultrasound vessel segmentation face challenges in distinguishing be
External link:
http://arxiv.org/abs/2407.21394
Author:
Zhao, Jianbo, Zhuang, Jiaheng, Zhou, Qibin, Ban, Taiyu, Xu, Ziyao, Zhou, Hangning, Wang, Junhe, Wang, Guoan, Li, Zhiheng, Li, Bin
Trajectory generation is a pivotal task in autonomous driving. Recent studies have introduced the autoregressive paradigm, leveraging the state transition model to approximate future trajectory distributions. This paradigm closely mirrors the real-wo
External link:
http://arxiv.org/abs/2407.12940
Author:
Tian, Yuzhang, Zhao, Jianbo, Dong, Haoyu, Xiong, Junyu, Xia, Shiyu, Zhou, Mengyu, Lin, Yun, Cambronero, José, He, Yeye, Han, Shi, Zhang, Dongmei
Spreadsheets, with their extensive two-dimensional grids, various layouts, and diverse formatting options, present notable challenges for large language models (LLMs). In response, we introduce SpreadsheetLLM, pioneering an efficient encoding method
External link:
http://arxiv.org/abs/2407.09025
Author:
Xia, Shiyu, Xiong, Junyu, Dong, Haoyu, Zhao, Jianbo, Tian, Yuzhang, Zhou, Mengyu, He, Yeye, Han, Shi, Zhang, Dongmei
Published in:
Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR), Pages 116-128, August 2024
This paper explores capabilities of Vision Language Models on spreadsheet comprehension. We propose three self-supervised challenges with corresponding evaluation metrics to comprehensively evaluate VLMs on Optical Character Recognition (OCR), spatia
External link:
http://arxiv.org/abs/2405.16234
Author:
Zhang, Mengxi, Wu, Wenhao, Lu, Yu, Song, Yuxin, Rong, Kang, Yao, Huanjin, Zhao, Jianbo, Liu, Fanglong, Sun, Yifan, Feng, Haocheng, Wang, Jingdong
Current multimodal Large Language Models (MLLMs) suffer from "hallucination", occasionally generating responses that are not grounded in the input images. To tackle this challenge, one promising path is to utilize reinforcement learning from human
External link:
http://arxiv.org/abs/2405.11165
Author:
Zhang, Diankun, Wang, Guoan, Zhu, Runwen, Zhao, Jianbo, Chen, Xiwu, Zhang, Siyu, Gong, Jiahao, Zhou, Qibin, Zhang, Wenyuan, Wang, Ningzi, Tan, Feiyang, Zhou, Hangning, Xu, Ziyao, Yao, Haotian, Zhang, Chi, Liu, Xiaojun, Di, Xiaoguang, Li, Bin
End-to-end paradigms use a unified framework to implement multiple tasks in an autonomous driving system. Despite their simplicity and clarity, the performance of end-to-end autonomous driving methods on sub-tasks is still far behind that of single-task methods.
External link:
http://arxiv.org/abs/2404.06892
Bronchoscopy plays a significant role in the early diagnosis and treatment of lung diseases. This process requires physicians to maneuver the flexible endoscope to reach distal lesions, particularly requiring substantial expertise when examining t
External link:
http://arxiv.org/abs/2403.01483
Published in:
In International Journal of Hydrogen Energy 28 October 2024 88:78-85
Author:
Chen, Bo, Ma, Sen, Kumar, Sachin, Yao, Zhitong, Feng, Wanqi, Zhao, Jianbo, Zhang, Xu, Cai, Di, Cao, Hui, Watson, Ian
Published in:
In Carbon Resources Conversion September 2024 7(3)