Zobrazeno 1 - 10
of 1 169
pro vyhledávání: '"WU Wenhao"'
Publikováno v:
Gong-kuang zidonghua, Vol 48, Iss 7, Pp 142-148 (2022)
The coal mine network is faced with the threat of malicious traffic encrypted by the transport layer security protocol (TLS) generated by malicious software and the high false alarm rate of encrypted traffic during detection. In order to solve the ab
Externí odkaz:
https://doaj.org/article/b0e257df24234433b2b0fcedbba2e864
Audio Descriptions (ADs) aim to provide a narration of a movie in text form, describing non-dialogue-related narratives, such as characters, actions, or scene establishment. Automatic generation of ADs remains challenging due to: i) the domain gap be
Externí odkaz:
http://arxiv.org/abs/2411.18180
A longstanding goal of artificial general intelligence is highly capable generalists that can learn from diverse experiences and generalize to unseen tasks. The language and vision communities have seen remarkable progress toward this trend by scalin
Externí odkaz:
http://arxiv.org/abs/2410.11448
Autor:
Song, Yifan, Xiong, Weimin, Zhao, Xiutian, Zhu, Dawei, Wu, Wenhao, Wang, Ke, Li, Cheng, Peng, Wei, Li, Sujian
Fine-tuning on agent-environment interaction trajectory data holds significant promise for surfacing generalized agent capabilities in open-source large language models (LLMs). In this work, we introduce AgentBank, by far the largest trajectory tunin
Externí odkaz:
http://arxiv.org/abs/2410.07706
Autor:
Changjun Zhao, Zhen Li, Bangsen Tian, Ping Zhang, WU Wenhao, Shuo Gao, Yuechi Yu, Yunyun Dong
Publikováno v:
International Journal of Applied Earth Observations and Geoinformation, Vol 110, Iss , Pp 102792- (2022)
Multitemporal interferometric synthetic aperture radar (InSAR) technology is extensively applied in earth observations. As a critical processing step, the estimation of covariance matrix directly affects the accuracy of its final result. Adaptive mul
Externí odkaz:
https://doaj.org/article/bad5c4acb9a143469d5d48616c6c8124
Publikováno v:
Nanophotonics, Vol 8, Iss 3, Pp 467-474 (2019)
Polarization measurement has been widely used in material characterization, medical diagnosis and remote sensing. However, existing commercial polarization analyzers are either bulky schemes or operate in non-real time. Recently, various polarization
Externí odkaz:
https://doaj.org/article/0e8c6d9b430f43dfb11a31647e80d043
Autor:
Xiong, Weimin, Song, Yifan, Zhao, Xiutian, Wu, Wenhao, Wang, Xun, Wang, Ke, Li, Cheng, Peng, Wei, Li, Sujian
Large language model agents have exhibited exceptional performance across a range of complex interactive tasks. Recent approaches have utilized tuning with expert trajectories to enhance agent performance, yet they primarily concentrate on outcome re
Externí odkaz:
http://arxiv.org/abs/2406.11176
Autor:
Yao, Huanjin, Wu, Wenhao, Yang, Taojiannan, Song, YuXin, Zhang, Mengxi, Feng, Haocheng, Sun, Yifan, Li, Zhiheng, Ouyang, Wanli, Wang, Jingdong
Publikováno v:
NeurIPS 2024
Do we fully leverage the potential of visual encoder in Multimodal Large Language Models (MLLMs)? The recent outstanding performance of MLLMs in multimodal understanding has garnered broad attention from both academia and industry. In the current MLL
Externí odkaz:
http://arxiv.org/abs/2405.13800
Autor:
Zhang, Mengxi, Wu, Wenhao, Lu, Yu, Song, Yuxin, Rong, Kang, Yao, Huanjin, Zhao, Jianbo, Liu, Fanglong, Sun, Yifan, Feng, Haocheng, Wang, Jingdong
Current multimodal Large Language Models (MLLMs) suffer from ``hallucination'', occasionally generating responses that are not grounded in the input images. To tackle this challenge, one promising path is to utilize reinforcement learning from human
Externí odkaz:
http://arxiv.org/abs/2405.11165
Autor:
Wu, Wenhao
This paper undertakes an empirical study to revisit the latest advancements in Multimodal Large Language Models (MLLMs): Video Assistant. This study, namely FreeVA, aims to extend existing image-based MLLM to the video domain in a training-free manne
Externí odkaz:
http://arxiv.org/abs/2405.07798