Výsledky vyhledávání

Report

EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection

Autor: Lei, Qinqian, Wang, Bo, Tan, Robby T.

Detecting Human-Object Interactions (HOI) in zero-shot settings, where models must handle unseen classes, poses significant challenges. Existing methods that rely on aligning visual encoders with large Vision-Language Models (VLMs) to tap into the ex

Externí odkaz: http://arxiv.org/abs/2410.23904

Zobrazit plný text záznamu

Report

VoiceBench: Benchmarking LLM-Based Voice Assistants

Autor: Chen, Yiming, Yue, Xianghu, Zhang, Chen, Gao, Xiaoxue, Tan, Robby T., Li, Haizhou

Building on the success of large language models (LLMs), recent advancements such as GPT-4o have enabled real-time speech interactions through LLM-based voice assistants, offering a significantly improved user experience compared to traditional text-

Externí odkaz: http://arxiv.org/abs/2410.17196

Zobrazit plný text záznamu

Report

Representation Learning of Structured Data for Medical Foundation Models

Autor: Dwivedi, Vijay Prakash, Schlegel, Viktor, Liu, Andy T., Nguyen, Thanh-Tung, Kashyap, Abhinav Ramesh, Wei, Jeng, Yin, Wei-Hsian, Winkler, Stefan, Tan, Robby T.

Large Language Models (LLMs) have demonstrated remarkable performance across various domains, including healthcare. However, their ability to effectively represent structured non-textual data, such as the alphanumeric medical codes used in records li

Externí odkaz: http://arxiv.org/abs/2410.13351

Zobrazit plný text záznamu

Report

Leveraging reconfigurable micro-resonator soliton crystals for Intensity-Modulated Direct Detection Data Transmission

Autor: Chia, Xavier X., Ong, Kenny Y. K., Aadhi, A., Chen, George F. R., Choi, Ju Won, Sohn, Byoung-Uk, Chowdury, Amdad, Tan, Dawn T. H.

The perennial demand for highly efficient short-haul communications is evidenced by a sustained explosion of growth in data center infrastructure that is predicted to continue for the foreseeable future. In these relatively compact networks, cost-sen

Externí odkaz: http://arxiv.org/abs/2410.08638

Zobrazit plný text záznamu

Report

High proton conductivity through angstrom-porous titania

Autor: Ji, Y., Hao, G. -P., Tan, Y. -T., Xiong, W. Q., Liu, Y., Zhou, W. Z., Tang, D. -M., Ma, R. Z., Yuan, S. J., Sasaki, T., Lozada-Hidalgo, M., Geim, A. K., Sun, Pengzhan

Publikováno v: Nature Communications 15, 10546 (2024)

Two dimensional (2D) crystals have attracted strong interest as a new class of proton conducting materials that can block atoms, molecules and ions while allowing proton transport through the atomically thin basal planes. Although 2D materials exhibi

Externí odkaz: http://arxiv.org/abs/2410.06489

Zobrazit plný text záznamu

Report

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models

Autor: Chen, Yiming, Yue, Xianghu, Gao, Xiaoxue, Zhang, Chen, D'Haro, Luis Fernando, Tan, Robby T., Li, Haizhou

Various audio-LLMs (ALLMs) have been explored recently for tackling different audio tasks simultaneously using a single, unified model. While existing evaluations of ALLMs primarily focus on single-audio tasks, real-world applications often involve p

Externí odkaz: http://arxiv.org/abs/2409.18680

Zobrazit plný text záznamu

Report

Demonstration of entanglement distribution over 155 km metropolitan fiber using a silicon nanophotonic chip

Autor: Du, Jinyi, Zhang, Xingjian, Chen, George F. R., Gao, Hongwei, Tan, Dawn T. H., Ling, Alexander

Transmitting an entangled state over an extended distance is crucial for the development of quantum networks. Previous demonstrations of transmitting entangled photons over long distance using satellites or fibers have use entangled photon pairs gene

Externí odkaz: http://arxiv.org/abs/2409.17558

Zobrazit plný text záznamu

Report

uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

Autor: Nagar, Aishik, Liu, Yutong, Liu, Andy T., Schlegel, Viktor, Dwivedi, Vijay Prakash, Kaliya-Perumal, Arun-Kumar, Kalanchiam, Guna Pratheep, Tang, Yili, Tan, Robby T.

Medical abstractive summarization faces the challenge of balancing faithfulness and informativeness. Current methods often sacrifice key information for faithfulness or introduce confabulations when prioritizing informativeness. While recent advancem

Externí odkaz: http://arxiv.org/abs/2408.12095

Zobrazit plný text záznamu

Report

SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation

Autor: Cao, Xiao, Lin, Beibei, Wang, Bo, Huang, Zhiyong, Tan, Robby T.

Sparse view NeRF is challenging because limited input images lead to an under constrained optimization problem for volume rendering. Existing methods address this issue by relying on supplementary information, such as depth maps. However, generating

Externí odkaz: http://arxiv.org/abs/2408.09144

Zobrazit plný text záznamu

Report

Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions

Autor: Ai, Yihao, Qi, Yifei, Wang, Bo, Cheng, Yu, Wang, Xinchao, Tan, Robby T.

Existing 2D human pose estimation research predominantly concentrates on well-lit scenarios, with limited exploration of poor lighting conditions, which are a prevalent aspect of daily life. Recent studies on low-light pose estimation require the use

Externí odkaz: http://arxiv.org/abs/2407.15451

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání