Zobrazeno 1 - 10
of 5 615
pro vyhledávání: '"Tan, A T"'
Detecting Human-Object Interactions (HOI) in zero-shot settings, where models must handle unseen classes, poses significant challenges. Existing methods that rely on aligning visual encoders with large Vision-Language Models (VLMs) to tap into the ex
Externí odkaz:
http://arxiv.org/abs/2410.23904
Building on the success of large language models (LLMs), recent advancements such as GPT-4o have enabled real-time speech interactions through LLM-based voice assistants, offering a significantly improved user experience compared to traditional text-
Externí odkaz:
http://arxiv.org/abs/2410.17196
Autor:
Dwivedi, Vijay Prakash, Schlegel, Viktor, Liu, Andy T., Nguyen, Thanh-Tung, Kashyap, Abhinav Ramesh, Wei, Jeng, Yin, Wei-Hsian, Winkler, Stefan, Tan, Robby T.
Large Language Models (LLMs) have demonstrated remarkable performance across various domains, including healthcare. However, their ability to effectively represent structured non-textual data, such as the alphanumeric medical codes used in records li
Externí odkaz:
http://arxiv.org/abs/2410.13351
Autor:
Chia, Xavier X., Ong, Kenny Y. K., Aadhi, A., Chen, George F. R., Choi, Ju Won, Sohn, Byoung-Uk, Chowdury, Amdad, Tan, Dawn T. H.
The perennial demand for highly efficient short-haul communications is evidenced by a sustained explosion of growth in data center infrastructure that is predicted to continue for the foreseeable future. In these relatively compact networks, cost-sen
Externí odkaz:
http://arxiv.org/abs/2410.08638
Autor:
Ji, Y., Hao, G. -P., Tan, Y. -T., Xiong, W. Q., Liu, Y., Zhou, W. Z., Tang, D. -M., Ma, R. Z., Yuan, S. J., Sasaki, T., Lozada-Hidalgo, M., Geim, A. K., Sun, Pengzhan
Publikováno v:
Nature Communications 15, 10546 (2024)
Two dimensional (2D) crystals have attracted strong interest as a new class of proton conducting materials that can block atoms, molecules and ions while allowing proton transport through the atomically thin basal planes. Although 2D materials exhibi
Externí odkaz:
http://arxiv.org/abs/2410.06489
Autor:
Chen, Yiming, Yue, Xianghu, Gao, Xiaoxue, Zhang, Chen, D'Haro, Luis Fernando, Tan, Robby T., Li, Haizhou
Various audio-LLMs (ALLMs) have been explored recently for tackling different audio tasks simultaneously using a single, unified model. While existing evaluations of ALLMs primarily focus on single-audio tasks, real-world applications often involve p
Externí odkaz:
http://arxiv.org/abs/2409.18680
Autor:
Du, Jinyi, Zhang, Xingjian, Chen, George F. R., Gao, Hongwei, Tan, Dawn T. H., Ling, Alexander
Transmitting an entangled state over an extended distance is crucial for the development of quantum networks. Previous demonstrations of transmitting entangled photons over long distance using satellites or fibers have use entangled photon pairs gene
Externí odkaz:
http://arxiv.org/abs/2409.17558
Autor:
Nagar, Aishik, Liu, Yutong, Liu, Andy T., Schlegel, Viktor, Dwivedi, Vijay Prakash, Kaliya-Perumal, Arun-Kumar, Kalanchiam, Guna Pratheep, Tang, Yili, Tan, Robby T.
Medical abstractive summarization faces the challenge of balancing faithfulness and informativeness. Current methods often sacrifice key information for faithfulness or introduce confabulations when prioritizing informativeness. While recent advancem
Externí odkaz:
http://arxiv.org/abs/2408.12095
Sparse view NeRF is challenging because limited input images lead to an under constrained optimization problem for volume rendering. Existing methods address this issue by relying on supplementary information, such as depth maps. However, generating
Externí odkaz:
http://arxiv.org/abs/2408.09144
Existing 2D human pose estimation research predominantly concentrates on well-lit scenarios, with limited exploration of poor lighting conditions, which are a prevalent aspect of daily life. Recent studies on low-light pose estimation require the use
Externí odkaz:
http://arxiv.org/abs/2407.15451