Zobrazeno 1 - 10
of 290 412
pro vyhledávání: '"Chiu AT"'
Multimodal incremental learning needs to digest the information from multiple modalities while concurrently learning new knowledge without forgetting the previously learned information. There are numerous challenges for this task, mainly including th
Externí odkaz:
http://arxiv.org/abs/2412.09549
Autor:
Wallingford, Matthew, Bhattad, Anand, Kusupati, Aditya, Ramanujan, Vivek, Deitke, Matt, Kakade, Sham, Kembhavi, Aniruddha, Mottaghi, Roozbeh, Ma, Wei-Chiu, Farhadi, Ali
Three-dimensional (3D) understanding of objects and scenes play a key role in humans' ability to interact with the world and has been an active area of research in computer vision, graphics, and robotics. Large scale synthetic and object-centric 3D d
Externí odkaz:
http://arxiv.org/abs/2412.07770
While large vision-language models (LVLMs) have shown impressive capabilities in generating plausible responses correlated with input visual contents, they still suffer from hallucinations, where the generated text inaccurately reflects visual conten
Externí odkaz:
http://arxiv.org/abs/2412.06775
Second-order topological superconductors host Majorana corner modes (MCMs), which are confined to specific corners of the system. This spatial restriction presents challenges for manipulating and relocating MCMs. We propose a novel protocol for dynam
Externí odkaz:
http://arxiv.org/abs/2412.06150
Autor:
Chiu, Kenneth Chung Tak
We use the determinant method of Bombieri-Pila and Heath-Brown and its Arakelov reformulation by Chen utilizing Bost's slope method to estimate the number of hypersurfaces required to cover the regular rational points with bounded Arakelov height on
Externí odkaz:
http://arxiv.org/abs/2412.05205
Electrocardiogram (ECG) signals play a crucial role in diagnosing cardiovascular diseases. To reduce power consumption in wearable or portable devices used for long-term ECG monitoring, super-resolution (SR) techniques have been developed, enabling t
Externí odkaz:
http://arxiv.org/abs/2412.04861
We propose an efficient radiance field rendering algorithm that incorporates a rasterization process on sparse voxels without neural networks or 3D Gaussians. There are two key contributions coupled with the proposed system. The first is to render sp
Externí odkaz:
http://arxiv.org/abs/2412.04459
Virtual models of human gait, or digital twins, offer a promising solution for studying mobility without the need for labor-intensive data collection. However, challenges such as the sim-to-real gap and limited adaptability to diverse walking conditi
Externí odkaz:
http://arxiv.org/abs/2412.03949
In vision-language models (VLMs), the ability to perceive and interpret color and physical environment is crucial for achieving contextually accurate understanding and interaction. However, despite advances in multimodal modeling, there remains a sig
Externí odkaz:
http://arxiv.org/abs/2412.03927
Autor:
Khanal, Nabin, Yu, Chun Meng, Chiu, Jui-Cheng, Chaudhary, Anav, Zhang, Ziyue, Katija, Kakani, Forbes, Angus G.
Publikováno v:
UIST 2024: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology
We introduce FathomGPT, an open source system for the interactive investigation of ocean science data via a natural language interface. FathomGPT was developed in close collaboration with marine scientists to enable researchers to explore and analyze
Externí odkaz:
http://arxiv.org/abs/2412.02784