Výsledky vyhledávání

Report

Exemplar Masking for Multimodal Incremental Learning

Autor: Lee, Yi-Lun, Lee, Chen-Yu, Chiu, Wei-Chen, Tsai, Yi-Hsuan

Multimodal incremental learning needs to digest the information from multiple modalities while concurrently learning new knowledge without forgetting the previously learned information. There are numerous challenges for this task, mainly including th

Externí odkaz: http://arxiv.org/abs/2412.09549

Zobrazit plný text záznamu

Report

From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos

Autor: Wallingford, Matthew, Bhattad, Anand, Kusupati, Aditya, Ramanujan, Vivek, Deitke, Matt, Kakade, Sham, Kembhavi, Aniruddha, Mottaghi, Roozbeh, Ma, Wei-Chiu, Farhadi, Ali

Three-dimensional (3D) understanding of objects and scenes play a key role in humans' ability to interact with the world and has been an active area of research in computer vision, graphics, and robotics. Large scale synthetic and object-centric 3D d

Externí odkaz: http://arxiv.org/abs/2412.07770

Zobrazit plný text záznamu

Report

Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models

Autor: Lee, Yi-Lun, Tsai, Yi-Hsuan, Chiu, Wei-Chen

While large vision-language models (LVLMs) have shown impressive capabilities in generating plausible responses correlated with input visual contents, they still suffer from hallucinations, where the generated text inaccurately reflects visual conten

Externí odkaz: http://arxiv.org/abs/2412.06775

Zobrazit plný text záznamu

Report

Moving Protocol of Majorana Corner Modes in a Superconducting 2D Weyl Semimetal Heterostructure

Autor: Chiu, Ching-Kai, Yao, Yueh-Ting, Chang, Tay-Rong, Bian, Guang

Second-order topological superconductors host Majorana corner modes (MCMs), which are confined to specific corners of the system. This spatial restriction presents challenges for manipulating and relocating MCMs. We propose a novel protocol for dynam

Externí odkaz: http://arxiv.org/abs/2412.06150

Zobrazit plný text záznamu

Report

Slope-determinant method, complex cellular structures and hypersurface coverings of regular rational points

Autor: Chiu, Kenneth Chung Tak

We use the determinant method of Bombieri-Pila and Heath-Brown and its Arakelov reformulation by Chen utilizing Bost's slope method to estimate the number of hypersurfaces required to cover the regular rational points with bounded Arakelov height on

Externí odkaz: http://arxiv.org/abs/2412.05205

Zobrazit plný text záznamu

Report

MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution

Autor: Lin, Jie, Chiu, I, Wang, Kuan-Chen, Liu, Kai-Chun, Wang, Hsin-Min, Yeh, Ping-Cheng, Tsao, Yu

Electrocardiogram (ECG) signals play a crucial role in diagnosing cardiovascular diseases. To reduce power consumption in wearable or portable devices used for long-term ECG monitoring, super-resolution (SR) techniques have been developed, enabling t

Externí odkaz: http://arxiv.org/abs/2412.04861

Zobrazit plný text záznamu

Report

Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering

Autor: Sun, Cheng, Choe, Jaesung, Loop, Charles, Ma, Wei-Chiu, Wang, Yu-Chiang Frank

We propose an efficient radiance field rendering algorithm that incorporates a rasterization process on sparse voxels without neural networks or 3D Gaussians. There are two key contributions coupled with the proposed system. The first is to render sp

Externí odkaz: http://arxiv.org/abs/2412.04459

Zobrazit plný text záznamu

Report

Learning Speed-Adaptive Walking Agent Using Imitation Learning with Physics-Informed Simulation

Autor: Chiu, Yi-Hung, Lee, Ung Hee, Song, Changseob, Hu, Manaen, Kang, Inseung

Virtual models of human gait, or digital twins, offer a promising solution for studying mobility without the need for labor-intensive data collection. However, challenges such as the sim-to-real gap and limited adaptability to diverse walking conditi

Externí odkaz: http://arxiv.org/abs/2412.03949

Zobrazit plný text záznamu

Report

MegaCOIN: Enhancing Medium-Grained Color Perception for Vision-Language Models

Autor: Chiu, Ming-Chang, Wen, Shicheng, Chen, Pin-Yu, Ma, Xuezhe

In vision-language models (VLMs), the ability to perceive and interpret color and physical environment is crucial for achieving contextually accurate understanding and interaction. However, despite advances in multimodal modeling, there remains a sig

Externí odkaz: http://arxiv.org/abs/2412.03927

Zobrazit plný text záznamu

Report

FathomGPT: A Natural Language Interface for Interactively Exploring Ocean Science Data

Autor: Khanal, Nabin, Yu, Chun Meng, Chiu, Jui-Cheng, Chaudhary, Anav, Zhang, Ziyue, Katija, Kakani, Forbes, Angus G.

Publikováno v: UIST 2024: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology

We introduce FathomGPT, an open source system for the interactive investigation of ocean science data via a natural language interface. FathomGPT was developed in close collaboration with marine scientists to enable researchers to explore and analyze

Externí odkaz: http://arxiv.org/abs/2412.02784

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání