Showing 1 - 10 of 15,336 for search: '"Yang,Yue"'
Author:
Zhou, Pengfei, Peng, Xiaopeng, Song, Jiajun, Li, Chuanhao, Xu, Zhaopan, Yang, Yue, Guo, Ziyao, Zhang, Hao, Lin, Yuqi, He, Yefei, Zhao, Lirui, Liu, Shuo, Li, Tianhua, Xie, Yuxuan, Chang, Xiaojun, Qiao, Yu, Shao, Wenqi, Zhang, Kaipeng
Multimodal Large Language Models (MLLMs) have made significant strides in visual understanding and generation tasks. However, generating interleaved image-text content remains a challenge, as it requires integrated multimodal understanding and generation…
External link:
http://arxiv.org/abs/2411.18499
Electroencephalography (EEG) is essential in neuroscience and clinical practice, yet it suffers from physiological artifacts, particularly electromyographic (EMG) contamination, which distorts the signal. We propose a deep learning model using pix2pixGAN to remove such artifacts…
External link:
http://arxiv.org/abs/2411.13288
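The snippet above names pix2pixGAN, i.e. a conditional GAN trained on paired noisy/clean examples. As a rough illustration only (not the authors' released code; the channel count, window length, and simplified generator stack are all assumptions), a pix2pix-style setup for EEG denoising might look like this in PyTorch:

```python
# Hypothetical sketch of a pix2pix-style EEG artifact remover.
# Shapes and architecture are illustrative assumptions, not the paper's model.
import torch
import torch.nn as nn

class Generator(nn.Module):
    """Toy 1-D conv stack standing in for pix2pix's usual U-Net generator."""
    def __init__(self, channels: int = 19):  # 19 EEG channels: an assumption
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(channels, 64, kernel_size=15, padding=7),
            nn.ReLU(),
            nn.Conv1d(64, 64, kernel_size=15, padding=7),
            nn.ReLU(),
            nn.Conv1d(64, channels, kernel_size=15, padding=7),
        )

    def forward(self, x):
        return self.net(x)

class Discriminator(nn.Module):
    """PatchGAN-style critic over concatenated (noisy, candidate-clean) signals."""
    def __init__(self, channels: int = 19):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(2 * channels, 64, kernel_size=15, stride=2, padding=7),
            nn.LeakyReLU(0.2),
            nn.Conv1d(64, 1, kernel_size=15, stride=2, padding=7),
        )

    def forward(self, noisy, candidate):
        return self.net(torch.cat([noisy, candidate], dim=1))

# One illustrative generator step: adversarial loss plus L1 reconstruction,
# the standard pix2pix objective (the 100x weight is pix2pix's default).
G, D = Generator(), Discriminator()
bce, l1 = nn.BCEWithLogitsLoss(), nn.L1Loss()
noisy = torch.randn(8, 19, 512)   # batch x channels x samples (assumed shapes)
clean = torch.randn(8, 19, 512)   # paired artifact-free target
fake = G(noisy)
d_fake = D(noisy, fake)
g_loss = bce(d_fake, torch.ones_like(d_fake)) + 100.0 * l1(fake, clean)
g_loss.backward()
```

The L1 term keeps the cleaned signal close to the paired target while the adversarial term pushes it toward realistic EEG morphology; the paper's actual architecture and loss weighting may differ.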
Robot Imitation Learning (IL) is a crucial technique in robot learning, where agents learn by mimicking human demonstrations. However, IL encounters scalability challenges stemming from both non-user-friendly demonstration collection methods and the…
External link:
http://arxiv.org/abs/2410.15994
We simulated the intermittent boundary-layer flashback (BLF) of hydrogen-enriched swirling flames using large-eddy simulation (LES) with the flame-surface-density (FSD) method. Three cases of intermittent BLF, characterized by periodic flame entry and…
External link:
http://arxiv.org/abs/2410.15988
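For context on the flame-surface-density method this snippet mentions: in premixed LES, the filtered reaction rate is commonly closed as the product of the unburnt-gas density, the laminar flame speed, and a generalized flame surface density. A standard algebraic form (an assumption about which closure this paper uses) is:

```latex
% Generalized FSD closure for the filtered progress-variable source term
% in premixed LES (standard form; the paper's exact model may differ):
%   \rho_u : unburnt-gas density,  s_L : laminar flame speed,
%   c : progress variable,  \Sigma_{gen} : generalized flame surface density.
\[
  \overline{\dot{\omega}}_c = \rho_u \, s_L \, \Sigma_{\mathrm{gen}},
  \qquad
  \Sigma_{\mathrm{gen}} = \overline{\lvert \nabla c \rvert}
\]
```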
Inflammatory or misleading "fake" news content has proliferated in recent years. Simultaneously, it has become easier than ever to use AI tools to generate photorealistic images depicting any scene imaginable. Combi…
External link:
http://arxiv.org/abs/2410.09045
Large Vision-Language Models (LVLMs) have demonstrated remarkable capabilities across multimodal tasks such as visual perception and reasoning, leading to good performance on various multimodal evaluation benchmarks. However, these benchmarks keep a…
External link:
http://arxiv.org/abs/2410.08695
Author:
Le, Long, Xie, Jason, Liang, William, Wang, Hung-Ju, Yang, Yue, Ma, Yecheng Jason, Vedder, Kyle, Krishna, Arjun, Jayaraman, Dinesh, Eaton, Eric
Interactive 3D simulated objects are crucial in AR/VR, animations, and robotics, driving immersive experiences and advanced automation. However, creating these articulated objects requires extensive human effort and expertise, limiting their broader…
External link:
http://arxiv.org/abs/2410.13882
Author:
Wang, Zhaowei, Zhang, Hongming, Fang, Tianqing, Tian, Ye, Yang, Yue, Ma, Kaixin, Pan, Xiaoman, Song, Yangqiu, Yu, Dong
Object navigation in unknown environments is crucial for deploying embodied agents in real-world applications. While we have witnessed huge progress due to large-scale scene datasets, faster simulators, and stronger models, previous studies mainly focus…
External link:
http://arxiv.org/abs/2410.02730
Author:
Zhang, Han, Killeen, Benjamin D., Ku, Yu-Chun, Seenivasan, Lalithkumar, Zhao, Yuxuan, Liu, Mingxu, Yang, Yue, Gu, Suxi, Martin-Gomez, Alejandro, Taylor, Russell H., Osgood, Greg, Unberath, Mathias
In percutaneous pelvic trauma surgery, accurate placement of Kirschner wires (K-wires) is crucial to ensure effective fracture fixation and avoid complications due to breaching the cortical bone along an unsuitable trajectory. Surgical navigation via…
External link:
http://arxiv.org/abs/2410.01143
Author:
Deitke, Matt, Clark, Christopher, Lee, Sangho, Tripathi, Rohun, Yang, Yue, Park, Jae Sung, Salehi, Mohammadreza, Muennighoff, Niklas, Lo, Kyle, Soldaini, Luca, Lu, Jiasen, Anderson, Taira, Bransom, Erin, Ehsani, Kiana, Ngo, Huong, Chen, YenSung, Patel, Ajay, Yatskar, Mark, Callison-Burch, Chris, Head, Andrew, Hendrix, Rose, Bastani, Favyen, VanderBilt, Eli, Lambert, Nathan, Chou, Yvonne, Chheda, Arnavi, Sparks, Jenna, Skjonsberg, Sam, Schmitz, Michael, Sarnat, Aaron, Bischoff, Byron, Walsh, Pete, Newell, Chris, Wolters, Piper, Gupta, Tanmay, Zeng, Kuo-Hao, Borchardt, Jon, Groeneveld, Dirk, Nam, Crystal, Lebrecht, Sophie, Wittlif, Caitlin, Schoenick, Carissa, Michel, Oscar, Krishna, Ranjay, Weihs, Luca, Smith, Noah A., Hajishirzi, Hannaneh, Girshick, Ross, Farhadi, Ali, Kembhavi, Aniruddha
Today's most advanced vision-language models (VLMs) remain proprietary. The strongest open-weight models rely heavily on synthetic data from proprietary VLMs to achieve good performance, effectively distilling these closed VLMs into open ones. As a result…
External link:
http://arxiv.org/abs/2409.17146