Search results

Report

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Authors: Wang, Xingrui; Li, Xin; Hu, Yaosi; Zhu, Hanxin; Hou, Chen; Lan, Cuiling; Chen, Zhibo

Text-driven Image to Video Generation (TI2V) aims to generate controllable video given the first frame and corresponding textual description. The primary challenges of this task lie in two parts: (i) how to identify the target objects and ensure the

External link: http://arxiv.org/abs/2412.10275

Report

Staking out the Proton Drip-Line of Thulium at the N=82 Shell Closure

Direct observation of proton emission with very small emission energy is often unfeasible due to the long partial half-lives associated with tunneling through the Coulomb barrier. Therefore proton emitters with very small Q-values may require masses

External link: http://arxiv.org/abs/2412.10259

Report

Integrated trucks assignment and scheduling problem with mixed service mode docks: A Q-learning based adaptive large neighborhood search algorithm

Authors: Li, Yueyi; Mohammadi, Mehrdad; Zhang, Xiaodong; Lan, Yunxing; van Jaarsveld, Willem

Mixed service mode docks enhance efficiency by flexibly handling both loading and unloading trucks in warehouses. However, existing research often predetermines the number and location of these docks prior to planning truck assignment and sequencing.

External link: http://arxiv.org/abs/2412.09090

Report

MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced Interpretation

Authors: Zhang, Zheyuan; Wang, Zehong; Ma, Tianyi; Taneja, Varun Sameer; Nelson, Sofia; Le, Nhi Ha Lan; Murugesan, Keerthiram; Ju, Mingxuan; Chawla, Nitesh V; Zhang, Chuxu; Ye, Yanfang

The prevalence of unhealthy eating habits has become an increasingly concerning issue in the United States. However, major food recommendation platforms (e.g., Yelp) continue to prioritize users' dietary preferences over the healthiness of their choi

External link: http://arxiv.org/abs/2412.08847

Report

StreamChat: Chatting with Streaming Video

Authors: Liu, Jihao; Yu, Zhiding; Lan, Shiyi; Wang, Shihao; Fang, Rongyao; Kautz, Jan; Li, Hongsheng; Alvarez, Jose M.

This paper presents StreamChat, a novel approach that enhances the interaction capabilities of Large Multimodal Models (LMMs) with streaming video content. In streaming interaction scenarios, existing methods rely solely on visual information availab

External link: http://arxiv.org/abs/2412.08646

Report

TECO: Improving Multimodal Intent Recognition with Text Enhancement through Commonsense Knowledge Extraction

Authors: Nguyen, Quynh-Mai Thi; Nguyen, Lan-Nhi Thi; Nguyen, Cam-Van Thi

The objective of multimodal intent recognition (MIR) is to leverage various modalities, such as text, video, and audio, to detect user intentions, which is crucial for understanding human language and context in dialogue systems. Despite advances in th

External link: http://arxiv.org/abs/2412.08529

Report

ObjCtrl-2.5D: Training-free Object Control with Camera Poses

Authors: Wang, Zhouxia; Lan, Yushi; Zhou, Shangchen; Loy, Chen Change

This study aims to achieve more precise and versatile object control in image-to-video (I2V) generation. Current methods typically represent the spatial movement of target objects with 2D trajectories, which often fail to capture user intention and f

External link: http://arxiv.org/abs/2412.07721

Report

CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings

Authors: Mu, Jiazuo; Yang, Fuyi; Zhang, Yanshun; Zhang, Junxiong; Luo, Yongjian; Xu, Lan; Shi, Yujiao; Yu, Jingyi; Zhang, Yingliang

We introduce CADSpotting, an efficient method for panoptic symbol spotting in large-scale architectural CAD drawings. Existing approaches struggle with the diversity of symbols, scale variations, and overlapping elements in CAD designs. CADSpotting o

External link: http://arxiv.org/abs/2412.07377

Report

Tube Category, Tensor Renormalization and Topological Holography

Author: Lan, Tian

Ocneanu's tube algebra provides a finite algorithm to compute the Drinfeld center of a fusion category. In this work we reveal the universal property underlying the tube algebra. Take a base category $\mathcal V$ which is concrete, bicomplete, and sy

External link: http://arxiv.org/abs/2412.07198

Report

AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark

Authors: Li, Lan; Fang, Liri; Torvik, Vetle I.

We investigate the reasoning capabilities of large language models (LLMs) for automatically generating data-cleaning workflows. To evaluate LLMs' ability to complete data-cleaning tasks, we implemented a pipeline for LLM-based Auto Data Cleaning Work

External link: http://arxiv.org/abs/2412.06724
