Výsledky vyhledávání

Report

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Autor: Wang, Lening, Zheng, Wenzhao, Du, Dalong, Zhang, Yunpeng, Ren, Yilong, Jiang, Han, Cui, Zhiyong, Yu, Haiyang, Zhou, Jie, Lu, Jiwen, Zhang, Shanghang

4D driving simulation is essential for developing realistic autonomous driving simulators. Despite advancements in existing methods for generating driving scenes, significant challenges remain in view transformation and spatial-temporal dynamic model

Externí odkaz: http://arxiv.org/abs/2412.05280

Zobrazit plný text záznamu

Report

Budgeted Spatial Data Acquisition: When Coverage and Connectivity Matter

Autor: Yang, Wenzhe, Huang, Shixun, Wang, Sheng, Peng, Zhiyong

Data is undoubtedly becoming a commodity like oil, land, and labor in the 21st century. Although there have been many successful marketplaces for data trading, the existing data marketplaces lack consideration of the case where buyers want to acquire

Externí odkaz: http://arxiv.org/abs/2412.04853

Zobrazit plný text záznamu

Report

A Unified Approach for Multi-granularity Search over Spatial Datasets

Autor: Yang, Wenzhe, Wang, Sheng, Huang, Shixun, Liao, Yuyang, Sun, Yuan, Freire, Juliana, Peng, Zhiyong

There has been increased interest in data search as a means to find relevant datasets or data points in data lakes and repositories. Although approaches have been proposed to support spatial dataset search and data point search, they consider the two

Externí odkaz: http://arxiv.org/abs/2412.04805

Zobrazit plný text záznamu

Report

Approximate Vector Set Search: A Bio-Inspired Approach for High-Dimensional Spaces

Autor: Li, Yiqi, Wang, Sheng, Chen, Zhiyu, Chen, Shangfeng, Peng, Zhiyong

Vector set search, an underexplored similarity search paradigm, aims to find vector sets similar to a query set. This search paradigm leverages the inherent structural alignment between sets and real-world entities to model more fine-grained and cons

Externí odkaz: http://arxiv.org/abs/2412.03301

Zobrazit plný text záznamu

Report

HunyuanVideo: A Systematic Framework For Large Video Generative Models

Recent advancements in video generation have significantly impacted daily life for both individuals and industries. However, the leading video generation models remain closed-source, resulting in a notable performance gap between industry capabilitie

Externí odkaz: http://arxiv.org/abs/2412.03603

Zobrazit plný text záznamu

Report

On Simplifying Large-Scale Spatial Vectors: Fast, Memory-Efficient, and Cost-Predictable k-means

Autor: Ji, Yushuai, Liu, Zepeng, Wang, Sheng, Sun, Yuan, Peng, Zhiyong

The k-means algorithm can simplify large-scale spatial vectors, such as 2D geo-locations and 3D point clouds, to support fast analytics and learning. However, when processing large-scale datasets, existing k-means algorithms have been developed to ac

Externí odkaz: http://arxiv.org/abs/2412.02244

Zobrazit plný text záznamu

Report

[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster

Autor: Zhang, Qizhe, Cheng, Aosong, Lu, Ming, Zhuo, Zhiyong, Wang, Minqi, Cao, Jiajun, Guo, Shaobo, She, Qi, Zhang, Shanghang

Large vision-language models (VLMs) often rely on a substantial number of visual tokens when interacting with large language models (LLMs), which has proven to be inefficient. Recent efforts have aimed to accelerate VLM inference by pruning visual to

Externí odkaz: http://arxiv.org/abs/2412.01818

Zobrazit plný text záznamu

Report

Linearly Homomorphic Signature with Tight Security on Lattice

Autor: Guo, Heng, Tian, Kun, Liu, Fengxia, Zheng, Zhiyong

At present, in lattice-based linearly homomorphic signature schemes, especially under the standard model, there are very few schemes with tight security. This paper constructs the first lattice-based linearly homomorphic signature scheme that achieve

Externí odkaz: http://arxiv.org/abs/2412.01641

Zobrazit plný text záznamu

Report

The Codec Language Model-based Zero-Shot Spontaneous Style TTS System for CoVoC Challenge 2024

Autor: Zhou, Shuoyi, Zhou, Yixuan, Li, Weiqing, Chen, Jun, Ye, Runchuan, Wu, Weihao, Lin, Zijian, Lei, Shun, Wu, Zhiyong

This paper describes the zero-shot spontaneous style TTS system for the ISCSLP 2024 Conversational Voice Clone Challenge (CoVoC). We propose a LLaMA-based codec language model with a delay pattern to achieve spontaneous style voice cloning. To improv

Externí odkaz: http://arxiv.org/abs/2412.01100

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání