Výsledky vyhledávání

Report

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Autor: Fan, Lijie, Li, Tianhong, Qin, Siyang, Li, Yuanzhen, Sun, Chen, Rubinstein, Michael, Sun, Deqing, He, Kaiming, Tian, Yonglong

Scaling up autoregressive models in vision has not proven as beneficial as in large language models. In this work, we investigate this scaling problem in the context of text-to-image generation, focusing on two critical factors: whether models use di

Externí odkaz: http://arxiv.org/abs/2410.13863

Zobrazit plný text záznamu

Report

Autoregressive Image Generation without Vector Quantization

Autor: Li, Tianhong, Tian, Yonglong, Li, He, Deng, Mingyang, He, Kaiming

Conventional wisdom holds that autoregressive models for image generation are typically accompanied by vector-quantized tokens. We observe that while a discrete-valued space can facilitate representing a categorical distribution, it is not a necessit

Externí odkaz: http://arxiv.org/abs/2406.11838

Zobrazit plný text záznamu

Report

Return of Unconditional Generation: A Self-supervised Representation Generation Method

Autor: Li, Tianhong, Katabi, Dina, He, Kaiming

Unconditional generation -- the problem of modeling data distribution without relying on human-annotated labels -- is a long-standing and fundamental challenge in generative models, creating a potential of learning from large-scale unlabeled data. In

Externí odkaz: http://arxiv.org/abs/2312.03701

Zobrazit plný text záznamu

Report

Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency

Autor: Li, Tianhong, Bhardwaj, Sangnie, Tian, Yonglong, Zhang, Han, Barber, Jarred, Katabi, Dina, Lajoie, Guillaume, Chang, Huiwen, Krishnan, Dilip

Current vision-language generative models rely on expansive corpora of paired image-text data to attain optimal performance and generalization capabilities. However, automatically collecting such data (e.g. via large-scale web scraping) leads to low

Externí odkaz: http://arxiv.org/abs/2310.03734

Zobrazit plný text záznamu

Report

Reparo: Loss-Resilient Generative Codec for Video Conferencing

Autor: Li, Tianhong, Sivaraman, Vibhaalakshmi, Karimi, Pantea, Fan, Lijie, Alizadeh, Mohammad, Katabi, Dina

Packet loss during video conferencing often results in poor quality and video freezing. Retransmitting lost packets is often impractical due to the need for real-time playback, and using Forward Error Correction (FEC) for packet recovery is challengi

Externí odkaz: http://arxiv.org/abs/2305.14135

Zobrazit plný text záznamu

Report

Global Finite-Energy Solutions of the Compressible Euler-Poisson Equations for General Pressure Laws with Spherical Symmetry

Autor: Chen, Gui-Qiang G., Huang, Feimin, Li, Tianhong, Wang, Weiqiang, Wang, Yong

We are concerned with global finite-energy solutions of the three-dimensional compressible Euler-Poisson equations with gravitational potential and general pressure law, especially including the constitutive equation of white dwarf stars. We construc

Externí odkaz: http://arxiv.org/abs/2305.12615

Zobrazit plný text záznamu

Report

MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis

Autor: Li, Tianhong, Chang, Huiwen, Mishra, Shlok Kumar, Zhang, Han, Katabi, Dina, Krishnan, Dilip

Generative modeling and representation learning are two key tasks in computer vision. However, these models are typically trained independently, which ignores the potential for each task to help the other, and leads to training and model maintenance

Externí odkaz: http://arxiv.org/abs/2211.09117

Zobrazit plný text záznamu

Report

Unsupervised Learning for Human Sensing Using Radio Signals

Autor: Li, Tianhong, Fan, Lijie, Yuan, Yuan, Katabi, Dina

There is a growing literature demonstrating the feasibility of using Radio Frequency (RF) signals to enable key computer vision tasks in the presence of occlusions and poor lighting. It leverages that RF signals traverse walls and occlusions to deliv

Externí odkaz: http://arxiv.org/abs/2207.02370

Zobrazit plný text záznamu

Akademický článek

Unveiling the spatiotemporal dynamics and influencing factors of carbon stocks in the yangtze river basin over the past two decades

Autor: Xi, Haojun, Li, Tianhong

Publikováno v: In Science of the Total Environment 1 December 2024 954

Zobrazit plný text záznamu

Report

Targeted Supervised Contrastive Learning for Long-Tailed Recognition

Autor: Li, Tianhong, Cao, Peng, Yuan, Yuan, Fan, Lijie, Yang, Yuzhe, Feris, Rogerio, Indyk, Piotr, Katabi, Dina

Real-world data often exhibits long tail distributions with heavy class imbalance, where the majority classes can dominate the training process and alter the decision boundaries of the minority classes. Recently, researchers have investigated the pot

Externí odkaz: http://arxiv.org/abs/2111.13998

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání