Search results - "Pourreza, Reza"

Report

Live Fitness Coaching as a Testbed for Situated Interaction

Authors: Panchal, Sunny; Bhattacharyya, Apratim; Berger, Guillaume; Mercier, Antoine; Bohm, Cornelius; Dietrichkeit, Florian; Pourreza, Reza; Li, Xuanlin; Madan, Pulkit; Lee, Mingu; Todorovich, Mark; Bax, Ingo; Memisevic, Roland

Tasks at the intersection of vision and language have had a profound impact in advancing the capabilities of vision-language models such as dialog-based assistants. However, models trained on existing tasks are largely limited to turn-based interacti

External link: http://arxiv.org/abs/2407.08101

Report

Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving

Authors: Ling, Zhan; Fang, Yunhao; Li, Xuanlin; Mu, Tongzhou; Lee, Mingu; Pourreza, Reza; Memisevic, Roland; Su, Hao

Large Language Models (LLMs) have achieved tremendous progress, yet they still often struggle with challenging reasoning problems. Current approaches address this challenge by sampling or searching detailed and low-level reasoning chains. However, th

External link: http://arxiv.org/abs/2311.00694

Report

Painter: Teaching Auto-regressive Language Models to Draw Sketches

Authors: Pourreza, Reza; Bhattacharyya, Apratim; Panchal, Sunny; Lee, Mingu; Madan, Pulkit; Memisevic, Roland

Large language models (LLMs) have made tremendous progress in natural language understanding and they have also been successfully adopted in other domains such as computer vision, robotics, reinforcement learning, etc. In this work, we apply LLMs to

External link: http://arxiv.org/abs/2308.08520

Report

Look, Remember and Reason: Grounded reasoning in videos with language models

Authors: Bhattacharyya, Apratim; Panchal, Sunny; Lee, Mingu; Pourreza, Reza; Madan, Pulkit; Memisevic, Roland

Multi-modal language models (LM) have recently shown promising performance in high-level reasoning tasks on videos. However, existing methods still fall short in tasks like causal or compositional spatiotemporal reasoning over actions, in which model

External link: http://arxiv.org/abs/2306.17778

Report

Differentiable bit-rate estimation for neural-based video codec enhancement

Authors: Said, Amir; Singh, Manish Kumar; Pourreza, Reza

Published in: Picture Coding Symposium (PCS), San Jose, CA, USA, 2022, pp. 379-383

Neural networks (NN) can improve standard video compression by pre- and post-processing the encoded video. For optimal NN training, the standard codec needs to be replaced with a codec proxy that can provide derivatives of estimated bit-rate and dist

External link: http://arxiv.org/abs/2301.09776

Report

Optimized learned entropy coding parameters for practical neural-based image and video compression

Authors: Said, Amir; Pourreza, Reza; Le, Hoang

Published in: IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 2022, pp. 661-665

Neural-based image and video codecs are significantly more power-efficient when weights and activations are quantized to low-precision integers. While there are general-purpose techniques for reducing quantization effects, large losses can occur when

External link: http://arxiv.org/abs/2301.08752

Report

Boosting neural video codecs by exploiting hierarchical redundancy

Authors: Pourreza, Reza; Le, Hoang; Said, Amir; Sautiere, Guillaume; Wiggers, Auke

In video compression, coding efficiency is improved by reusing pixels from previously decoded frames via motion and residual compensation. We define two levels of hierarchical redundancy in video frames: 1) first-order: redundancy in pixel space, i.e

External link: http://arxiv.org/abs/2208.04303

Report

MobileCodec: Neural Inter-frame Video Compression on Mobile Devices

Authors: Le, Hoang; Zhang, Liang; Said, Amir; Sautiere, Guillaume; Yang, Yang; Shrestha, Pranav; Yin, Fei; Pourreza, Reza; Wiggers, Auke

Realizing the potential of neural video codecs on mobile devices is a big technological challenge due to the computational complexity of deep networks and the power-constrained mobile hardware. We demonstrate practical feasibility by leveraging Qualc

External link: http://arxiv.org/abs/2207.08338

Report

Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test Set

Authors: van Rozendaal, Ties; Brehmer, Johann; Zhang, Yunfan; Pourreza, Reza; Wiggers, Auke; Cohen, Taco S.

We introduce a video compression algorithm based on instance-adaptive learning. On each video sequence to be transmitted, we finetune a pretrained compression model. The optimal parameters are transmitted to the receiver along with the latent code. B

External link: http://arxiv.org/abs/2111.10302

Report

A Combined Deep Learning based End-to-End Video Coding Architecture for YUV Color Space

Authors: Singh, Ankitesh K.; Egilmez, Hilmi E.; Pourreza, Reza; Coban, Muhammed; Karczewicz, Marta; Cohen, Taco S.

Most of the existing deep learning based end-to-end video coding (DLEC) architectures are designed specifically for RGB color format, yet the video coding standards, including H.264/AVC, H.265/HEVC and H.266/VVC developed over past few decades, have

External link: http://arxiv.org/abs/2104.00807
