Výsledky vyhledávání

Report

Learning Optimal Linear Block Transform by Rate Distortion Minimization

Autor: Gnutti, Alessandro, Kao, Chia-Hao, Peng, Wen-Hsiao, Leonardi, Riccardo

Linear block transform coding remains a fundamental component of image and video compression. Although the Discrete Cosine Transform (DCT) is widely employed in all current compression standards, its sub-optimality has sparked ongoing research into d

Externí odkaz: http://arxiv.org/abs/2411.18494

Zobrazit plný text záznamu

Report

Fast-OMRA: Fast Online Motion Resolution Adaptation for Neural B-Frame Coding

Autor: NguyenQuang, Sang, Gao, Zong-Lin, Ho, Kuan-Wei, HoangVan, Xiem, Peng, Wen-Hsiao

Most learned B-frame codecs with hierarchical temporal prediction suffer from the domain shift issue caused by the discrepancy in the Group-of-Pictures (GOP) size used for training and test. As such, the motion estimation network may fail to predict

Externí odkaz: http://arxiv.org/abs/2410.21763

Zobrazit plný text záznamu

Report

Cross-Platform Neural Video Coding: A Case Study

Autor: Conceição, Ruhan, Porto, Marcelo, Peng, Wen-Hsiao, Agostini, Luciano

In this paper, we first show that current learning-based video codecs, specifically the SSF codec, are not suitable for real-world applications due to the mismatch between the encoder and decoder caused by floating-point round-off errors. To address

Externí odkaz: http://arxiv.org/abs/2410.20145

Zobrazit plný text záznamu

Report

On the Rate-Distortion-Complexity Trade-offs of Neural Video Coding

Autor: Chen, Yi-Hsin, Ho, Kuan-Wei, Benjak, Martin, Ostermann, Jörn, Peng, Wen-Hsiao

This paper aims to delve into the rate-distortion-complexity trade-offs of modern neural video coding. Recent years have witnessed much research effort being focused on exploring the full potential of neural video coding. Conditional autoencoders hav

Externí odkaz: http://arxiv.org/abs/2410.03898

Zobrazit plný text záznamu

Report

ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck

Autor: Kao, Chia-Hao, Chien, Cheng, Tseng, Yu-Jen, Chen, Yi-Hsin, Gnutti, Alessandro, Lo, Shao-Yuan, Peng, Wen-Hsiao, Leonardi, Riccardo

This paper presents the first-ever study of adapting compressed image latents to suit the needs of downstream vision tasks that adopt Multimodal Large Language Models (MLLMs). MLLMs have extended the success of large language models to modalities (e.

Externí odkaz: http://arxiv.org/abs/2407.19651

Zobrazit plný text záznamu

Report

Transformer-based Learned Image Compression for Joint Decoding and Denoising

Autor: Chen, Yi-Hsin, Ho, Kuan-Wei, Tsai, Shiau-Rung, Lin, Guan-Hsun, Gnutti, Alessandro, Peng, Wen-Hsiao, Leonardi, Riccardo

This work introduces a Transformer-based image compression system. It has the flexibility to switch between the standard image reconstruction and the denoising reconstruction from a single compressed bitstream. Instead of training separate decoders f

Externí odkaz: http://arxiv.org/abs/2402.12888

Zobrazit plný text záznamu

Report

OMRA: Online Motion Resolution Adaptation to Remedy Domain Shift in Learned Hierarchical B-frame Coding

Autor: Gao, Zong-Lin, NguyenQuang, Sang, Peng, Wen-Hsiao, HoangVan, Xiem

Learned hierarchical B-frame coding aims to leverage bi-directional reference frames for better coding efficiency. However, the domain shift between training and test scenarios due to dataset limitations poses a challenge. This issue arises from trai

Externí odkaz: http://arxiv.org/abs/2402.12816

Zobrazit plný text záznamu

Report

LiDAR Depth Map Guided Image Compression Model

Autor: Gnutti, Alessandro, Della Fiore, Stefano, Savardi, Mattia, Chen, Yi-Hsin, Leonardi, Riccardo, Peng, Wen-Hsiao

The incorporation of LiDAR technology into some high-end smartphones has unlocked numerous possibilities across various applications, including photography, image restoration, augmented reality, and more. In this paper, we introduce a novel direction

Externí odkaz: http://arxiv.org/abs/2401.06517

Zobrazit plný text záznamu

Report

MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression

Autor: Chen, Yi-Hsin, Xie, Hong-Sheng, Chen, Cheng-Wei, Gao, Zong-Lin, Benjak, Martin, Peng, Wen-Hsiao, Ostermann, Jörn

Conditional coding has lately emerged as the mainstream approach to learned video compression. However, a recent study shows that it may perform worse than residual coding when the information bottleneck arises. Conditional residual coding was thus p

Externí odkaz: http://arxiv.org/abs/2312.15829

Zobrazit plný text záznamu

Akademický článek

Integrating population-based biobanks: Catalyst for advances in precision health

Autor: Jui-Chu Lin, Yi-Lien Liu, Wesley Wei-Wen Hsiao, Chien-Te Fan

Publikováno v: Computational and Structural Biotechnology Journal, Vol 24, Iss , Pp 690-698 (2024)

Precision health extends beyond the scope of precision medicine and involves a broader range of activities, including the prediction, prevention, treatment, and management of diseases. Tailored to specific populations, precision health offers persona

Externí odkaz: https://doaj.org/article/3acc5c97d01c4d00864740a6ab5b7303

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání