Výsledky vyhledávání

Report

Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction

Autor: Gu, Yi, Ye, Dongjun, Wang, Zhaorui, Wang, Jiaxu, Cao, Jiahang, Xu, Renjing

Neural surface reconstruction relies heavily on accurate camera poses as input. Despite utilizing advanced pose estimators like COLMAP or ARKit, camera poses can still be noisy. Existing pose-NeRF joint optimization methods handle poses with small no

Externí odkaz: http://arxiv.org/abs/2411.13620

Zobrazit plný text záznamu

Report

Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step

Autor: Zhou, Mingyuan, Zheng, Huangjie, Gu, Yi, Wang, Zhendong, Huang, Hai

Score identity Distillation (SiD) is a data-free method that has achieved SOTA performance in image generation by leveraging only a pretrained diffusion model, without requiring any training data. However, its ultimate performance is constrained by h

Externí odkaz: http://arxiv.org/abs/2410.14919

Zobrazit plný text záznamu

Report

ChatHouseDiffusion: Prompt-Guided Generation and Editing of Floor Plans

Autor: Qin, Sizhong, He, Chengyu, Chen, Qiaoyun, Yang, Sen, Liao, Wenjie, Gu, Yi, Lu, Xinzheng

The generation and editing of floor plans are critical in architectural planning, requiring a high degree of flexibility and efficiency. Existing methods demand extensive input information and lack the capability for interactive adaptation to user mo

Externí odkaz: http://arxiv.org/abs/2410.11908

Zobrazit plný text záznamu

Report

Revisiting Deep Ensemble Uncertainty for Enhanced Medical Anomaly Detection

Autor: Gu, Yi, Lin, Yi, Cheng, Kwang-Ting, Chen, Hao

Medical anomaly detection (AD) is crucial in pathological identification and localization. Current methods typically rely on uncertainty estimation in deep ensembles to detect anomalies, assuming that ensemble learners should agree on normal samples

Externí odkaz: http://arxiv.org/abs/2409.17485

Zobrazit plný text záznamu

Report

3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation

Autor: Gu, Yi, Otake, Yoshito, Uemura, Keisuke, Takao, Masaki, Soufi, Mazen, Okada, Seiji, Sugano, Nobuhiko, Talbot, Hugues, Sato, Yoshinobu

Radiography is widely used in orthopedics for its affordability and low radiation exposure. 3D reconstruction from a single radiograph, so-called 2D-3D reconstruction, offers the possibility of various clinical applications, but achieving clinically

Externí odkaz: http://arxiv.org/abs/2409.16702

Zobrazit plný text záznamu

Report

Speech Recognition Rescoring with Large Speech-Text Foundation Models

Autor: Shivakumar, Prashanth Gurunath, Kolehmainen, Jari, Gourav, Aditya, Gu, Yi, Gandhe, Ankur, Rastrow, Ariya, Bulyko, Ivan

Large language models (LLM) have demonstrated the ability to understand human language by leveraging large amount of text data. Automatic speech recognition (ASR) systems are often limited by available transcribed speech data and benefit from a secon

Externí odkaz: http://arxiv.org/abs/2409.16654

Zobrazit plný text záznamu

Report

Enhancing Quantitative Image Synthesis through Pretraining and Resolution Scaling for Bone Mineral Density Estimation from a Plain X-ray Image

Autor: Gu, Yi, Otake, Yoshito, Uemura, Keisuke, Takao, Masaki, Soufi, Mazen, Okada, Seiji, Sugano, Nobuhiko, Talbot, Hugues, Sato, Yoshinobu

While most vision tasks are essentially visual in nature (for recognition), some important tasks, especially in the medical field, also require quantitative analysis (for quantification) using quantitative images. Unlike in visual analysis, pixel val

Externí odkaz: http://arxiv.org/abs/2407.20495

Zobrazit plný text záznamu

Report

Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification

Autor: Peng, Letian, Gu, Yi, Dong, Chengyu, Wang, Zihan, Shang, Jingbo

For extremely weak-supervised text classification, pioneer research generates pseudo labels by mining texts similar to the class names from the raw corpus, which may end up with very limited or even no samples for the minority classes. Recent works h

Externí odkaz: http://arxiv.org/abs/2406.11115

Zobrazit plný text záznamu

Report

Pandora: Towards General World Model with Natural Language Actions and Video States

Autor: Xiang, Jiannan, Liu, Guangyi, Gu, Yi, Gao, Qiyue, Ning, Yuting, Zha, Yuheng, Feng, Zeyu, Tao, Tianhua, Hao, Shibo, Shi, Yemin, Liu, Zhengzhong, Xing, Eric P., Hu, Zhiting

World models simulate future states of the world in response to different actions. They facilitate interactive content creation and provides a foundation for grounded, long-horizon reasoning. Current foundation models do not fully meet the capabiliti

Externí odkaz: http://arxiv.org/abs/2406.09455

Zobrazit plný text záznamu

Report

Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization

Autor: Gu, Yi, Wang, Zhendong, Yin, Yueqin, Xie, Yujia, Zhou, Mingyuan

Aligning large language models with human preferences has emerged as a critical focus in language modeling research. Yet, integrating preference learning into Text-to-Image (T2I) generative models is still relatively uncharted territory. The Diffusio

Externí odkaz: http://arxiv.org/abs/2406.06382

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání