Search results - "CLARK, RONALD A."

Report

Olympus: A Universal Task Router for Computer Vision Tasks

Authors: Lin, Yuanze; Li, Yunsheng; Chen, Dongdong; Xu, Weijian; Clark, Ronald; Torr, Philip H. S.

We introduce Olympus, a new approach that transforms Multimodal Large Language Models (MLLMs) into a unified framework capable of handling a wide array of computer vision tasks. Utilizing a controller MLLM, Olympus delegates over 20 specialized tasks

External link: http://arxiv.org/abs/2412.09612

Report

MALT: Improving Reasoning with Multi-Agent LLM Training

Authors: Motwani, Sumeet Ramesh; Smith, Chandler; Das, Rocktim Jyoti; Rybchuk, Markian; Torr, Philip H. S.; Laptev, Ivan; Pizzati, Fabio; Clark, Ronald; de Witt, Christian Schroeder

Enabling effective collaboration among LLMs is a crucial step toward developing autonomous systems capable of solving complex problems. While LLMs are typically used as single-model generators, where humans critique and refine their outputs, the pote

External link: http://arxiv.org/abs/2412.01928

Report

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Authors: Channing, Georgia; Sock, Juil; Clark, Ronald; Torr, Philip; de Witt, Christian Schroeder

The rapid proliferation of AI-manipulated or generated audio deepfakes poses serious challenges to media integrity and election security. Current AI-driven detection solutions lack explainability and underperform in real-world settings. In this paper

External link: http://arxiv.org/abs/2410.07436

Report

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Authors: Brown, Bradley; Juravsky, Jordan; Ehrlich, Ryan; Clark, Ronald; Le, Quoc V.; Ré, Christopher; Mirhoseini, Azalia

Scaling the amount of compute used to train language models has dramatically improved their capabilities. However, when it comes to inference, we often limit the amount of compute to only one attempt per problem. Here, we explore inference compute as

External link: http://arxiv.org/abs/2407.21787

Report

Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge

Authors: Lin, Yuanze; Li, Yunsheng; Chen, Dongdong; Xu, Weijian; Clark, Ronald; Torr, Philip; Yuan, Lu

In recent years, multimodal large language models (MLLMs) have made significant strides by training on vast high-quality image-text datasets, enabling them to generally understand images well. However, the inherent difficulty in explicitly conveying

External link: http://arxiv.org/abs/2407.04681

Report

EVCL: Elastic Variational Continual Learning with Weight Consolidation

Authors: Batra, Hunar; Clark, Ronald

Continual learning aims to allow models to learn new tasks without forgetting what has been learned before. This work introduces Elastic Variational Continual Learning with Weight Consolidation (EVCL), a novel hybrid model that integrates the variati

External link: http://arxiv.org/abs/2406.15972

Report

DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion

Authors: Lin, Yuanze; Clark, Ronald; Torr, Philip

We present DreamPolisher, a novel Gaussian Splatting based method with geometric guidance, tailored to learn cross-view consistency and intricate detail from textual descriptions. While recent progress on text-to-3D generation methods have been promi

External link: http://arxiv.org/abs/2403.17237

Report

DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and Computer Vision Applications

Authors: Nimal, Nillan; Li, Wenbin; Clark, Ronald; Saeedi, Sajad

The creation of accurate virtual models of real-world objects is imperative to robotic simulations and applications such as computer vision, artificial intelligence, and machine learning. This paper documents the different methods employed for genera

External link: http://arxiv.org/abs/2402.11836

Report

Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator

Authors: Amini-Naieni, Niki; Jakab, Tomas; Vedaldi, Andrea; Clark, Ronald

Although Neural Radiance Fields (NeRFs) have markedly improved novel view synthesis, accurate uncertainty quantification in their image predictions remains an open problem. The prevailing methods for estimating uncertainty, including the state-of-the

External link: http://arxiv.org/abs/2312.02350

Report

Towards the Probabilistic Fusion of Learned Priors into Standard Pipelines for 3D Reconstruction

Authors: Laidlow, Tristan; Czarnowski, Jan; Nicastro, Andrea; Clark, Ronald; Leutenegger, Stefan

The best way to combine the results of deep learning with standard 3D reconstruction pipelines remains an open problem. While systems that pass the output of traditional multi-view stereo approaches to a network for regularisation or refinement curre

External link: http://arxiv.org/abs/2207.13464
