Výsledky vyhledávání - "Krähenbühl, A."

Report

Does Spatial Cognition Emerge in Frontier Models?

Autor: Ramakrishnan, Santhosh Kumar, Wijmans, Erik, Kraehenbuehl, Philipp, Koltun, Vladlen

Not yet. We present SPACE, a benchmark that systematically evaluates spatial cognition in frontier models. Our benchmark builds on decades of research in cognitive science. It evaluates large-scale mapping abilities that are brought to bear when an o

Externí odkaz: http://arxiv.org/abs/2410.06468

Zobrazit plný text záznamu

Report

Promptable Closed-loop Traffic Simulation

Autor: Tan, Shuhan, Ivanovic, Boris, Chen, Yuxiao, Li, Boyi, Weng, Xinshuo, Cao, Yulong, Krähenbühl, Philipp, Pavone, Marco

Simulation stands as a cornerstone for safe and efficient autonomous driving development. At its core a simulation system ought to produce realistic, reactive, and controllable traffic patterns. In this paper, we propose ProSim, a multimodal promptab

Externí odkaz: http://arxiv.org/abs/2409.05863

Zobrazit plný text záznamu

Report

Image and Video Tokenization with Binary Spherical Quantization

Autor: Zhao, Yue, Xiong, Yuanjun, Krähenbühl, Philipp

We propose a new transformer-based image and video tokenizer with Binary Spherical Quantization (BSQ). BSQ projects the high-dimensional visual embedding to a lower-dimensional hypersphere and then applies binary quantization. BSQ is (1) parameter-ef

Externí odkaz: http://arxiv.org/abs/2406.07548

Zobrazit plný text záznamu

Report

GLIDS: A Global Latency Information Dissemination System

Autor: Krähenbühl, Cyrill, Tabaeiaghdaei, Seyedali, Scherrer, Simon, Frei, Matthias, Perrig, Adrian

A recent advance in networking is the deployment of path-aware multipath network architectures, where network endpoints are given multiple network paths to send their data on. In this work, we tackle the challenge of selecting paths for latency-sensi

Externí odkaz: http://arxiv.org/abs/2405.04319

Zobrazit plný text záznamu

Report

Language-Image Models with 3D Understanding

Autor: Cho, Jang Hyun, Ivanovic, Boris, Cao, Yulong, Schmerling, Edward, Wang, Yue, Weng, Xinshuo, Li, Boyi, You, Yurong, Krähenbühl, Philipp, Wang, Yan, Pavone, Marco

Multi-modal large language models (MLLMs) have shown incredible capabilities in a variety of 2D vision and language tasks. We extend MLLMs' perceptual capabilities to ground and reason about images in 3-dimensional space. To that end, we first develo

Externí odkaz: http://arxiv.org/abs/2405.03685

Zobrazit plný text záznamu

Report

Distilling Vision-Language Models on Millions of Videos

Autor: Zhao, Yue, Zhao, Long, Zhou, Xingyi, Wu, Jialin, Chu, Chun-Te, Miao, Hui, Schroff, Florian, Adam, Hartwig, Liu, Ting, Gong, Boqing, Krähenbühl, Philipp, Yuan, Liangzhe

The recent advance in vision-language models is largely attributed to the abundance of image-text data. We aim to replicate this success for video-language models, but there simply is not enough human-curated video-text data available. We thus resort

Externí odkaz: http://arxiv.org/abs/2401.06129

Zobrazit plný text záznamu

Report

Predicting a Protein's Stability under a Million Mutations

Autor: Ouyang-Zhang, Jeffrey, Diaz, Daniel J., Klivans, Adam R., Krähenbühl, Philipp

Stabilizing proteins is a foundational step in protein engineering. However, the evolutionary pressure of all extant proteins makes identifying the scarce number of mutations that will improve thermodynamic stability challenging. Deep learning has re

Externí odkaz: http://arxiv.org/abs/2310.12979

Zobrazit plný text záznamu

Report

Training a Large Video Model on a Single Machine in a Day

Autor: Zhao, Yue, Krähenbühl, Philipp

Videos are big, complex to pre-process, and slow to train on. State-of-the-art large-scale video models are trained on clusters of 32 or more GPUs for several days. As a consequence, academia largely ceded the training of large video models to indust

Externí odkaz: http://arxiv.org/abs/2309.16669

Zobrazit plný text záznamu

Report

Language Conditioned Traffic Generation

Autor: Tan, Shuhan, Ivanovic, Boris, Weng, Xinshuo, Pavone, Marco, Kraehenbuehl, Philipp

Simulation forms the backbone of modern self-driving development. Simulators help develop, test, and improve driving systems without putting humans, vehicles, or their environment at risk. However, simulators face a major challenge: They rely on real

Externí odkaz: http://arxiv.org/abs/2307.07947

Zobrazit plný text záznamu

Report

FABRID: Flexible Attestation-Based Routing for Inter-Domain Networks

Autor: Krähenbühl, Cyrill, Wyss, Marc, Basin, David, Lenders, Vincent, Perrig, Adrian, Strohmeier, Martin

In its current state, the Internet does not provide end users with transparency and control regarding on-path forwarding devices. In particular, the lack of network device information reduces the trustworthiness of the forwarding path and prevents en

Externí odkaz: http://arxiv.org/abs/2304.03108

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání