Výsledky vyhledávání - "ROSS, DAVID A."

Report

Autor: Ross, David A.

Results about the structure of the set of Egyptian fractions on the line are extended to subsets of topological groups.

Externí odkaz: http://arxiv.org/abs/2410.24165

Zobrazit plný text záznamu

Report

Advantages of multistage quantum walks over QAOA

Autor: Gerblich, Lasse, Dasanjh, Tamanna, Wong, Horatio Q. X., Ross, David, Novo, Leonardo, Chancellor, Nicholas, Kendon, Viv

Methods to find the solution state for optimization problems encoded into Ising Hamiltonians are a very active area of current research. In this work we compare the quantum approximate optimization algorithm (QAOA) with multi-stage quantum walks (MSQ

Externí odkaz: http://arxiv.org/abs/2407.06663

Zobrazit plný text záznamu

Report

Nonstandard arguments for results about infinite systems of equations in infinitely many variables

Autor: Ross, David A.

Short nonstandard proofs are given for some results about infinite systems of equations in infinitely many variables.

Externí odkaz: http://arxiv.org/abs/2405.04552

Zobrazit plný text záznamu

Report

Linear equations and multiplicative polynomial equations in infinitely many variables

Autor: Nathanson, Melvyn B., Ross, David A.

This paper describes infinite sets of polynomial equations in infinitely many variables with the property that the existence of a solution or even an approximate solution for every finite subset of the equations implies the existence of a solution fo

Externí odkaz: http://arxiv.org/abs/2405.01766

Zobrazit plný text záznamu

Report

CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation

Autor: Chowdhury, Townim Faisal, Liao, Kewen, Phan, Vu Minh Hieu, To, Minh-Son, Xie, Yutong, Hung, Kevin, Ross, David, Hengel, Anton van den, Verjans, Johan W., Liao, Zhibin

Deep Neural Networks (DNNs) are widely used for visual classification tasks, but their complex computation process and black-box nature hinder decision transparency and interpretability. Class activation maps (CAMs) and recent variants provide ways t

Externí odkaz: http://arxiv.org/abs/2404.02388

Zobrazit plný text záznamu

Report

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Autor: Hu, Ziniu, Iscen, Ahmet, Jain, Aashi, Kipf, Thomas, Yue, Yisong, Ross, David A., Schmid, Cordelia, Fathi, Alireza

This paper introduces SceneCraft, a Large Language Model (LLM) Agent converting text descriptions into Blender-executable Python scripts which render complex scenes with up to a hundred 3D assets. This process requires complex spatial planning and ar

Externí odkaz: http://arxiv.org/abs/2403.01248

Zobrazit plný text záznamu

Report

VideoPrism: A Foundational Visual Encoder for Video Understanding

Autor: Zhao, Long, Gundavarapu, Nitesh B., Yuan, Liangzhe, Zhou, Hao, Yan, Shen, Sun, Jennifer J., Friedman, Luke, Qian, Rui, Weyand, Tobias, Zhao, Yue, Hornung, Rachel, Schroff, Florian, Yang, Ming-Hsuan, Ross, David A., Wang, Huisheng, Adam, Hartwig, Sirotenko, Mikhail, Liu, Ting, Gong, Boqing

We introduce VideoPrism, a general-purpose video encoder that tackles diverse video understanding tasks with a single frozen model. We pretrain VideoPrism on a heterogeneous corpus containing 36M high-quality video-caption pairs and 582M video clips

Externí odkaz: http://arxiv.org/abs/2402.13217

Zobrazit plný text záznamu

Report

VideoPoet: A Large Language Model for Zero-Shot Video Generation

We present VideoPoet, a language model capable of synthesizing high-quality video, with matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder-only transformer architecture that processes multimodal inputs -- includ

Externí odkaz: http://arxiv.org/abs/2312.14125

Zobrazit plný text záznamu

Kniha

Tarot and Tequila : A Tarot Guide with Cocktails. [elektronicky zdroj]

Autor: Ross, David A.

Externí odkaz: Kolekce e-knih KNAV (Registrovani uzivatele: plny text online 5 minut, dalsi pristup na vyzadani. Registered users: full text online 5 minutes, further access on requests.)

Report

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Autor: Yu, Lijun, Lezama, José, Gundavarapu, Nitesh B., Versari, Luca, Sohn, Kihyuk, Minnen, David, Cheng, Yong, Birodkar, Vighnesh, Gupta, Agrim, Gu, Xiuye, Hauptmann, Alexander G., Gong, Boqing, Yang, Ming-Hsuan, Essa, Irfan, Ross, David A., Jiang, Lu

While Large Language Models (LLMs) are the dominant models for generative tasks in language, they do not perform as well as diffusion models on image and video generation. To effectively use LLMs for visual generation, one crucial component is the vi

Externí odkaz: http://arxiv.org/abs/2310.05737

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání