Zobrazeno 1 - 10
of 3 424
pro vyhledávání: '"Chan, David"'
Publikováno v:
GMS Ophthalmology Cases, Vol 13, p Doc12 (2023)
Background: While complex public health challenges and the emergence of variants have impeded responses to the COVID pandemic, vaccines continue to represent a crucial tool in mitigating the risk of morbidity and mortality. Safety issues weigh heavil
Externí odkaz:
https://doaj.org/article/d00a5385ee3442c8b8bf14c049634320
Autor:
Chan, David M., Corona, Rodolfo, Park, Joonyong, Cho, Cheol Jun, Bai, Yutong, Darrell, Trevor
With the introduction of transformer-based models for vision and language tasks, such as LLaVA and Chameleon, there has been renewed interest in the discrete tokenized representation of images. These models often treat image patches as discrete token
Externí odkaz:
http://arxiv.org/abs/2411.05001
Tambara functors are the analogue of commutative rings in equivariant algebra. Nakaoka defined ideals in Tambara functors, leading to the definition of the Nakaoka spectrum of prime ideals in a Tambara functor. In this work, we continue the study of
Externí odkaz:
http://arxiv.org/abs/2410.23052
The Automated Audio Captioning (AAC) task asks models to generate natural language descriptions of an audio input. Evaluating these machine-generated audio captions is a complex task that requires considering diverse factors, among them, auditory sce
Externí odkaz:
http://arxiv.org/abs/2409.12962
Autor:
Tulsiani, Hitesh, Chan, David M., Ghosh, Shalini, Lalwani, Garima, Pandey, Prabhat, Bansal, Ankish, Garimella, Sri, Rastrow, Ariya, Hoffmeister, Björn
Dialog systems, such as voice assistants, are expected to engage with users in complex, evolving conversations. Unfortunately, traditional automatic speech recognition (ASR) systems deployed in such applications are usually trained to recognize each
Externí odkaz:
http://arxiv.org/abs/2409.10515
Assessing personality traits using large language models (LLMs) has emerged as an interesting and challenging area of research. While previous methods employ explicit questionnaires, often derived from the Big Five model of personality, we hypothesiz
Externí odkaz:
http://arxiv.org/abs/2409.09905
Autor:
Wu, Tsung-Han, Biamby, Giscard, Quenum, Jerome, Gupta, Ritwik, Gonzalez, Joseph E., Darrell, Trevor, Chan, David M.
Large Multimodal Models (LMMs) have made significant strides in visual question-answering for single images. Recent advancements like long-context LMMs have allowed them to ingest larger, or even multiple, images. However, the ability to process a la
Externí odkaz:
http://arxiv.org/abs/2407.13766
Autor:
Moon, Suhong, Abdulhai, Marwa, Kang, Minwoo, Suh, Joseph, Soedarmadji, Widyadewi, Behar, Eran Kohen, Chan, David M.
Large language models (LLMs) are trained from vast repositories of text authored by millions of distinct authors, reflecting an enormous diversity of human traits. While these models bear the potential to be used as approximations of human subjects i
Externí odkaz:
http://arxiv.org/abs/2407.06576
Autor:
Chan, David, Vogeli, Chase
We compute the $RO(G)$-graded equivariant algebraic $K$-groups of a finite field with an action by its Galois group $G$. Specifically, we show these $K$-groups split as the sum of an explicitly computable term and the well-studied $RO(G)$-graded coef
Externí odkaz:
http://arxiv.org/abs/2406.19481
Autor:
Petryk, Suzanne, Chan, David M., Kachinthaya, Anish, Zou, Haodi, Canny, John, Gonzalez, Joseph E., Darrell, Trevor
Despite recent advances in multimodal pre-training for visual description, state-of-the-art models still produce captions containing errors, such as hallucinating objects not present in a scene. The existing prominent metric for object hallucination,
Externí odkaz:
http://arxiv.org/abs/2404.02904