Výsledky vyhledávání - "A. Goldwater"

Report

Autor: Haley, Coleman, Goldwater, Sharon, Ponti, Edoardo

We propose a grounded approach to meaning in language typology. We treat data from perceptual modalities, such as images, as a language-agnostic representation of meaning. Hence, we can quantify the function--form relationship between images and capt

Externí odkaz: http://arxiv.org/abs/2412.10369

Zobrazit plný text záznamu

Report

Bottom-Up and Top-Down Analysis of Values, Agendas, and Observations in Corpora and LLMs

Autor: Friedman, Scott E., Benkler, Noam, Mosaphir, Drisana, Rye, Jeffrey, Schmer-Galunder, Sonja M., Goldwater, Micah, McLure, Matthew, Wheelock, Ruta, Gottlieb, Jeremy, Goldman, Robert P., Miller, Christopher

Large language models (LLMs) generate diverse, situated, persuasive texts from a plurality of potential perspectives, influenced heavily by their prompts and training data. As part of LLM adoption, we seek to characterize - and ideally, manage - the

Externí odkaz: http://arxiv.org/abs/2411.05040

Zobrazit plný text záznamu

Report

Orthogonality and isotropy of speaker and phonetic information in self-supervised speech representations

Autor: Mohamed, Mukhtar, Liu, Oli Danyi, Tang, Hao, Goldwater, Sharon

Self-supervised speech representations can hugely benefit downstream speech technologies, yet the properties that make them useful are still poorly understood. Two candidate properties related to the geometry of the representation space have been hyp

Externí odkaz: http://arxiv.org/abs/2406.09200

Zobrazit plný text záznamu

Report

Estimating the Level of Dialectness Predicts Interannotator Agreement in Multi-dialect Arabic Datasets

Autor: Keleg, Amr, Magdy, Walid, Goldwater, Sharon

On annotating multi-dialect Arabic datasets, it is common to randomly assign the samples across a pool of native Arabic speakers. Recent analyses recommended routing dialectal samples to native speakers of their respective dialects to build higher-qu

Externí odkaz: http://arxiv.org/abs/2405.11282

Zobrazit plný text záznamu

Report

A predictive learning model can simulate temporal dynamics and context effects found in neural representations of continuous speech

Autor: Liu, Oli Danyi, Tang, Hao, Feldman, Naomi, Goldwater, Sharon

Speech perception involves storing and integrating sequentially presented items. Recent work in cognitive neuroscience has identified temporal and contextual characteristics in humans' neural encoding of speech that may facilitate this temporal proce

Externí odkaz: http://arxiv.org/abs/2405.08237

Zobrazit plný text záznamu

Report

ALDi: Quantifying the Arabic Level of Dialectness of Text

Autor: Keleg, Amr, Goldwater, Sharon, Magdy, Walid

Transcribed speech and user-generated text in Arabic typically contain a mixture of Modern Standard Arabic (MSA), the standardized language taught in schools, and Dialectal Arabic (DA), used in daily communications. To handle this variation, previous

Externí odkaz: http://arxiv.org/abs/2310.13747

Zobrazit plný text záznamu

Report

Enable people to identify science news based on retracted articles on social media

Autor: Yaqub, Waheeb, Kay, Judy, Goldwater, Micah

For many people, social media is an important way to consume news on important topics like health. Unfortunately, some influential health news is misinformation because it is based on retracted scientific work. Ours is the first work to explore how p

Externí odkaz: http://arxiv.org/abs/2309.00912

Zobrazit plný text záznamu

Report

Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling

Autor: Sanabria, Ramon, Klejch, Ondrej, Tang, Hao, Goldwater, Sharon

Acoustic word embeddings are typically created by training a pooling function using pairs of word-like units. For unsupervised systems, these are mined using k-nearest neighbor (KNN) search, which is slow. Recently, mean-pooled representations from a

Externí odkaz: http://arxiv.org/abs/2306.02153

Zobrazit plný text záznamu

Report

Self-supervised Predictive Coding Models Encode Speaker and Phonetic Information in Orthogonal Subspaces

Autor: Liu, Oli, Tang, Hao, Goldwater, Sharon

Self-supervised speech representations are known to encode both speaker and phonetic information, but how they are distributed in the high-dimensional space remains largely unexplored. We hypothesize that they are encoded in orthogonal subspaces, a p

Externí odkaz: http://arxiv.org/abs/2305.12464

Zobrazit plný text záznamu

Akademický článek

Anecdotes impact medical decisions even when presented with statistical information or decision aids

Autor: Emily N. Line, Sara Jaramillo, Micah Goldwater, Zachary Horne

Publikováno v: Cognitive Research, Vol 9, Iss 1, Pp 1-23 (2024)

Abstract People are inundated with popular press reports about medical research concerning what is healthy, get advice from doctors, and hear personal anecdotes. How do people integrate conflicting anecdotal and statistical information when making me

Externí odkaz: https://doaj.org/article/2f08e71db3be4a4ab31a87e3cf831cb4

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání