Výsledky vyhledávání - "Padfield, A."

Report

Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities

Autor: Zayats, Vicky, Chen, Peter, Ferrari, Melissa, Padfield, Dirk

Integrating multiple generative foundation models, especially those trained on different modalities, into something greater than the sum of its parts poses significant challenges. Two key hurdles are the availability of aligned data (concepts that co

Externí odkaz: http://arxiv.org/abs/2405.18669

Zobrazit plný text záznamu

Report

AudioPaLM: A Large Language Model That Can Speak and Listen

We introduce AudioPaLM, a large language model for speech understanding and generation. AudioPaLM fuses text-based and speech-based language models, PaLM-2 [Anil et al., 2023] and AudioLM [Borsos et al., 2022], into a unified multimodal architecture

Externí odkaz: http://arxiv.org/abs/2306.12925

Zobrazit plný text záznamu

Report

MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup

Autor: Shen, Hua, Zayats, Vicky, Rocholl, Johann C., Walker, Daniel D., Padfield, Dirk

Current disfluency detection models focus on individual utterances each from a single speaker. However, numerous discontinuity phenomena in spoken conversational transcripts occur across multiple turns, hampering human readability and the performance

Externí odkaz: http://arxiv.org/abs/2305.12029

Zobrazit plný text záznamu

Report

Chronological Self-Training for Real-Time Speaker Diarization

Autor: Padfield, Dirk, Liebling, Daniel J.

Publikováno v: Proc. Interspeech (2021) 4613-4617

Diarization partitions an audio stream into segments based on the voices of the speakers. Real-time diarization systems that include an enrollment step should limit enrollment training samples to reduce user interaction time. Although training on a s

Externí odkaz: http://arxiv.org/abs/2208.03393

Zobrazit plný text záznamu

Report

Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection

Autor: Chen, Angelica, Zayats, Vicky, Walker, Daniel D., Padfield, Dirk

In modern interactive speech-based systems, speech is consumed and transcribed incrementally prior to having disfluencies removed. This post-processing step is crucial for producing clean transcripts and high performance on downstream tasks (e.g. mac

Externí odkaz: http://arxiv.org/abs/2205.00620

Zobrazit plný text záznamu

Akademický článek

A combined EEG motor and speech imagery paradigm with automated successive halving for customizable command selection.

Autor: Padfield, Natasha¹ (AUTHOR) natasha.padfield@um.edu.mt, Camilleri, Tracey² (AUTHOR), Fabri, Simon² (AUTHOR), Bugeja, Marvin² (AUTHOR), Camilleri, Kenneth^1,2 (AUTHOR)

Publikováno v: Brain-Computer Interfaces. Jun-Sep2024, Vol. 11 Issue 3, p125-142. 18p.

Zobrazit plný text záznamu

Plný text ve formátu HTML

Report

Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech

Autor: Tomanek, Katrin, Zayats, Vicky, Padfield, Dirk, Vaillancourt, Kara, Biadsy, Fadi

Automatic Speech Recognition (ASR) systems are often optimized to work best for speakers with canonical speech patterns. Unfortunately, these systems perform poorly when tested on atypical speech and heavily accented speech. It has previously been sh

Externí odkaz: http://arxiv.org/abs/2109.06952

Zobrazit plný text záznamu

Akademický článek

Contrary effects of increasing temperatures on the spread of antimicrobial resistance in river biofilms

Autor: Kenyum Bagra, David Kneis, Daniel Padfield, Edina Szekeres, Adela Teban-Man, Cristian Coman, Gargi Singh, Thomas U. Berendonk, Uli Klümper

Publikováno v: mSphere, Vol 9, Iss 2 (2024)

ABSTRACT River microbial communities regularly act as the first barrier of defense against the spread of antimicrobial resistance genes (ARGs) that enter environmental microbiomes through wastewater. However, how the invasion dynamics of wastewater-b

Externí odkaz: https://doaj.org/article/4c3bb2d9dcdf49c3a0e5d800717c8551

Zobrazit plný text záznamu

Report

Sentence Boundary Augmentation For Neural Machine Translation Robustness

Autor: Li, Daniel, I, Te, Arivazhagan, Naveen, Cherry, Colin, Padfield, Dirk

Neural Machine Translation (NMT) models have demonstrated strong state of the art performance on translation tasks where well-formed training and evaluation data are provided, but they remain sensitive to inputs that include errors of various types.

Externí odkaz: http://arxiv.org/abs/2010.11132

Zobrazit plný text záznamu

Report

Designing Mid-Air Haptic Gesture Controlled User Interfaces for Cars

Autor: Young, Gareth, Milne, Hamish, Griffiths, Daniel, Padfield, Elliot, Blenkinsopp, Robert, Georgiou, Orestis

We present advancements in the design and development of in-vehicle infotainment systems that utilize gesture input and ultrasonic mid-air haptic feedback. Such systems employ state-of-the-art hand tracking technology and novel haptic feedback techno

Externí odkaz: http://arxiv.org/abs/2005.08535

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání