Showing 1 - 10 of 359 for the search: '"Specia, Lucia"'
Author:
Luo, Haoyan, Specia, Lucia
Transformer-based Large Language Models (LLMs) traditionally rely on final-layer loss for training and final-layer representations for predictions, potentially overlooking the predictive power embedded in intermediate layers. Surprisingly, we find that…
External link:
http://arxiv.org/abs/2410.13077
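The snippet above notes that predictions can be read out from intermediate layers, not only the final one. A minimal numpy sketch of that idea, in the style of a logit-lens read-out: project each layer's hidden state through a shared unembedding matrix and take the argmax. All shapes and weights here are toy stand-ins, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy transformer state: hidden size 8, vocab 5, 4 layers.
# In a real LLM these would come from the model's residual stream.
hidden_size, vocab_size, num_layers = 8, 5, 4
unembed = rng.normal(size=(hidden_size, vocab_size))  # shared output projection
hidden_states = [rng.normal(size=hidden_size) for _ in range(num_layers)]

def layer_prediction(h, W):
    """Project an intermediate hidden state through the unembedding
    and return the argmax token id (logit-lens style read-out)."""
    logits = h @ W
    return int(np.argmax(logits))

# A prediction is available at every layer, not just the last one.
per_layer = [layer_prediction(h, unembed) for h in hidden_states]
print(per_layer)  # one token id per layer
```

With real model weights, comparing these per-layer predictions to the final-layer one is what reveals how much predictive signal the intermediate layers already carry.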
Pretrained language models have significantly advanced performance across various natural language processing tasks. However, adversarial attacks continue to pose a critical challenge to systems built using these models, as they can be exploited with…
External link:
http://arxiv.org/abs/2407.00248
Author:
Wang, Guorun, Specia, Lucia
Text-to-image models are known to propagate social biases. For example, when prompted to generate images of people in certain professions, these models tend to systematically generate specific genders or ethnicities. In this paper, we show that this…
External link:
http://arxiv.org/abs/2407.11002
Author:
Luo, Haoyan, Specia, Lucia
Explainability for Large Language Models (LLMs) is a critical yet challenging aspect of natural language processing. As LLMs are increasingly integral to diverse applications, their "black-box" nature sparks significant concerns regarding transparency…
External link:
http://arxiv.org/abs/2401.12874
Neural conditional language generation models achieve the state of the art in Neural Machine Translation (NMT) but are highly dependent on the quality of the parallel training dataset. When trained on low-quality datasets, these models are prone to various…
External link:
http://arxiv.org/abs/2211.09878
Scene Text Recognition (STR) models have achieved high performance in recent years on benchmark datasets where text images are presented with minimal noise. Traditional STR recognition pipelines take a cropped image as sole input and attempt to identify…
External link:
http://arxiv.org/abs/2210.10836
Despite recent progress in video and language representation learning, the weak or sparse correspondence between the two modalities remains a bottleneck in the area. Most video-language models are trained via pair-level loss to predict whether a pair…
External link:
http://arxiv.org/abs/2210.05039
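The snippet above mentions training via a pair-level loss that predicts whether a (video, text) pair matches. A minimal numpy sketch of one common form of such a loss, binary cross-entropy on a dot-product similarity; the embeddings and function name below are illustrative, not taken from the paper.

```python
import numpy as np

def pair_level_loss(video_emb, text_emb, label):
    """Binary cross-entropy on the similarity of one (video, text) pair:
    label 1 = matching pair, label 0 = mismatched pair."""
    score = float(video_emb @ text_emb)    # dot-product similarity
    prob = 1.0 / (1.0 + np.exp(-score))    # sigmoid -> match probability
    eps = 1e-12                            # numerical safety for log
    return -(label * np.log(prob + eps) + (1 - label) * np.log(1 - prob + eps))

v = np.array([0.5, 1.0, -0.2])
t_pos = np.array([0.6, 0.9, -0.1])   # aligned caption embedding
t_neg = np.array([-0.7, -1.1, 0.3])  # unrelated caption embedding

# A matching pair with a high similarity should incur lower loss than a mismatch.
print(pair_level_loss(v, t_pos, 1), pair_level_loss(v, t_neg, 1))
```

The limitation the abstract alludes to is that this supervision is per pair only, so the weak or sparse alignment between the modalities is never modeled below the pair level.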
Author:
Anuchitanukul, Atijit, Specia, Lucia
We present Burst2Vec, our multi-task learning approach to predict emotion, age, and origin (i.e., native country/language) from vocal bursts. Burst2Vec utilises pre-trained speech representations to capture acoustic information from raw waveforms and…
External link:
http://arxiv.org/abs/2206.12469
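The Burst2Vec snippet describes multi-task prediction of emotion, age, and origin from one shared speech representation. A minimal numpy sketch of that pattern, with a random vector standing in for the pre-trained encoder's output; all dimensions and class counts are assumptions for illustration, not the paper's configuration.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical shape: a pre-trained speech encoder would produce this vector.
feat_dim = 16
shared_repr = rng.normal(size=feat_dim)  # stand-in for a vocal-burst embedding

# One linear head per task, all reading the same shared representation.
heads = {
    "emotion": rng.normal(size=(feat_dim, 10)),  # e.g. 10 emotion classes
    "age":     rng.normal(size=(feat_dim, 1)),   # scalar regression target
    "origin":  rng.normal(size=(feat_dim, 6)),   # e.g. 6 native countries
}

predictions = {task: shared_repr @ W for task, W in heads.items()}
for task, out in predictions.items():
    print(task, out.shape)
```

Sharing the encoder across tasks is what makes this multi-task: each head adds only a small projection, while the acoustic features are learned once.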
Recent efforts within the AI community have yielded impressive results towards "soft theorem proving" over natural language sentences using language models. We propose a novel, generative adversarial framework for probing and improving these models'…
External link:
http://arxiv.org/abs/2205.00047