Výsledky vyhledávání - "Song, Haiyue"

Report

Connecting Ideas in 'Lower-Resource' Scenarios: NLP for National Varieties, Creoles and Other Low-resource Scenarios

Autor: Joshi, Aditya, Kanojia, Diptesh, Lent, Heather, Kaing, Hour, Song, Haiyue

Despite excellent results on benchmarks over a small subset of languages, large language models struggle to process text from languages situated in `lower-resource' scenarios such as dialects/sociolects (national or social varieties of a language), C

Externí odkaz: http://arxiv.org/abs/2409.12683

Zobrazit plný text záznamu

Report

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Autor: Romero, David, Lyu, Chenyang, Wibowo, Haryo Akbarianto, Lynn, Teresa, Hamed, Injy, Kishore, Aditya Nanda, Mandal, Aishik, Dragonetti, Alina, Abzaliev, Artem, Tonja, Atnafu Lambebo, Balcha, Bontu Fufa, Whitehouse, Chenxi, Salamea, Christian, Velasco, Dan John, Adelani, David Ifeoluwa, Meur, David Le, Villa-Cueva, Emilio, Koto, Fajri, Farooqui, Fauzan, Belcavello, Frederico, Batnasan, Ganzorig, Vallejo, Gisela, Caulfield, Grainne, Ivetta, Guido, Song, Haiyue, Ademtew, Henok Biadglign, Maina, Hernán, Lovenia, Holy, Azime, Israel Abebe, Cruz, Jan Christian Blaise, Gala, Jay, Geng, Jiahui, Ortiz-Barajas, Jesus-German, Baek, Jinheon, Dunstan, Jocelyn, Alemany, Laura Alonso, Nagasinghe, Kumaranage Ravindu Yasas, Benotti, Luciana, D'Haro, Luis Fernando, Viridiano, Marcelo, Estecha-Garitagoitia, Marcos, Cabrera, Maria Camila Buitrago, Rodríguez-Cantelar, Mario, Jouitteau, Mélanie, Mihaylov, Mihail, Imam, Mohamed Fazli Mohamed, Adilazuarda, Muhammad Farid, Gochoo, Munkhjargal, Otgonbold, Munkh-Erdene, Etori, Naome, Niyomugisha, Olivier, Silva, Paula Mónica, Chitale, Pranjal, Dabre, Raj, Chevi, Rendi, Zhang, Ruochen, Diandaru, Ryandito, Cahyawijaya, Samuel, Góngora, Santiago, Jeong, Soyeong, Purkayastha, Sukannya, Kuribayashi, Tatsuki, Jayakumar, Thanmay, Torrent, Tiago Timponi, Ehsan, Toqeer, Araujo, Vladimir, Kementchedjhieva, Yova, Burzo, Zara, Lim, Zheng Wei, Yong, Zheng Xin, Ignat, Oana, Nwatu, Joan, Mihalcea, Rada, Solorio, Thamar, Aji, Alham Fikri

Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA

Externí odkaz: http://arxiv.org/abs/2406.05967

Zobrazit plný text záznamu

Report

Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph Networks

Autor: Fu, Yahui, Song, Haiyue, Zhao, Tianyu, Kawahara, Tatsuya

Personality recognition is useful for enhancing robots' ability to tailor user-adaptive responses, thus fostering rich human-robot interactions. One of the challenges in this task is a limited number of speakers in existing dialogue corpora, which ha

Externí odkaz: http://arxiv.org/abs/2401.05871

Zobrazit plný text záznamu

Report

Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture Transcripts

Autor: Song, Haiyue, Dabre, Raj, Chu, Chenhui, Fujita, Atsushi, Kurohashi, Sadao

Lecture transcript translation helps learners understand online courses, however, building a high-quality lecture machine translation system lacks publicly available parallel corpora. To address this, we examine a framework for parallel corpus mining

Externí odkaz: http://arxiv.org/abs/2311.03696

Zobrazit plný text záznamu

Report

SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation

Autor: Song, Haiyue, Dabre, Raj, Chu, Chenhui, Kurohashi, Sadao, Sumita, Eiichiro

Sub-word segmentation is an essential pre-processing step for Neural Machine Translation (NMT). Existing work has shown that neural sub-word segmenters are better than Byte-Pair Encoding (BPE), however, they are inefficient as they require parallel c

Externí odkaz: http://arxiv.org/abs/2307.16400

Zobrazit plný text záznamu

Report

Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation

Autor: Mao, Zhuoyuan, Song, Haiyue, Dabre, Raj, Chu, Chenhui, Kurohashi, Sadao

The language-independency of encoded representations within multilingual neural machine translation (MNMT) models is crucial for their generalization ability on zero-shot translation. Neural interlingua representations have been shown as an effective

Externí odkaz: http://arxiv.org/abs/2305.10190

Zobrazit plný text záznamu

Report

Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation

Autor: Mao, Zhuoyuan, Dabre, Raj, Liu, Qianying, Song, Haiyue, Chu, Chenhui, Kurohashi, Sadao

This paper studies the impact of layer normalization (LayerNorm) on zero-shot translation (ZST). Recent efforts for ZST often utilize the Transformer architecture as the backbone, with LayerNorm at the input of layers (PreNorm) set as the default. Ho

Externí odkaz: http://arxiv.org/abs/2305.09312

Zobrazit plný text záznamu

Report

GPT-RE: In-context Learning for Relation Extraction using Large Language Models

Autor: Wan, Zhen, Cheng, Fei, Mao, Zhuoyuan, Liu, Qianying, Song, Haiyue, Li, Jiwei, Kurohashi, Sadao

In spite of the potential for ground-breaking achievements offered by large language models (LLMs) (e.g., GPT-3), they still lag significantly behind fully-supervised baselines (e.g., fine-tuned BERT) in relation extraction (RE). This is due to the t

Externí odkaz: http://arxiv.org/abs/2305.02105

Zobrazit plný text záznamu

Report

Relation Extraction with Weighted Contrastive Pre-training on Distant Supervision

Autor: Wan, Zhen, Cheng, Fei, Liu, Qianying, Mao, Zhuoyuan, Song, Haiyue, Kurohashi, Sadao

Contrastive pre-training on distant supervision has shown remarkable effectiveness in improving supervised relation extraction tasks. However, the existing methods ignore the intrinsic noise of distant supervision during the pre-training stage. In th

Externí odkaz: http://arxiv.org/abs/2205.08770

Zobrazit plný text záznamu

Report

When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?

Autor: Mao, Zhuoyuan, Chu, Chenhui, Dabre, Raj, Song, Haiyue, Wan, Zhen, Kurohashi, Sadao

Word alignment has proven to benefit many-to-many neural machine translation (NMT). However, high-quality ground-truth bilingual dictionaries were used for pre-editing in previous methods, which are unavailable for most language pairs. Meanwhile, the

Externí odkaz: http://arxiv.org/abs/2204.12165

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání