BERTje: A Dutch BERT Model

Autor:	de Vries, Wietse, van Cranenburgh, Andreas, Bisazza, Arianna, Caselli, Tommaso, van Noord, Gertjan, Nissim, Malvina
Rok vydání:	2019
Předmět:	Computer Science - Computation and Language
Druh dokumentu:	Working Paper
Popis:	The transformer-based pre-trained language model BERT has helped to improve state-of-the-art performance on many natural language processing (NLP) tasks. Using the same architecture and parameters, we developed and evaluated a monolingual Dutch BERT model called BERTje. Compared to the multilingual BERT model, which includes Dutch but is only based on Wikipedia text, BERTje is based on a large and diverse dataset of 2.4 billion tokens. BERTje consistently outperforms the equally-sized multilingual BERT model on downstream NLP tasks (part-of-speech tagging, named-entity recognition, semantic role labeling, and sentiment analysis). Our pre-trained Dutch BERT model is made available at https://github.com/wietsedv/bertje.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/1912.09582 Zobrazit plný text záznamu View this record from Arxiv