Zobrazeno 1 - 10
of 29
pro vyhledávání: '"Matthias Gallé"'
Publikováno v:
Algorithms, Vol 4, Iss 4, Pp 262-284 (2011)
The smallest grammar problem—namely, finding a smallest context-free grammar that generates exactly one sequence—is of practical and theoretical importance in fields such as Kolmogorov complexity, data compression and pattern discovery. We propos
Externí odkaz:
https://doaj.org/article/67a87f6420214577a7b290ed7a2a4f18
Autor:
Vassilina Nikoulina, Maxat Tezekbayev, Nuradil Kozhakhmet, Madina Babazhanova, Matthias Gallé, Zhenisbek Assylbekov
There is an ongoing debate in the NLP community whether modern language models contain linguistic knowledge, recovered through so-called probes. In this paper, we study whether linguistic knowledge is a necessary condition for the good performance of
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1d1e1ae8b72966cdf13e3e53f3b07acd
http://arxiv.org/abs/2103.01819
http://arxiv.org/abs/2103.01819
Publikováno v:
EACL (System Demonstrations)
It is standard procedure these days to solve Information Extraction task by fine-tuning large pre-trained language models. This is not the case for generation task, which relies on a variety of techniques for controlled language generation. In this p
Publikováno v:
Philip, J, Berard, A, Gallé, M & Besacier, L 2020, Monolingual Adapters for Zero-Shot Neural Machine Translation . in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) . Online, pp. 4465-4470, The 2020 Conference on Empirical Methods in Natural Language Processing, Virtual conference, 16/11/20 . https://doi.org/10.18653/v1/2020.emnlp-main.361
EMNLP (Empirical Methods for Natural Language Processing)
EMNLP (Empirical Methods for Natural Language Processing), Nov 2020, Virtual, France
EMNLP (1)
EMNLP (Empirical Methods for Natural Language Processing)
EMNLP (Empirical Methods for Natural Language Processing), Nov 2020, Virtual, France
EMNLP (1)
International audience; We propose a novel adapter layer formalism for adapting multilingual models. They are more parameter-efficient than existing adapter layers while obtaining as good or better performance. The layers are specific to one language
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::42acc31213743e9ee9b05dbffd32168b
https://www.pure.ed.ac.uk/ws/files/192697520/Monolingual_Adapters_PHILIP_DOA14092020_VOR_CC_BY.pdf
https://www.pure.ed.ac.uk/ws/files/192697520/Monolingual_Adapters_PHILIP_DOA14092020_VOR_CC_BY.pdf
Publikováno v:
NLP4COVID@EMNLP
We release a multilingual neural machine translation model, which can be used to translate text in the biomedical domain. The model can translate from 5 languages (French, German, Italian, Korean and Spanish) into English. It is trained with large am
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::776e5ebde2f6c2a4c94b2a18a95dd00e
Publikováno v:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, Apr 2021, Online, Unknown Region. pp.1646--1662
EACL
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, Apr 2021, Online, Unknown Region. pp.1646--1662
EACL
We address the problem of unsupervised abstractive summarization of collections of user generated reviews with self-supervision and control. We propose a self-supervised setup that considers an individual document as a target summary for a set of sim
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f31ec1519dbca3c93d878e46d93357b7
Autor:
Matthias Gallé, Matías Tealdi
Publikováno v:
Journal of Discrete Algorithms. 48:1-16
The context in which a substring appears is an important notion to identify – for example – its semantic meaning. However, existing definitions from stringology fail to model the context explicitly. We introduce here xkcd-repeats, a new family of
Publikováno v:
Web Intelligence. 15:189-203
The World Wide Web contains a large quantity of community created knowledge of instructional nature. Similarly, in a commercial setting, databases of instructions are used by customer-care providers to guide clients in the resolution of issues. Most
Publikováno v:
2nd Workshop on New Frontiers in Summarization
2nd Workshop on New Frontiers in Summarization, Nov 2019, Hong-Kong, China. pp.42--47, ⟨10.18653/v1/D19-5405⟩
2nd Workshop on New Frontiers in Summarization, Nov 2019, Hong-Kong, China. pp.42--47, ⟨10.18653/v1/D19-5405⟩
User-generated reviews of products or services provide valuable information to customers. However, it is often impossible to read each of the potentially thousands of reviews: it would therefore save valuable time to provide short summaries of their
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::181d5dd8f538774b327f98bc636b8988
https://hal.archives-ouvertes.fr/hal-02874864
https://hal.archives-ouvertes.fr/hal-02874864
Autor:
Pierre Daix-Moreux, Matthias Gallé
Publikováno v:
TextGraphs@EMNLP
Word embeddings continue to be of great use for NLP researchers and practitioners due to their training speed and easiness of use and distribution. Prior work has shown that the representation of those words can be improved by the use of semantic kno