Zobrazeno 1 - 10
of 15
pro vyhledávání: '"Maillette De Buy Wenniger, Gideon"'
Autor:
Jacobs, Pieter Floris, Maillette De Buy Wenniger, Gideon, Wiering, Marco, Schomaker, Lambert, Leiva, L.A., Pruski, C, Markovich, R., Najjar, A., Schommer, C.
Publikováno v:
Communications in Computer and Information Science ISBN: 9783030938413
33rd Benelux Conference on Artificial Intelligence, BNAIC/Benelearn 2021: Esch-sur-Alzette, Luxembourg, November 10–12, 2021, Revised Selected Papers, 3-29
STARTPAGE=3;ENDPAGE=29;TITLE=33rd Benelux Conference on Artificial Intelligence, BNAIC/Benelearn 2021
33rd Benelux Conference on Artificial Intelligence, BNAIC/Benelearn 2021: Esch-sur-Alzette, Luxembourg, November 10–12, 2021, Revised Selected Papers, 3-29
STARTPAGE=3;ENDPAGE=29;TITLE=33rd Benelux Conference on Artificial Intelligence, BNAIC/Benelearn 2021
Labeling data can be an expensive task as it is usually performed manually by domain experts. This is cumbersome for deep learning, as it is dependent on large labeled datasets. Active learning (AL) is a paradigm that aims to reduce labeling effort b
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bb6d4c3652a0a142dbb2ad3a3e5136c8
https://doi.org/10.1007/978-3-030-93842-0_1
https://doi.org/10.1007/978-3-030-93842-0_1
Publikováno v:
Poncelas, Alberto ORCID: 0000-0002-5089-1687 , Maillette de Buy Wenniger, Gideon and Way, Andy ORCID: 0000-0001-5736-5930 (2018) Data selection with feature decay algorithms using an approximated target side. In: The 15th International Workshop on Spoken Language Translation 2018, 29-30 Oct 2018, Bruges, Belgium.
Way, Andy ORCID: 0000-0001-5736-5930, Poncelas, Alberto ORCID: 0000-0002-5089-1687 and Maillette de Buy Wenniger, Gideon (2018) Data selection with feature decay algorithms using an approximated target side. In: 15th International Workshop on Spoken Language Translation (IWSLT 2018), 29-30 Apr 2018, Bruges, Belgium.
Way, Andy ORCID: 0000-0001-5736-5930
Data selection techniques applied to neural machine trans-lation (NMT) aim to increase the performance of a model byretrieving a subset of sentences for use as training data.One of the possible data selection techniques are trans-ductive learning met
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::233b3bf15b66a4834d7760ec821bd4ca
http://arxiv.org/abs/1811.03039
http://arxiv.org/abs/1811.03039
Autor:
van Dongen, Thomas, Maillette de Buy Wenniger, Gideon, Schomaker, Lambert, Chandrasekaran, Muthu Kumar
Publikováno v:
Proceedings of the First Workshop on Scholarly Document Processing, 148-157
STARTPAGE=148;ENDPAGE=157;TITLE=Proceedings of the First Workshop on Scholarly Document Processing
STARTPAGE=148;ENDPAGE=157;TITLE=Proceedings of the First Workshop on Scholarly Document Processing
Predicting the number of citations of scholarly documents is an upcoming task in scholarly document processing. Besides the intrinsic merit of this information, it also has a wider use as an imperfect proxy for quality which has the advantage of bein
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7d08eaaeacf729c1f3653e245e297382
https://research.rug.nl/en/publications/d13d2b5a-db30-418e-b2a9-2e41951ba4ac
https://research.rug.nl/en/publications/d13d2b5a-db30-418e-b2a9-2e41951ba4ac
Publikováno v:
Natural Language Engineering; Jan2022, Vol. 28 Issue 1, p71-91, 21p
Publikováno v:
Machine Translation; Dec2015, Vol. 29 Issue 3/4, p225-265, 41p
Autor:
Alberto Poncelas, Dimitar Shterionov, Andy Way, Gideon Maillette De Buy Wenniger, Peyman Passban
Publikováno v:
RUA. Repositorio Institucional de la Universidad de Alicante
Universidad de Alicante (UA)
Scopus-Elsevier
Poncelas, Alberto ORCID: 0000-0002-5089-1687, Shterionov, Dimitar ORCID: 0000-0001-6300-797X , Way, Andy ORCID: 0000-0001-5736-5930 , Maillette de Buy Wenniger, Gideon and Passban, Peyman (2018) Investigating backtranslation in neural machine translation. In: 21st Annual Conference of The European Association for Machine Translation, 28-30 May 2018, Alicante, Spain.
Poncelas, Alberto ORCID: 0000-0002-5089-1687, Shterionov, Dimitar ORCID: 0000-0001-6300-797X , Way, Andy ORCID: 0000-0001-5736-5930 , Maillette de Buy Wenniger, Gideon and Passban, Peyman (2018) Investigating backtranslation in neural machine translation. In: 21st Annual Conference
Tilburg University-PURE
Universidad de Alicante (UA)
Scopus-Elsevier
Poncelas, Alberto ORCID: 0000-0002-5089-1687
Poncelas, Alberto ORCID: 0000-0002-5089-1687
Tilburg University-PURE
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (SMT) or Neural MT (NMT) – is the availability of high-quality parallel data. This is arguably more important today than ever before, as NMT has bee
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::694f2196d4c1b5f9e93f7ebf1b26b611
https://hdl.handle.net/10045/76085
https://hdl.handle.net/10045/76085
Publikováno v:
Natural Language Processing and Information Systems
Bhattacharjee, Santanu, Haque, Rejwanul ORCID: 0000-0003-1680-0099, Maillette de Buy Wenniger, Gideon and Way, Andy ORCID: 0000-0001-5736-5930 (2020) Investigating query expansion and coreference resolution in question answering on BERT. In: 25th International Conference on Natural Language & Information Systems (NLDB 2020)), 24-26 June 2020, Saarbrücken, Germany (Online). ISBN 978-3-030-51309-2
Natural Language Processing and Information Systems ISBN: 9783030513092
NLDB
Natural Language Processing and Information Systems-25th International Conference on Applications of Natural Language to Information Systems, NLDB 2020, Saarbrücken, Germany, June 24–26, 2020, Proceedings
Lecture Notes in Computer Science
Lecture Notes in Computer Science-Natural Language Processing and Information Systems
Bhattacharjee, Santanu, Haque, Rejwanul ORCID: 0000-0003-1680-0099
Natural Language Processing and Information Systems ISBN: 9783030513092
NLDB
Natural Language Processing and Information Systems-25th International Conference on Applications of Natural Language to Information Systems, NLDB 2020, Saarbrücken, Germany, June 24–26, 2020, Proceedings
Lecture Notes in Computer Science
Lecture Notes in Computer Science-Natural Language Processing and Information Systems
The Bidirectional Encoder Representations from Transformers (BERT) model produces state-of-the-art results in many question answering (QA) datasets, including the Stanford Question Answering Dataset (SQuAD). This paper presents a query expansion (QE)
Publikováno v:
Prague Bulletin of Mathematical Linguistics, Vol 108, Iss 1, Pp 245-256 (2017)
The Prague Bulletin of Mathematical Linguistics
Poncelas, Alberto ORCID: 0000-0002-5089-1687, Maillette de Buy Wenniger, Gideon and Way, Andy ORCID: 0000-0001-5736-5930 (2017) Applying N-gram alignment entropy to improve feature decay algorithms. The Prague Bulletin of Mathematical Linguistics (108). pp. 245-256. ISSN 0032-6585
The Prague Bulletin of Mathematical Linguistics
Poncelas, Alberto ORCID: 0000-0002-5089-1687
Data Selection is a popular step in Machine Translation pipelines. Feature Decay Algorithms (FDA) is a technique for data selection that has shown a good performance in several tasks. FDA aims to maximize the coverage of n-grams in the test set. Howe
Autor:
Poncelas, Alberto
Publikováno v:
Poncelas, Alberto ORCID: 0000-0002-5089-1687 (2019) Improving transductive data selection algorithms for machine translation. PhD thesis, Dublin City University.
In this work, we study different ways of improving Machine Translation models by using the subset of training data that is the most relevant to the test set. This is achieved by using Transductive Algoritms (TA) for data selection. In particular, we
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od_______119::30290dcc644a4074acd06ed70e8bf009
http://doras.dcu.ie/23726/
http://doras.dcu.ie/23726/
Autor:
Gideon Maillette de Buy Wenniger, Andy Way, Dimitar Shterionov, Maja Popović, Alberto Poncelas
Publikováno v:
Poncelas, Alberto ORCID: 0000-0002-5089-1687 , Popović, Maja ORCID: 0000-0001-8234-8745 , Shterionov, Dimitar ORCID: 0000-0001-6300-797X , Maillette de Buy Wenniger, Gideon and Way, Andy ORCID: 0000-0001-5736-5930 (2019) Combining SMT and NMT back-translated data for efficient NMT. In: Recent Advances in Natural Language Processing (RANLP 2019), 2-4 Sept 2019, Varna, Bulgaria.
Scopus-Elsevier
Tilburg University-PURE
RANLP
Scopus-Elsevier
Tilburg University-PURE
RANLP
Neural Machine Translation (NMT) models achieve their best performance when large sets of parallel data are used for training. Consequently, techniques for augmenting the training set have become popular recently. One of these methods is back-transla
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4969e950485870897dfabc3ba1475bcb
http://doras.dcu.ie/24272/
http://doras.dcu.ie/24272/