Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Leopoldo Pla Sempere"'
Autor:
Sergio Ortiz Rojas, Marek Strelec, Amir Kamran, Pinzhen Chen, Jaume Zaragoza, William Waites, Kenneth Heafield, Marta Bañón, Philipp Koehn, Hieu Hoang, Leopoldo Pla Sempere, Brian Thompson, Dion Wiggins, Elsa Sarrías, Faheem Kirefu, Gema Ramírez-Sánchez, Mikel L. Forcada, Barry Haddow, Miquel Esplà-Gomis
Publikováno v:
Bañón, M, Chen, P, Haddow, B, Heafield, K, Hoang, H, Esplà-Gomis, M, Forcada, M, Kamran, A, Kirefu, F, Koehn, P, Ortiz-Rojas, S, Pla, L, Ramírez-Sánchez, G, Sarrías, E, Strelec, M, Thompson, B, Waites, W, Wiggins, D & Zaragoza, J 2020, ParaCrawl: Web-Scale Acquisition of Parallel Corpora . in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics . pp. 4555–4567, 2020 Annual Conference of the Association for Computational Linguistics, Virtual conference, Washington, United States, 5/07/20 . https://doi.org/10.18653/v1/2020.acl-main.417
ACL
ACL
We report on methods to create the largest publicly available parallel corpora by crawling the web, using open source software. We empirically compare alternative methods and publish benchmark data sets for sentence alignment and sentence pair filter
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b9f4c40d019a0f748816f44701b90ba6
https://hdl.handle.net/20.500.11820/aeb1138d-856e-477a-9ea0-f3ee5900cab1
https://hdl.handle.net/20.500.11820/aeb1138d-856e-477a-9ea0-f3ee5900cab1
Autor:
Rik van Noord, Cristian Garcia-Romero, Miquel Esplà-Gomis, Leopoldo Pla Sempere, Antonio Toral
Publikováno v:
University of Groningen
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::a5e1c4bec3935df6a881efbfd405d769
https://research.rug.nl/en/publications/2394a04f-e9bd-423c-8ec7-aede50eb9308
https://research.rug.nl/en/publications/2394a04f-e9bd-423c-8ec7-aede50eb9308