Zobrazeno 1 - 10
of 72
pro vyhledávání: '"Béchet, Nicolas"'
We address in this article the the quality of the WikiNER corpus, a multilingual Named Entity Recognition corpus, and provide a consolidated version of it. The annotation of WikiNER was produced in a semi-supervised manner i.e. no manual verification
Externí odkaz:
http://arxiv.org/abs/2411.00030
Children have less text understanding capability than adults. Moreover, this capability differs among the children of different ages. Hence, automatically predicting a recommended age based on texts or sentences would be a great benefit to propose ad
Externí odkaz:
http://arxiv.org/abs/2308.10586
From the identification of a drawback in the Isolation Forest (IF) algorithm that limits its use in the scope of anomaly detection, we propose two extensions that allow to firstly overcome the previously mention limitation and secondly to provide it
Externí odkaz:
http://arxiv.org/abs/1705.03800
Publikováno v:
DocEng '22: ACM Symposium on Document Engineering 2022
DocEng '22: ACM Symposium on Document Engineering 2022, Sep 2022, San Jose California, France. pp.1-4, ⟨10.1145/3558100.3563848⟩
DocEng '22: ACM Symposium on Document Engineering 2022, Sep 2022, San Jose California, France. pp.1-4, ⟨10.1145/3558100.3563848⟩
International audience; We present a processing pipeline for Chinese public procurement document harvesting, with the aim of producing strategic data with greater added value. It consists of three micro-modules: data collection, information extractio
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a94f9d92ec97d720e4e948f4cae83d3d
https://hal.science/hal-03966541
https://hal.science/hal-03966541
Publikováno v:
In Information Processing and Management November 2012 48(6):1124-1135
This work is part of the TREMoLo project dedicated to language registers (casual, neutral, and formal). Here, we present an annotation guide grounded on a linguistic analysis of language registers and Computer-Mediated Communications (CMCs). It gives
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::a36d0fa8144a86af0f2a49ffc7196a10
https://hal.archives-ouvertes.fr/hal-03218217
https://hal.archives-ouvertes.fr/hal-03218217
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale
Traitement Automatique des Langues Naturelles
Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.237-245
Traitement Automatique des Langues Naturelles
Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.237-245
International audience; Des registres tels que familier, courant et soutenu sont un phénomène immédiatement perceptible par tout locuteur d’une langue. Ils restent encore peu étudiés en traitement des langues (TAL), en particulier en dehors de
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::ac237cff453d8f7e2dfb1235be0c3c96
https://hal.archives-ouvertes.fr/hal-03265873/document
https://hal.archives-ouvertes.fr/hal-03265873/document
Publikováno v:
JADT 2020 : 15èmes Journées Internationales d'Analyse statistique des Données Textuelles
JADT 2020 : 15èmes Journées Internationales d'Analyse statistique des Données Textuelles, Jun 2020, Toulouse, France
JADT 2020 : 15èmes Journées Internationales d'Analyse statistique des Données Textuelles, Jun 2020, Toulouse, France
International audience; Language registers are the highly perceptible characteristic of written or spoken communication. In this paper we present a methodology to automatically characterize language registers using statistical tool named "emerging se
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::04e76bdbe5eae42bcdc9cc0bb4088069
https://hal.archives-ouvertes.fr/hal-03078450/document
https://hal.archives-ouvertes.fr/hal-03078450/document
Autor:
Lecorvé, Gwénolé, Ayats, Hugo, Fournier, Benoît, Mekki, Jade, Chevelu, Jonathan, Battistelli, Delphine, Béchet, Nicolas
Publikováno v:
International Conference on Computational Linguistics and Intelligent Text Processing (CICLing)
International Conference on Computational Linguistics and Intelligent Text Processing (CICLing), Apr 2019, La Rochelle, France
Computational Linguistics and Intelligent Text Processing: 20th International Conference, CICLing 2019
International Conference on Computational Linguistics and Intelligent Text Processing (CICLing), Apr 2019, La Rochelle, France. pp.480-492, ⟨10.1007/978-3-031-24337-0_34⟩
Computational Linguistics and Intelligent Text Processing ISBN: 9783031243363
International Conference on Computational Linguistics and Intelligent Text Processing (CICLing), Apr 2019, La Rochelle, France
Computational Linguistics and Intelligent Text Processing: 20th International Conference, CICLing 2019
International Conference on Computational Linguistics and Intelligent Text Processing (CICLing), Apr 2019, La Rochelle, France. pp.480-492, ⟨10.1007/978-3-031-24337-0_34⟩
Computational Linguistics and Intelligent Text Processing ISBN: 9783031243363
International audience; Language registers are a strongly perceptible characteristic of texts and speeches. However, they are still poorly studied in natural language processing. In this paper, we present a semi-supervised approach which jointly buil
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::77864543f2cf3fc951bc07811e0081b9
https://hal.science/hal-02064694/document
https://hal.science/hal-02064694/document