Výsledky vyhledávání - "Béchet, Nicolas"

Report

WikiNER-fr-gold: A Gold-Standard NER Corpus

Autor: Cao, Danrun, Béchet, Nicolas, Marteau, Pierre-François

We address in this article the the quality of the WikiNER corpus, a multilingual Named Entity Recognition corpus, and provide a consolidated version of it. The annotation of WikiNER was produced in a semi-supervised manner i.e. no manual verification

Externí odkaz: http://arxiv.org/abs/2411.00030

Zobrazit plný text záznamu

Report

Age Recommendation from Texts and Sentences for Children

Autor: Rahman, Rashedur, Lecorvé, Gwénolé, Béchet, Nicolas

Children have less text understanding capability than adults. Moreover, this capability differs among the children of different ages. Hence, automatically predicting a recommended age based on texts or sentences would be a great benefit to propose ad

Externí odkaz: http://arxiv.org/abs/2308.10586

Zobrazit plný text záznamu

Report

Hybrid Isolation Forest - Application to Intrusion Detection

Autor: Marteau, Pierre-François, Soheily-Khah, Saeid, Béchet, Nicolas

From the identification of a drawback in the Isolation Forest (IF) algorithm that limits its use in the scope of anomaly detection, we propose two extensions that allow to firstly overcome the previously mention limitation and secondly to provide it

Externí odkaz: http://arxiv.org/abs/1705.03800

Zobrazit plný text záznamu

Dissertation/ Thesis

Extraction et regroupement de descripteurs morpho-syntaxiques pour des processus de Fouille de Textes

Autor: Béchet, Nicolas

Les mots constituent l'un des fondements des langues naturelles de type indo-européenne. Des corpus rédigés avec ces langues sont alors naturellement décrits avec des mots. Cependant, l'information qu'ils véhiculent seuls est assez réduite d'un

Externí odkaz: http://tel.archives-ouvertes.fr/tel-00462206
http://tel.archives-ouvertes.fr/docs/00/46/22/06/PDF/These.pdf

Zobrazit plný text záznamu

Chinese public procurement document harvesting pipeline

Autor: Cao, Danrun, Ahmia, Oussama, Béchet, Nicolas, Marteau, Pierre-François

Publikováno v: DocEng '22: ACM Symposium on Document Engineering 2022
DocEng '22: ACM Symposium on Document Engineering 2022, Sep 2022, San Jose California, France. pp.1-4, ⟨10.1145/3558100.3563848⟩

International audience; We present a processing pipeline for Chinese public procurement document harvesting, with the aim of producing strategic data with greater added value. It consists of three micro-modules: data collection, information extractio

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a94f9d92ec97d720e4e948f4cae83d3d
https://hal.science/hal-03966541

Zobrazit plný text záznamu

Akademický článek

A hybrid approach to managing job offers and candidates

Autor: Kessler, Rémy, Béchet, Nicolas, Roche, Mathieu, Torres-Moreno, Juan-Manuel, El-Bèze, Marc

Publikováno v: In Information Processing and Management November 2012 48(6):1124-1135

Zobrazit plný text záznamu

TREMoLo-Tweets corpus : guide d'annotation pour un corpus annoté en registres de langue pour le français

Autor: Mekki, Jade, Battistelli, Delphine, Lecorvé, Gwénolé, Béchet, Nicolas

This work is part of the TREMoLo project dedicated to language registers (casual, neutral, and formal). Here, we present an annotation guide grounded on a linguistic analysis of language registers and Computer-Mediated Communications (CMCs). It gives

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::a36d0fa8144a86af0f2a49ffc7196a10
https://hal.archives-ouvertes.fr/hal-03218217

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

TREMoLo : un corpus multi-étiquettes de tweets en français pour la caractérisation des registres de langue

Autor: Mekki, Jade, Battistelli, Delphine, Béchet, Nicolas, Lecorvé, Gwénolé

Publikováno v: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale
Traitement Automatique des Langues Naturelles
Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.237-245

International audience; Des registres tels que familier, courant et soutenu sont un phénomène immédiatement perceptible par tout locuteur d’une langue. Ils restent encore peu étudiés en traitement des langues (TAL), en particulier en dehors de

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::ac237cff453d8f7e2dfb1235be0c3c96
https://hal.archives-ouvertes.fr/hal-03265873/document

Zobrazit plný text záznamu

Caractérisation de registres de langue par extraction de motifs séquentiels émergents

Autor: Mekki, Jade, Béchet, Nicolas, Battistelli, Delphine, Lecorvé, Gwénolé

Publikováno v: JADT 2020 : 15èmes Journées Internationales d'Analyse statistique des Données Textuelles
JADT 2020 : 15èmes Journées Internationales d'Analyse statistique des Données Textuelles, Jun 2020, Toulouse, France

International audience; Language registers are the highly perceptible characteristic of written or spoken communication. In this paper we present a methodology to automatically characterize language registers using statistical tool named "emerging se

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::04e76bdbe5eae42bcdc9cc0bb4088069
https://hal.archives-ouvertes.fr/hal-03078450/document

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání