Multi-level mining and visualization of scientific text collections

Authors: Pablo Accuosto, Horacio Saggion, Francesco Ronzano, Daniel Ferrés
Year of publication: 2017
Subject:
Source: WOSP@JCDL
DOI: 10.1145/3127526.3127529
Description: We present a system to mine and visualize collections of scientific documents by semantically browsing information extracted from single publications or aggregated across corpora of articles. The text mining tool performs deep analysis of document collections, allowing the extraction and interpretation of research papers' contents. In addition to the extraction and enrichment of documents with metadata (titles, authors, affiliations, etc.), the deep analysis comprises semantic interpretation, rhetorical analysis of sentences, triple-based information extraction, and text summarization. The visualization components allow geography-based exploration of collections, topic-evolution interpretation, and collaborative network analysis, among others. The paper presents a case study of a bilingual collection in the field of Natural Language Processing (NLP).
Database: OpenAIRE