Classification of descriptions and summary using multiple passes of statistical and natural language toolkits

Autor: Banthia, Saumya, Sharma, Anantha
Rok vydání: 2020
Předmět:
Druh dokumentu: Working Paper
Popis: This document describes a possible approach that can be used to check the relevance of a summary / definition of an entity with respect to its name. This classifier focuses on the relevancy of an entity's name to its summary / definition, in other words, it is a name relevance check. The percentage score obtained from this approach can be used either on its own or used to supplement scores obtained from other metrics to arrive upon a final classification; at the end of the document, potential improvements have also been outlined. The dataset that this document focuses on achieving an objective score is a list of package names and their respective summaries (sourced from pypi.org).
Comment: 9 pages, 9 figures
Databáze: arXiv