Automated Business Goal Extraction from E-mail Repositories to Bootstrap Business Understanding
Autor: | Marcin Kais, Vincent Jorn Menger, Marco Spruit |
---|---|
Rok vydání: | 2021 |
Předmět: |
Computer Networks and Communications
Computer science Process (engineering) knowledge discovery Information technology business understanding business goal generation T58.5-58.64 Phase (combat) Data science Bridge (nautical) CRISP-DM Business goals ComputingMethodologies_PATTERNRECOGNITION Knowledge extraction Bootstrapping (electronics) natural language processing e-mail data understanding |
Zdroj: | Future Internet, Vol 13, Iss 243, p 243 (2021) Future Internet, 13(10). MDPI Future Internet Volume 13 Issue 10 |
ISSN: | 1999-5903 |
DOI: | 10.3390/fi13100243 |
Popis: | The Cross-Industry Standard Process for Data Mining (CRISP-DM), despite being the most popular data mining process for more than two decades, is known to leave those organizations lacking operational data mining experience puzzled and unable to start their data mining projects. This is especially apparent in the first phase of Business Understanding, at the conclusion of which, the data mining goals of the project at hand should be specified, which arguably requires at least a conceptual understanding of the knowledge discovery process. We propose to bridge this knowledge gap from a Data Science perspective by applying Natural Language Processing techniques (NLP) to the organizations’ e-mail exchange repositories to extract explicitly stated business goals from the conversations, thus bootstrapping the Business Understanding phase of CRISP-DM. Our NLP-Automated Method for Business Understanding (NAMBU) generates a list of business goals which can subsequently be used for further specification of data mining goals. The validation of the results on the basis of comparison to the results of manual business goal extraction from the Enron corpus demonstrates the usefulness of our NAMBU method when applied to large datasets. |
Databáze: | OpenAIRE |
Externí odkaz: |