Selecting syntactic attributes for authorship attribution
Autor: | Edson J. R. Justino, Luiz S. Oliveira, Paulo Junior Varela |
---|---|
Rok vydání: | 2011 |
Předmět: |
Structured support vector machine
business.industry Computer science Mode (statistics) Machine learning computer.software_genre Support vector machine Relevance vector machine Authorship attribution Genetic algorithm Margin classifier Artificial intelligence business computer Natural language processing |
Zdroj: | IJCNN |
DOI: | 10.1109/ijcnn.2011.6033217 |
Popis: | In this work we present a methodology to select syntactic attributes for authorship attribution. The approach takes into account a multi-objective genetic algorithm and a Support Vector Machine classifier and it operates in a wrapper mode. Through a series of comprehensive experiments on a database composed of 3000 short articles written in Portuguese we show that the proposed methodology is able to provide a concise subset of attributes, which increases the recognition rate in about 15 percentage points. |
Databáze: | OpenAIRE |
Externí odkaz: |