Autor:	Yadav, Shashank, Shimpi, Tejas, Chowdary, C. Ravindranath, Sharma, Prashant, Agrawal, Deepansh, Agarwal, Shivang
Rok vydání:	2019
Předmět:	Computer Science - Information Retrieval Computer Science - Computation and Language
Druh dokumentu:	Working Paper
Popis:	Segmenting an unordered text document into different sections is a very useful task in many text processing applications like multiple document summarization, question answering, etc. This paper proposes structuring of an unordered text document based on the keywords in the document. We test our approach on Wikipedia documents using both statistical and predictive methods such as the TextRank algorithm and Google's USE (Universal Sentence Encoder). From our experimental results, we show that the proposed model can effectively structure an unordered document into sections.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/1901.10133 Zobrazit plný text záznamu View this record from Arxiv