The lexical context in a style analysis: A word embeddings approach.

Authors: Kubát, Miroslav; Hůla, Jan; Chen, Xinying; Čech, Radek; Milička, Jiří
Subject:
Source: Corpus Linguistics & Linguistic Theory; Oct 2021, Vol. 17 Issue 2, p443-464, 22p
Abstract: This is a pilot study of the usability of the Context Specificity measure for stylometric purposes. A Word2vec word embedding approach, based on measuring the similarity of lexical contexts between lemmas, is applied to texts that belong to different styles. Three types of Czech texts are investigated: fiction, non-fiction, and journalism. Forty lemmas were observed (10 each for verbs, nouns, adjectives, and adverbs). The aim of the study is to introduce the concept of Context Specificity and to test whether this measure is sensitive to different styles. The results show that the proposed method, Closest Context Specificity (CCS), is independent of corpus size and shows promise for analyzing different styles. [ABSTRACT FROM AUTHOR]
Database: Complementary Index