On the Geometry of Concreteness

Author: Wartena, Christian
Year of publication: 2022
Subject:
Source: Proceedings of the 7th Workshop on Representation Learning for NLP.
DOI: 10.18653/v1/2022.repl4nlp-1.21
Description: In this paper we investigate how concreteness and abstractness are represented in word embedding spaces. We use data for English and German, and show that concreteness and abstractness can be determined independently and turn out to be completely opposite directions in the embedding space. Various methods can be used to determine the direction of concreteness, always resulting in roughly the same vector. Though concreteness is a central aspect of the meaning of words and can be detected clearly in embedding spaces, it does not seem as easy to add or subtract concreteness to obtain other words or word senses as it is for a semantic property such as gender.
Database: OpenAIRE