An illustrated approach to Soft Textual Cartography.

Authors: Ceré R (Department of Geography and Sustainability, University of Lausanne, Lausanne, Switzerland); Egloff M (Department of Language and Information Sciences, University of Lausanne, Lausanne, Switzerland)
Language: English
Source: Applied network science [Appl Netw Sci] 2018; Vol. 3 (1), pp. 27. Date of Electronic Publication: 2018 Aug 13.
DOI: 10.1007/s41109-018-0087-y
Abstract: We propose and illustrate an approach to Soft Textual Cartography that clusters regions by taking into account both their spatial relationships and their textual descriptions within a corpus. We reduce large geo-referenced textual content to topics and merge them with their spatial configuration to reveal spatial patterns. The strategy consists in constructing a complex weighted network that reflects the geographical layout and whose nodes are further characterised by their thematic dissimilarity, extracted from topic modelling. A soft k-means procedure, taking into account both aspects through expectation maximisation on Gaussian mixture models and label propagation, converges towards a soft membership, which can then be compared with expert knowledge on regions. An application to the Wikipedia pages of Swiss municipalities demonstrates the potential of the approach, revealing textual autocorrelation and associations with official classifications. The synergy of the spatial and textual aspects appears promising for topic interpretation and geographical information retrieval, and can incorporate expert knowledge through the choice of the initial membership.
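Illustrative sketch (not the authors' implementation): the abstract describes a soft k-means step on textual features combined with propagation of memberships over a spatial network. The Python code below is a minimal toy version of that general idea, under assumptions that do not come from the record: the function name `soft_spatial_kmeans`, the parameters `beta` (sharpness of the soft assignment) and `alpha` (share of membership taken from spatial neighbours), the squared Euclidean dissimilarity, the simple linear mixing rule, and the six-region toy data are all hypothetical.

```python
import numpy as np

def soft_spatial_kmeans(X, W, Z0, beta=5.0, alpha=0.3, n_iter=50):
    """Toy soft clustering mixing textual features and spatial neighbours.

    X  : (n, d) feature vectors, e.g. topic proportions per region (assumed)
    W  : (n, n) non-negative spatial weights, every row with a positive sum
    Z0 : (n, k) initial soft membership, rows summing to 1
    """
    # Row-normalise the spatial weights so each row sums to 1.
    P = W / W.sum(axis=1, keepdims=True)
    Z = Z0.copy()
    for _ in range(n_iter):
        # Centroids: membership-weighted means of the features.
        centroids = (Z.T @ X) / Z.sum(axis=0)[:, None]
        # Squared Euclidean dissimilarity of each region to each centroid.
        D = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        # Soft assignment from the features alone (numerically stable softmax).
        logits = -beta * D
        logits -= logits.max(axis=1, keepdims=True)
        Z_text = np.exp(logits)
        Z_text /= Z_text.sum(axis=1, keepdims=True)
        # Propagation step: blend each region's membership with its neighbours'.
        Z = (1 - alpha) * Z_text + alpha * (P @ Z_text)
    return Z

# Hypothetical data: six regions on a line, two thematic groups.
X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
              [1.0, 1.0], [0.9, 1.0], [1.0, 0.9]])
W = np.zeros((6, 6))
for i in range(5):
    W[i, i + 1] = W[i + 1, i] = 1.0  # path-graph adjacency

# Initial membership: uniform, except two regions labelled by "expert" prior.
Z0 = np.full((6, 2), 0.5)
Z0[0] = [0.9, 0.1]
Z0[5] = [0.1, 0.9]

Z = soft_spatial_kmeans(X, W, Z0)
print(np.round(Z, 3))
```

In this toy setting, regions at the boundary between the two groups keep a visibly softer membership than interior ones, because part of their membership is borrowed from neighbours in the other group. The informative `Z0` mirrors the abstract's remark that expert knowledge can enter through the initial membership.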
Competing Interests: The authors declare that they have no competing interests. Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Database: MEDLINE