Clustering and Visualising Documents using Word Embeddings

Authors: Jonathan Reades, Jennie Williams
Language: English
Year of publication: 2023
Subject:
Source: The Programming Historian, Vol 12 (2023)
Document type: article
ISSN: 2397-2068
DOI: 10.46430/phen0111
Description: This lesson uses word embeddings and clustering algorithms in Python to identify groups of similar documents in a corpus of approximately 9,000 academic abstracts. It will teach you the basics of dimensionality reduction for extracting structure from a large corpus and how to evaluate your results.
Database: Directory of Open Access Journals