Autor: |
Jonathan Reades, Jennie Williams |
Jazyk: |
angličtina |
Rok vydání: |
2023 |
Předmět: |
|
Zdroj: |
The Programming Historian, Vol 12 (2023) |
Druh dokumentu: |
article |
ISSN: |
2397-2068 |
DOI: |
10.46430/phen0111 |
Popis: |
This lesson uses word embeddings and clustering algorithms in Python to identify groups of similar documents in a corpus of approximately 9,000 academic abstracts. It will teach you the basics of dimensionality reduction for extracting structure from a large corpus and how to evaluate your results. |
Databáze: |
Directory of Open Access Journals |
Externí odkaz: |
|