Building a DDC-annotated Corpus from OAI Metadata

Autor: Lösch, Mathias, Waltinger, Ulli, Horstmann, Wolfram, Mehler, Alexander
Přispěvatelé: German Research Foundation (DFG)
Jazyk: angličtina
Rok vydání: 2011
Předmět:
Zdroj: Journal of Digital Information; Vol 12, No 2 (2011): Open Repositories 2010
International Conference on Open Repositories : Proceedings
Journal of Digital Information; Vol. 12 No. 2 (2011): Open Repositories 2010
ISSN: 1368-7506
Popis: A frequently overlooked benefit of open access publications is that they are an easy accessible and cost-effective data source for research disciplines like text mining, natural language processing or computational linguistics. In those fields, linguistic data is usually managed in the form of corpora, i.e. machine readable bodies of texts that represent a particular variety of language.
International Conference on Open Repositories : Proceedings, The 5th International Conference on Open Repositories (OR2010), Madrid, Spain, 6-9 July 2010
Databáze: OpenAIRE