Mining and Analysing One Billion Requests to Linguistic Services

Autor: Thomas Eckart, Greta Franzini, Marco Büchler, Emily Franzini
Jazyk: angličtina
Rok vydání: 2017
Předmět:
Zdroj: IEEE BigData
Popis: From 2004 to 2016 the Leipzig Linguistic Services (LLS) existed as a SOAP-based cyberinfrastructure of atomic micro-services for the Wortschatz project, which covered different-sized textual corpora in more than 230 languages. The LLS were developed in 2004 and went live in 2005 in order to provide a webservice-based API to these corpus databases. In 2006, the LLS infrastructure began to systematically log and store requests made to the text collection, and in August 2016 the LLS were shut down. This article summarises the experience of the past ten years of running such a cyberinfrastructure with a total of nearly one billion requests. It includes an explanation of the technical decisions and limitations but also provides an overview of how the services were used.
Databáze: OpenAIRE