Autor: |
Utka, Andrius, Rackevičienė, Sigita, Mockienė, Liudmila, Rokas, Aivaras, Laurinaitis, Marius, Bielinskienė, Agnė |
Rok vydání: |
2022 |
Předmět: |
|
Zdroj: |
Linköping Electronic Conference Proceedings. |
ISSN: |
1650-3686 |
DOI: |
10.3384/ecp18912 |
Popis: |
The paper aims at presenting English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the project DVITAS. It is argued that a system of parallel, comparable, and training corpora for BiTE is particularly useful for less-resourced languages, as it allows efficiently to combine strengths and avoid weaknesses of comparable and parallel resources. A special focus is given to the availability of sources in the cybersecurity domain and issues related to copyright-protected publications, as well as the data curation performed for building the corpora and depositing them to CLARIN-LT repository. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|