GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek

Authors: Loukas, Lefteris; Smyrnioudis, Nikolaos; Dikonomaki, Chrysa; Barbakos, Spyros; Toumazatos, Anastasios; Koutsikakis, John; Kyriakakis, Manolis; Georgiou, Mary; Vassos, Stavros; Pavlopoulos, John; Androutsopoulos, Ion
Year of publication: 2024
Subject:
Document type: Working Paper
Description: We present GR-NLP-TOOLKIT, an open-source natural language processing (NLP) toolkit developed specifically for modern Greek. The toolkit provides state-of-the-art performance in five core NLP tasks, namely part-of-speech tagging, morphological tagging, dependency parsing, named entity recognition, and Greeklish-to-Greek transliteration. The toolkit is based on pre-trained Transformers, is freely available, and can be easily installed in Python (pip install gr-nlp-toolkit). It is also accessible through a demonstration platform on HuggingFace, along with a publicly available API for non-commercial use. We discuss the functionality provided for each task, the underlying methods, experiments against comparable open-source toolkits, and possible future enhancements. The toolkit is available at: https://github.com/nlpaueb/gr-nlp-toolkit
Comment: Accepted Demo Paper @ COLING 2025 (GitHub: https://github.com/nlpaueb/gr-nlp-toolkit/, Demo: https://huggingface.co/spaces/AUEB-NLP/greek-nlp-toolkit-demo, API: https://huggingface.co/spaces/AUEB-NLP/The-Greek-NLP-API)
Database: arXiv