Authors: |
Titeux, Hadrien; Riad, Rachid; Cao, Xuan-Nga; Hamilakis, Nicolas; Madden, Kris; Cristia, Alejandrina; Bachoud-Lévi, Anne-Catherine; Dupoux, Emmanuel |
Year of publication: |
2020 |
Source: |
LREC 2020 - 12th Language Resources and Evaluation Conference, May 2020, Marseille, France. pp. 6976-6982 |
Document type: |
Working Paper |
Description: |
We introduce Seshat, a new, simple, open-source software tool for efficiently managing annotations of speech corpora. Seshat lets users easily customise and manage annotations of large audio corpora while ensuring that the annotated output files comply with formatting and naming conventions. In addition, it includes procedures for checking the content of annotations against specific rules, which can be implemented in personalised parsers. Finally, we propose a double-annotation mode, for which Seshat automatically computes the associated inter-annotator agreement using the $\gamma$ measure, which takes into account both categorisation and segmentation discrepancies. |
Database: |
arXiv |