A Data Augmentation and Pre-processing Technique for Sign Language Fingerspelling Recognition
Autor: | Frank Fowley, Ellen Rushe, Anthony Ventresque |
---|---|
Rok vydání: | 2022 |
Zdroj: | 24th Irish Machine Vision and Image Processing Conference. |
DOI: | 10.56541/xbav3102 |
Popis: | The reliance of deep learning algorithms on large scale datasets is a significant challenge for sign language recognition (SLR). The shortage of data resources for training SLR models inevitably leads to poor generalisation, especially for low-resource languages. We propose novel data augmentation and preprocessing techniques based on synthetic data generation to overcome these generalisation difficulties. Using these methods, our models achieved a top-1 accuracy of 86.7% and a top-2 accuracy of 95.5% when evaluated against an unseen corpus of Irish Sign Language (ISL) fingerspelling video recordings. We believe that this constitutes a state-of-the-art performance baseline for an Irish Sign Language recognition model when tested on an unseen dataset. |
Databáze: | OpenAIRE |
Externí odkaz: |