Bornil: An open-source sign language data crowdsourcing platform for AI enabled dialect-agnostic communication

Autor: Dhruvo, Shahriar Elahi, Rahman, Mohammad Akhlaqur, Mandal, Manash Kumar, Shihab, Md. Istiak Hossain, Ansary, A. A. Noman, Shithi, Kaneez Fatema, Khanom, Sanjida, Akter, Rabeya, Arib, Safaeid Hossain, Ansary, M. N., Mehnaz, Sazia, Sultana, Rezwana, Rahman, Sejuti, Chowdhury, Sayma Sultana, Chowdhury, Sabbir Ahmed, Sadeque, Farig, Sushmit, Asif
Rok vydání: 2023
Předmět:
Druh dokumentu: Working Paper
Popis: The absence of annotated sign language datasets has hindered the development of sign language recognition and translation technologies. In this paper, we introduce Bornil; a crowdsource-friendly, multilingual sign language data collection, annotation, and validation platform. Bornil allows users to record sign language gestures and lets annotators perform sentence and gloss-level annotation. It also allows validators to make sure of the quality of both the recorded videos and the annotations through manual validation to develop high-quality datasets for deep learning-based Automatic Sign Language Recognition. To demonstrate the system's efficacy; we collected the largest sign language dataset for Bangladeshi Sign Language dialect, perform deep learning based Sign Language Recognition modeling, and report the benchmark performance. The Bornil platform, BornilDB v1.0 Dataset, and the codebases are available on https://bornil.bengali.ai
Comment: 6 pages, 7 figures
Databáze: arXiv