Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the 'Speaking Rosetta' JSALT 2017 Workshop

Autor: Scharenborg, Odette, Besacier, Laurent, Black, Alan, Hasegawa-Johnson, Mark, Metze, Florian, Neubig, Graham, Stueker, Sebastian, Godard, Pierre, Mueller, Markus, Ondel, Lucas, Palaskar, Shruti, Arthur, Philip, Ciannella, Francesco, Du, Mingxing, Larsen, Elin, Merkx, Danny, Riad, Rachid, Wang, Liming, Dupoux, Emmanuel
Rok vydání: 2018
Předmět:
Druh dokumentu: Working Paper
Popis: We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography. We study the replacement of orthographic transcriptions by images and/or translated text in a well-resourced language to help unsupervised discovery from raw speech.
Comment: Accepted to ICASSP 2018
Databáze: arXiv