Generating Synthetic Handwritten Mathematical Expressions from a LaTeX Sequence or a MathML Script
Autor: | Ung Quang Huy, Vu Tran Minh Khuong, Minh Khanh Phan, Nakagawa Masaki |
---|---|
Rok vydání: | 2019 |
Předmět: |
Sequence
Relation (database) Computer science business.industry media_common.quotation_subject 020207 software engineering 02 engineering and technology computer.software_genre Symbol (chemistry) Symbol Tree (data structure) Annotation MathML 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence business computer Natural language processing media_common |
Zdroj: | ICDAR |
DOI: | 10.1109/icdar.2019.00152 |
Popis: | Collecting handwritten mathematical expressions (HMEs) generally requires a lot of time and effort for data preparation, data collection, annotation and so on. In this paper, we present a method for generating realistic HMEs in a wide variety of structures and styles from a LaTeX sequence or a MathML script using online isolated symbol patterns. Our method firstly positions all symbols in a symbol relation tree constructed from the input LaTeX or MathML script. Then, it places normalized online symbol patterns into the corresponding locations. A questionnaire-based experiment shows that the synthetic patterns are as clear and natural as the real patterns. Therefore, we can use the generated synthetic HME patterns for research on HME recognition and clustering. |
Databáze: | OpenAIRE |
Externí odkaz: |