Do the Math: Making Mathematics in Wikipedia Computable

Authors: Andre Greiner-Petter, Moritz Schubotz, Corinna Breitinger, Philipp Scharpf, Akiko Aizawa, Bela Gipp
Year of publication: 2022
Subject:
Source: IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1-12
ISSN: 1939-3539, 0162-8828
Description: Wikipedia combines the power of AI solutions and human reviewers to safeguard article quality. Quality control objectives include detecting malicious edits, fixing typos, and spotting inconsistent formatting. However, no automated quality control mechanisms currently exist for mathematical formulae. Spell checkers are widely used to highlight textual errors, yet no equivalent tool exists to detect algebraically incorrect formulae. Our paper addresses this shortcoming by making mathematical formulae computable. We present a method that (1) gathers semantic information from the context surrounding each mathematical formula, (2) provides access to that information in a graph-structured dependency hierarchy, and (3) performs automatic plausibility checks on equations. We evaluate the performance of our approach on 6,337 mathematical expressions contained in 104 Wikipedia articles on the topic of orthogonal polynomials and special functions. Our system, LACAST, verified 358 out of 1,516 equations as error-free. LACAST successfully translated 27% of the mathematical expressions and outperformed existing translation approaches by 16%. Additionally, LACAST achieved an F1 score of 0.495 for annotating mathematical expressions with relevant textual descriptions, which is a significant step towards advancing the searchability, readability, and accessibility of mathematical formulae in Wikipedia. A prototype of LACAST and the semantically enhanced Wikipedia articles are available at: https://tpami.wmflabs.org.
Database: OpenAIRE
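
Illustration: the description mentions automatic plausibility checks on equations. The sketch below shows one generic way such a check could work, by evaluating both sides of an equation at random sample points and comparing the results. It is not the verification procedure used by LACAST; the function name `numerically_plausible`, its parameters, and the Chebyshev example are assumptions chosen for illustration only.

```python
import math
import random


def numerically_plausible(lhs, rhs, sample_range=(-1.0, 1.0),
                          trials=50, tol=1e-9, seed=0):
    """Return True if lhs(x) and rhs(x) agree within tol at every sampled point.

    Hypothetical helper: a passing check makes an equation plausible,
    it does not prove it. Points where either side is undefined are skipped.
    """
    rng = random.Random(seed)  # fixed seed keeps the check reproducible
    low, high = sample_range
    evaluated = 0
    for _ in range(trials):
        x = rng.uniform(low, high)
        try:
            left, right = lhs(x), rhs(x)
        except (ValueError, ZeroDivisionError, OverflowError):
            continue  # sample lies outside the domain of one side
        evaluated += 1
        if not math.isclose(left, right, rel_tol=tol, abs_tol=tol):
            return False
    # Require at least one successful evaluation before calling it plausible.
    return evaluated > 0


# Example from the domain of orthogonal polynomials:
# the Chebyshev polynomial T_3(x) = cos(3 arccos x) = 4x^3 - 3x on [-1, 1].
def chebyshev_t3_trig(x):
    return math.cos(3 * math.acos(x))


def chebyshev_t3_poly(x):
    return 4 * x**3 - 3 * x


def chebyshev_t3_wrong(x):
    # Deliberate sign error, as might slip into a hand-edited formula.
    return 4 * x**3 + 3 * x


print(numerically_plausible(chebyshev_t3_trig, chebyshev_t3_poly))
print(numerically_plausible(chebyshev_t3_trig, chebyshev_t3_wrong))
```

A numeric check of this kind can only flag an equation as suspicious or plausible at the sampled points; a symbolic simplification of the difference between both sides would be a complementary test.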