Understanding and Inferring Units in Spreadsheets
Autor: | Jack Williams, Advait Sarkar, Carina Negreanu, Andrew D. Gordon |
---|---|
Rok vydání: | 2020 |
Předmět: |
Class (computer programming)
Computer science business.industry Carry (arithmetic) Probabilistic logic Inference Value (computer science) 020207 software engineering 02 engineering and technology Machine learning computer.software_genre Unit (housing) Constraint (information theory) Annotation 020204 information systems 0202 electrical engineering electronic engineering information engineering Artificial intelligence business computer |
Zdroj: | VL/HCC |
DOI: | 10.1109/vl/hcc50065.2020.9127254 |
Popis: | Numbers in spreadsheets often have units: metres, grams, dollars, etc. Spreadsheet cells typically cannot carry unit information, and even where they can, users may not be motivated to provide it. However, unit information is extremely valuable: it allows us to detect and prevent an entire class of spreadsheet errors, such as accidentally adding values of different units. What if we could infer the unit of any value in a spreadsheet, with little or no work from the user?We present a novel method for predicting units and dimensions in spreadsheets, the first such method that combines logical constraint solving and probabilistic unit labelling. Our approach identifies and formalises the critical cells in spreadsheets that bound the user cost of unit annotation. Separately, we apply machine learning to infer probabilistic unit labels from cell text. To contextualise the accuracy of our system, we discuss the attention investment trade-off for unit inference. |
Databáze: | OpenAIRE |
Externí odkaz: |