Tabular Transformers for Modeling Multivariate Time Series
Authors: | Pierre L. Dognin, Mattia Rigotti, Yair Schiff, Youssef Mroueh, Igor Melnyk, Ravi Nair, Erik R. Altman, Jerret Ross, Inkit Padhi |
---|---|
Publication year: | 2020 |
Subject: | FOS: Computer and information sciences; Computer Science - Machine Learning; Artificial neural network; Computer Science - Artificial Intelligence; Computer science; business.industry; Deep learning; Feature extraction; computer.software_genre; Data modeling; Machine Learning (cs.LG); Credit card; Artificial Intelligence (cs.AI); Code (cryptography); Leverage (statistics); Data mining; Artificial intelligence; Time series; business; computer |
Source: | ICASSP |
DOI: | 10.48550/arxiv.2011.01843 |
Description: | Tabular datasets are ubiquitous in data science applications. Given their importance, it seems natural to apply state-of-the-art deep learning algorithms in order to fully unlock their potential. Here we propose neural network models for tabular time series that can optionally leverage the hierarchical structure of the data. This results in two architectures for tabular time series: one for learning representations, which is analogous to BERT and can be pre-trained end-to-end and used in downstream tasks, and one that is akin to GPT and can be used to generate realistic synthetic tabular sequences. We demonstrate our models on two datasets: a synthetic credit card transaction dataset, where the learned representations are used for fraud detection and synthetic data generation, and a real pollution dataset, where the learned encodings are used to predict atmospheric pollutant concentrations. Code and data are available at https://github.com/IBM/TabFormer. Comment: Accepted to ICASSP 2021. |
Database: | OpenAIRE |
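
The hierarchical structure mentioned in the description (fields within a row, rows within a time-ordered sequence) can be illustrated with a minimal sketch. This is not the TabFormer implementation from the linked repository. The function names, the field names (`user`, `t`, `merchant`, `amount_bin`), the assumption that every field is already discretized, and the non-overlapping windowing are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def build_field_vocab(rows, fields):
    """Give each distinct value of each field its own integer id (one vocabulary per field)."""
    vocab = {f: {} for f in fields}
    for row in rows:
        for f in fields:
            vocab[f].setdefault(row[f], len(vocab[f]))
    return vocab


def encode_sequences(rows, fields, key_field, time_field, window):
    """Return {entity: [window, ...]}; a window is a list of rows, a row is a list of token ids."""
    vocab = build_field_vocab(rows, fields)
    by_entity = defaultdict(list)
    # Sort by time so that each entity's rows form a time series.
    for row in sorted(rows, key=lambda r: r[time_field]):
        by_entity[row[key_field]].append([vocab[f][row[f]] for f in fields])
    # Cut each entity's series into non-overlapping windows; incomplete windows are dropped.
    return {
        entity: [seq[i:i + window] for i in range(0, len(seq) - window + 1, window)]
        for entity, seq in by_entity.items()
    }


# Hypothetical toy transactions, assumed to be already binned into categories.
rows = [
    {"user": "a", "t": 1, "merchant": "grocer", "amount_bin": "low"},
    {"user": "a", "t": 2, "merchant": "fuel", "amount_bin": "high"},
    {"user": "b", "t": 1, "merchant": "grocer", "amount_bin": "high"},
    {"user": "a", "t": 3, "merchant": "grocer", "amount_bin": "low"},
    {"user": "a", "t": 4, "merchant": "fuel", "amount_bin": "low"},
]
windows = encode_sequences(
    rows, ["merchant", "amount_bin"], key_field="user", time_field="t", window=2
)
```

Each window is a nested list (rows of field tokens), which is the kind of input a row-level encoder followed by a sequence-level model could consume. User `b` has only one row, so no complete window is produced for that user.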