Demand Forecasting in the Presence of Privileged Information
Autor: | Ariannezhad, M., Schelter, S., de Rijke, M., Lemaire, V., Malinowski, S., Bagnall, A., Guyet, T., Tavenard, R., Ifrim, G. |
---|---|
Přispěvatelé: | Information and Language Processing Syst (IVI, FNWI) |
Rok vydání: | 2020 |
Předmět: | |
Zdroj: | Advanced Analytics and Learning on Temporal Data ISBN: 9783030657413 AALTD@PKDD/ECML Advanced Analytics and Learning on Temporal Data: 5th ECML PKDD Workshop, AALTD 2020, Ghent, Belgium, September 18, 2020 : revised selected papers, 46-62 STARTPAGE=46;ENDPAGE=62;TITLE=Advanced Analytics and Learning on Temporal Data |
DOI: | 10.1007/978-3-030-65742-0_4 |
Popis: | Predicting the amount of sales in the future is a fundamental problem in the replenishment process of retail companies. Models for forecasting the demand of an item typically rely on influential features and historical sales of the item. However, the values of some influential features (to which we refer as non-plannable features) are only known during model training (for the past), and not for the future at prediction time. Examples of such features include sales in other channels, such as other stores in chain supermarkets. Existing forecasting methods ignore such non-plannable features or wrongly assume that they are also known at prediction time. We identify non-plannable features as privileged information, i.e., information that is available at training time but not at prediction time, and design a neural network to leverage this source of data accordingly. We present a dual branch neural network architecture that incorporates non-plannable features at training time, with a first branch to embed the historical information, and a second branch, the privileged information (PI) branch, to predict demand based on privileged information. Next, we leverage a single branch network at prediction time, which applies a simulation component to mimic the behavior of the PI branch, whose inputs are not available at prediction time. We evaluate our approach on two real-world forecasting datasets, and find that it outperforms state-of-the-art competitors in terms of mean absolute error and symmetric mean absolute percentage error metrics. We further provide visualizations and conduct experiments to validate the contribution of different components in our proposed architecture. |
Databáze: | OpenAIRE |
Externí odkaz: |