Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3448016.3457557acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

AutoAI-TS: AutoAI for Time Series Forecasting

Published: 18 June 2021 Publication History

Abstract

A large number of time series forecasting models including traditional statistical models, machine learning models and more recently deep learning have been proposed in the literature. However, choosing the right model along with good parameter values that performs well on a given data is still challenging. Automatically providing a good set of models to users for a given dataset saves both time and effort from using trial-and-error approaches with a wide variety of available models along with parameter optimization. We present AutoAI for Time Series Forecasting (AutoAI-TS) that provides users with a zero configuration (zero-conf) system to efficiently train, optimize and choose best forecasting model among various classes of models for the given dataset. With its flexible zero-conf design, AutoAI-TS automatically performs all the data preparation, model creation, parameter optimization, training and model selection for users and provides a trained model that is ready to use. For given data, AutoAI-TS utilizes a wide variety of models including classical statistical models, Machine Learning (ML) models, statistical-ML hybrid models and deep learning models along with various transformations to create forecasting pipelines. It then evaluates and ranks pipelines using the proposed T-Daub mechanism to choose the best pipeline. The paper describe in detail all the technical aspects of AutoAI-TS along with extensive benchmarking on a variety of real world data sets for various use-cases. Benchmark results show that AutoAI-TS, with no manual configuration from the user, automatically trains and selects pipelines that on average outperform existing state-of-the-art time series forecasting toolkits.

Supplementary Material

MP4 File (3448016.3457557.mp4)
Large number of time series forecasting models including tradi- tional statistical models, machine learning models and more re- cently deep learning have been proposed in the literature. However, choosing the right model along with good parameter values that performs well on a given data is still challenging. Automatically providing good set of models to users for a given dataset saves both time and efforts from using trial-and-error with wide variety of available models along with all possible parameter optimization. We present AutoAI for Time Series Forecasting (AutoAI-TS) that pro- vides users with a zero-conf system to efficiently train, optimize and choose best forecasting model among various classes of models for the given dataset. With the flexible zero-conf design of AutoAI-TS automatically performs all the data preparation, model creation, parameter optimization, training and model selection for the users and provides them with trained model that is ready to use for their use case. For given data, the AutoAI-TS utilizes wide inventory of models including classical statistical models, Machine Learn- ing (ML) models, statistical-ML hybrid models and deep learning models along with various transformations to create forecasting pipelines. It then evaluates and ranks pipelines using proposed T-Daub mechanism to choose best pipeline. The paper describe in detail all the technical aspects of AutoAI-TS along with exten- sive benchmarking on variety of real world data sets for various use-cases. Benchmark results show that AutoAI-TS with no manual configuration from user, automatically trains and selects pipelines that on average outperform existing state-of-the-art time series forecasting toolkits.

References

[1]
Subutai Ahmad, Alexander Lavin, Scott Purdy, and Zuha Agha. 2017. Unsupervised real-time anomaly detection for streaming data., Vol. 262 (2017). http://dblp.uni-trier.de/db/journals/ijon/ijon262.html#AhmadLPA17
[2]
AWS. Accessed: 2021-02--10. DeepAR. https://gluon-ts.mxnet.io/api/gluonts/gluonts.model.deepar.html .
[3]
Pourya Ayria. Accessed: 2020-08--17. A complete Machine Learning PipeLine. https://www.kaggle.com/pouryaayria/a-complete-ml-pipeline-tutorial-acu-86 .
[4]
Lars Buitinck, Gilles Louppe, Mathieu Blondel, Fabian Pedregosa, Andreas Mueller, Olivier Grisel, Vlad Niculae, Peter Prettenhofer, Alexandre Gramfort, Jaques Grobler, Robert Layton, Jake VanderPlas, Arnaud Joly, Brian Holt, and Gaë l Varoquaux. 2013. API design for machine learning software: experiences from the scikit-learn project. In ECML PKDD Workshop: Languages for Data Mining and Machine Learning . 108--122.
[5]
Facebook. Accessed: 2021-02--10. Prophet. https://facebook.github.io/prophet/.
[6]
Valentin Flunkert, David Salinas, and Jan Gasthaus. 2017. DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks. International Journal of Forecasting (04 2017). https://doi.org/10.1016/j.ijforecast.2019.07.001
[7]
Daniel Golovin, Benjamin Solnik, Subhodeep Moitra, Greg Kochanski, John Elliot Karro, and D. Sculley (Eds.). 2017. Google Vizier: A Service for Black-Box Optimization . http://www.kdd.org/kdd2017/papers/view/google-vizier-a-service-for-black-box-optimization
[8]
Google. Accessed: 2020-08--17. Cloud AutoML . https://cloud.google.com/automl .
[9]
H2O. Accessed: 2020-08--17. H2O AutoML . https://www.h2o.ai/products/h2o-automl/.
[10]
Trevor Hastie and Robert Tibshirani. 1987. Generalized additive models: Some applications. J. Amer. Statist. Assoc., Vol. 82 (1987), 371--386.
[11]
Xin He, Kaiyong Zhao, and Xiaowen Chu. 2019. AutoML: A Survey of the State-of-the-Art. ArXiv, Vol. abs/1908.00709 (2019).
[12]
Rob Hyndman, Anne Koehler, Keith Ord, and Ralph Snyder. 2008. Forecasting with exponential smoothing. The state space approach . 362. https://doi.org/10.1007/978--3--540--71918--2
[13]
Rob Hyndman and Yangzhuoran Yang. 2018. v0.1.0 (accessed Jan 15, 2021). tsdl: Time Series Data Library. https://pkg.yangzhuoranyang./tsdl/
[14]
Rob J. Hyndman and Yeasmin Khandakar. 2008. Automatic Time Series Forecasting: The forecast Package for R. Journal of Statistical Software, Vol. 27, 3 (22 3 2008), 1--22. showCODENJSSOBK
[15]
IBM. Accessed: 2020-08--17. AutoAI with IBM Watson Studio . https://www.ibm.com/cloud/watson-studio/autoai .
[16]
Arun Iyengar, Jayant Kalagnanam, Dhaval Patel, Chandra Reddy, and Shrey Shrivastava. 2019. Providing Cooperative Data Analytics for Real Applications Using Machine Learning. In 39th IEEE International Conference on Distributed Computing Systems, ICDCS. 1878--1890. https://doi.org/10.1109/ICDCS.2019.00186
[17]
Balázs Kégl. Accessed: 2020-08--17. How to Build a Data Science Pipeline. https://www.kdnuggets.com/2017/07/build-data-science-pipeline.html .
[18]
Guokun Lai, Wei-Cheng Chang, Yiming Yang, and Hanxiao Liu. 2017a. Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks. CoRR, Vol. abs/1703.07015 (2017). arxiv: 1703.07015
[19]
Guokun Lai, Wei-Cheng Chang, Yiming Yang, and Hanxiao Liu. 2017b. Modeling Long-and Short-Term Temporal Patterns with Deep Neural Networks. arXiv preprint arXiv:1703.07015 (2017).
[20]
Alysha Livera, Rob Hyndman, and Ralph Snyder. 2010. Forecasting Time Series With Complex Seasonal Patterns Using Exponential Smoothing. J. Amer. Statist. Assoc., Vol. 106 (01 2010), 1513--1527. https://doi.org/10.1198/jasa.2011.tm09771
[21]
Markus Lö ning, Anthony Bagnall, Sajaysurya Ganesh, Viktor Kazakov, Jason Lines, and Franz J Kirá ly. 2019. sktime: A Unified Interface for Machine Learning with Time Series. In Workshop on Systems for ML at NeurIPS 2019 (2019).
[22]
Spyros Makridakis, Evangelos Spiliotis, and Vassilis Assimakopoulos. 2018. Statistical and Machine Learning forecasting methods: Concerns and ways forward. PLoS ONE, Vol. 13 (03 2018). https://doi.org/10.1371/journal.pone.0194889
[23]
Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio. 2020. N-BEATS: Neural basis expansion analysis for interpretable time series forecasting. In International Conference on Learning Representations . https://openreview.net/forum?id=r1ecqn4YwB
[24]
Philippe Remy. Accessed: 2021-02--10. N-Beats. https://github.com/philipperemy/n-beats .
[25]
Ashish Sabharwal, Horst Samulowitz, and Gerald Tesauro. 2015. Selecting Near-Optimal Learners via Incremental Data Allocation. In AAAI .
[26]
David Salinas, Valentin Flunkert, Jan Gasthaus, and Tim Januschowski. 2020. DeepAR: Probabilistic forecasting with autoregressive recurrent networks. International Journal of Forecasting, Vol. 36, 3 (2020), 1181--1191.
[27]
Radwa El Shawi, Mohamed Maher, and Sherif Sakr. 2019. Automated Machine Learning: State-of-The-Art and Open Challenges. CoRR, Vol. abs/1906.02287 (2019). arxiv: 1906.02287 http://arxiv.org/abs/1906.02287
[28]
Shrey Shrivastava, Dhaval Patel, Wesley M. Gifford, Stuart Siegel, and Jayant Kalagnanam. 2019. ThunderML: A Toolkit for Enabling AI/ML Models on Cloud for Industry 4.0. In 26th International Conference Web Services - ICWS. 163--180. https://doi.org/10.1007/978--3-030--23499--7_11
[29]
P. C. L Silva et almbox. 2018 (accessed June 15, 2020). pyFTS: Fuzzy Time Series for Python . https://doi.org/10.5281/zenodo.597359
[30]
Taylor G. Smith et almbox. 2017. pmdarima: ARIMA estimators for Python . http://www.alkaline-ml.com/pmdarima [Online; accessed 2020-08--17].
[31]
Sean Taylor and Benjamin Letham. 2017. Forecasting at Scale. The American Statistician, Vol. 72 (09 2017). https://doi.org/10.1080/00031305.2017.1380080
[32]
Sean J Taylor and Benjamin Letham. 2018. Forecasting at scale. The American Statistician, Vol. 72, 1 (2018), 37--45.
[33]
Linda Zhou. Accessed: 2020-08--17. How to Build a Better Machine Learning Pipeline. https://www.datanami.com/2018/09/05/how-to-build-a-better-machine-learning-pipeline/.
[34]
Marc-Andre Zoller and Marco F. Huber. 2019. Benchmark and Survey of Automated Machine Learning Frameworks. arXiv: Learning (2019).

Cited By

View all
  • (2024)A Demonstration of TENDS: Time Series Management System Based on Model SelectionProceedings of the VLDB Endowment10.14778/3685800.368587417:12(4357-4360)Online publication date: 1-Aug-2024
  • (2024)DARKER: Efficient Transformer with Data-Driven Attention Mechanism for Time SeriesProceedings of the VLDB Endowment10.14778/3681954.368199617:11(3229-3242)Online publication date: 30-Aug-2024
  • (2024)Dynamic Spatial-Temporal Embedding via Neural Conditional Random Field for Multivariate Time Series ForecastingACM Transactions on Spatial Algorithms and Systems10.1145/3675165Online publication date: 27-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data
June 2021
2969 pages
ISBN:9781450383431
DOI:10.1145/3448016
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automl
  2. machine learning
  3. ml pipelines
  4. optimization
  5. time series

Qualifiers

  • Research-article

Conference

SIGMOD/PODS '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)107
  • Downloads (Last 6 weeks)12
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Demonstration of TENDS: Time Series Management System Based on Model SelectionProceedings of the VLDB Endowment10.14778/3685800.368587417:12(4357-4360)Online publication date: 1-Aug-2024
  • (2024)DARKER: Efficient Transformer with Data-Driven Attention Mechanism for Time SeriesProceedings of the VLDB Endowment10.14778/3681954.368199617:11(3229-3242)Online publication date: 30-Aug-2024
  • (2024)Dynamic Spatial-Temporal Embedding via Neural Conditional Random Field for Multivariate Time Series ForecastingACM Transactions on Spatial Algorithms and Systems10.1145/3675165Online publication date: 27-Jun-2024
  • (2024)AutoRL X: Automated Reinforcement Learning on the WebACM Transactions on Interactive Intelligent Systems10.1145/3670692Online publication date: 3-Jun-2024
  • (2024)ShrinkHPO: Towards Explainable Parallel Hyperparameter Optimization2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00371(4897-4910)Online publication date: 13-May-2024
  • (2024)Beyond point forecasts: Uncertainty quantification in tensor extrapolation for relational time series dataCommunications in Statistics: Case Studies, Data Analysis and Applications10.1080/23737484.2024.232917110:2(107-124)Online publication date: 19-Mar-2024
  • (2024)The Predictability of Stock PriceBig Data Research10.1016/j.bdr.2023.10041435:COnline publication date: 16-May-2024
  • (2024)Signature-based Adaptive Cloud Resource Usage Prediction Using Machine Learning and Anomaly DetectionJournal of Grid Computing10.1007/s10723-024-09764-422:2Online publication date: 23-Apr-2024
  • (2024)Foresight plus: serverless spatio-temporal traffic forecastingGeoinformatica10.1007/s10707-024-00517-928:4(649-677)Online publication date: 1-Oct-2024
  • (2024)AutoCTS++: zero-shot joint neural architecture and hyperparameter search for correlated time series forecastingThe VLDB Journal10.1007/s00778-024-00872-x33:5(1743-1770)Online publication date: 30-Jul-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media