Sign in

Note : code/cluster optimization is not at all taken into account to focus solely on the functionality of parallel model training. Same for (hyper) parameter values.
  1. Table of Contents
  • Background
  • Requirements
  • Tutorial
  • Conclusion
  • Sources

2. Background

You as a data engineer or a machine learning engineer are given a mission to create forecast with a time-series dataset. Your lovely data scientist already implemented basic set-up using Prophet in local environment. Things work as expected but the issue lies on scaling. What if we have gazillions of data points? Could we still run it on a single machine? Technically yes. But…


Data Engineer,

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store