Note: code and cluster optimization is deliberately left out so that we can focus solely on the functionality of parallel model training. The same goes for (hyper)parameter values.
1. Table of Contents

2. Background

As a data engineer or machine learning engineer, you are given a mission: create a forecast from a time-series dataset. Your lovely data scientist has already implemented a basic setup using Prophet in a local environment. Things work as expected, but the issue lies in scaling. What if we have gazillions of data points? Could we still run it on a single machine? Technically yes. But…


