These artifacts are part of a blog that describes a sample use case for predicting point-of-sale (POS) sales across stores, coordinated with inventory and supply chain data. PySpark is used for the data and ML pipelines on Databricks, orchestrated with Control-M from BMC, to predict POS sales and forecast inventory items in production.
• Databricks Data Intelligence Platform on Azure
• PySpark
• Python Pandas library
• Python Seaborn library for data visualization
• Jupyter Notebooks on Databricks
• Parquet and Delta file formats
• Code for data ingestion, processing, ML training and serving, and saving forecast results to the Databricks Lakehouse in Delta format (see the sketch after this list)
• Code for workflow orchestration with Control-M to coordinate all activities and tasks and handle failure scenarios
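To make the first bullet concrete, here is a minimal PySpark sketch of the ingest-process-train-serve-save flow. It is not the blog's actual artifact code: the paths, table names, columns, and the choice of a gradient-boosted tree regressor are illustrative assumptions.

```python
# Minimal sketch of the ingestion-and-forecast flow described above.
# Paths, table names, and columns are hypothetical, not the blog's artifacts.
from pyspark.sql import SparkSession, functions as F
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.regression import GBTRegressor

spark = SparkSession.builder.getOrCreate()  # provided by the Databricks runtime

# Ingest raw POS transactions stored as Parquet (hypothetical mount path)
pos_df = spark.read.parquet("/mnt/raw/pos_transactions")

# Process: aggregate daily units sold per store and item
daily_sales = (
    pos_df.groupBy("store_id", "item_id", F.to_date("txn_ts").alias("sale_date"))
          .agg(F.sum("quantity").alias("units_sold"))
          .withColumn("day_of_week", F.dayofweek("sale_date"))
)

# Train a simple forecasting model (assumes numeric store_id / item_id)
assembler = VectorAssembler(
    inputCols=["store_id", "item_id", "day_of_week"], outputCol="features"
)
train_df = assembler.transform(daily_sales)
model = GBTRegressor(featuresCol="features", labelCol="units_sold").fit(train_df)

# Serve: score the data and persist forecasts to the Lakehouse in Delta format
forecasts = model.transform(train_df).select(
    "store_id", "item_id", "sale_date",
    F.col("prediction").alias("forecast_units"),
)
forecasts.write.format("delta").mode("overwrite").saveAsTable("lakehouse.pos_forecasts")
```

In the blog's setup, each stage of this flow would run as a Databricks job that Control-M triggers and monitors, so a failure at any step can be retried or escalated without rerunning the whole pipeline.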