Microsoft
Industry: Manufacturing

Data Lakehouse Solution using Databricks for Go-Pak

Read how Go-Pak partnered with DSP to consolidate their sales data using Azure Databricks and Power BI, improving reporting efficiency and cross-company visibility.

Microsoft Azure
Power BI
Databricks

Overview

Go-Pak is an award-winning manufacturing company with operations in the UK and factory sites in Vietnam. They provide packaging solutions to the food service, cash & carry, and retail sectors globally. To gain a unified view of their operations and improve decision-making, Go-Pak partnered with DSP to build a Data Lakehouse solution that consolidates sales data across all their sites.

Microsoft Solutions Partner Data and AI Azure Colour

The Challenge

Each Go-Pak site stored its sales data in a separate source system, which limited cross-company visibility. Go-Pak wanted a consolidated dataset built on a robust solution. This meant improving data load efficiency and data modelling to achieve shorter load times, faster query speeds, and effective storage of data changes.

The Solution

To meet Go-Pak’s requirements, DSP used Power BI and Databricks on Azure. In Databricks, a medallion architecture processes the raw data from the source systems into a structured model suitable for reporting. Here the standard bronze, silver, and gold layers are extended with a platinum layer:

  • Bronze layer: Ingest raw data from each site’s source systems using incremental load for loading efficiency.

  • Silver layer: Consolidate and deduplicate the sales data from each source system into a single table.

  • Gold layer: Model the data for reporting using aggregation, and implement Slowly Changing Dimensions (SCD) type 2 to store data changes.

  • Platinum layer: Materialise gold tables as views with only necessary reporting fields selected, and row-level security applied.
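
The SCD type 2 behaviour in the gold layer can be illustrated with a minimal sketch. It uses plain Python rather than Databricks code, and the customer dimension, its `region` attribute, and the `apply_scd2` helper are all hypothetical names chosen for illustration, not taken from Go-Pak's model. The idea is that a changed record closes the existing version and opens a new one, so earlier values stay queryable.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class CustomerVersion:
    """One version of a dimension row (hypothetical customer dimension)."""
    customer_id: str
    region: str                # the tracked attribute (an assumption)
    valid_from: date
    valid_to: Optional[date]   # None while this version is current
    is_current: bool


def apply_scd2(history, customer_id, region, as_of):
    """Return a new history list with the incoming record applied as SCD type 2."""
    current = next(
        (v for v in history if v.customer_id == customer_id and v.is_current),
        None,
    )
    if current is not None and current.region == region:
        return list(history)  # attribute unchanged, so no new version is needed

    updated = []
    for version in history:
        if version is current:
            # Close the old version instead of overwriting it.
            version = replace(version, valid_to=as_of, is_current=False)
        updated.append(version)
    # Open a new current version for the changed (or brand-new) key.
    updated.append(CustomerVersion(customer_id, region, as_of, None, True))
    return updated


history = apply_scd2([], "C001", "UK", date(2024, 1, 1))
history = apply_scd2(history, "C001", "Vietnam", date(2024, 6, 1))
history = apply_scd2(history, "C001", "Vietnam", date(2024, 7, 1))  # no change
```

After these three calls the history holds two versions: the closed "UK" row and the current "Vietnam" row. In Databricks the same pattern would more likely be expressed as a merge into a Delta table, but that is outside what the case study describes.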

A daily Databricks job was configured to process the sales data through these layers, ensuring the data remained up-to-date. Furthermore, separate development, testing, and production environments were set up in Databricks with the principle of least privilege (PoLP) implemented for security.
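
As a rough sketch of what one daily run does in the bronze and silver layers, the following plain Python example filters each site's rows by a last-load watermark and then keeps the latest row per order. The column names (`order_id`, `amount`, `modified_at`), the site codes, and the watermark approach are assumptions made for illustration; the case study does not say how incremental load or deduplication were implemented.

```python
from datetime import datetime


def incremental_load(source_rows, watermark):
    """Bronze step: keep only rows modified after the last successful load."""
    return [row for row in source_rows if row["modified_at"] > watermark]


def consolidate(batches):
    """Silver step: merge per-site batches, keeping the latest row per (site, order)."""
    latest = {}
    for site, rows in batches.items():
        for row in rows:
            key = (site, row["order_id"])
            if key not in latest or row["modified_at"] > latest[key]["modified_at"]:
                latest[key] = {**row, "site": site}
    return sorted(latest.values(), key=lambda r: (r["site"], r["order_id"]))


watermark = datetime(2024, 6, 1)  # hypothetical time of the previous load
uk_rows = [
    {"order_id": 1, "amount": 100.0, "modified_at": datetime(2024, 5, 30)},
    {"order_id": 2, "amount": 250.0, "modified_at": datetime(2024, 6, 2)},
    {"order_id": 2, "amount": 260.0, "modified_at": datetime(2024, 6, 3)},
]
vn_rows = [
    {"order_id": 2, "amount": 75.0, "modified_at": datetime(2024, 6, 2)},
]

silver = consolidate({
    "UK": incremental_load(uk_rows, watermark),
    "VN": incremental_load(vn_rows, watermark),
})
```

Keying on the site as well as the order number matters because separate source systems can reuse the same identifiers; here UK order 2 and VN order 2 remain distinct rows in the consolidated table.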

The Result

As a result, Power BI dashboards now connect to Databricks rather than directly to the source systems. Go-Pak now have cross-company visibility of sales data with the option to switch the currency between GBP and USD. The implemented modelling has simplified the interpretation of sales data, improved query speeds, and provided access to historical data changes. Finally, Power BI refreshes are now 6x faster, with refresh time reduced from 1 hour to 10 minutes.
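
One way a GBP/USD switch could work is to store amounts in a single base currency and convert at report time. The sketch below assumes GBP as the base currency and uses a made-up exchange rate; the `RATES` table and `convert` function are hypothetical, and the case study does not describe how the conversion is actually done.

```python
from decimal import Decimal, ROUND_HALF_UP

# Hypothetical rates from the assumed base currency (GBP); real rates would
# come from a maintained exchange-rate table.
RATES = {
    "GBP": Decimal("1"),
    "USD": Decimal("1.25"),  # illustrative value only
}


def convert(amount_gbp, currency):
    """Convert a GBP amount into the selected reporting currency."""
    if currency not in RATES:
        raise ValueError(f"Unsupported currency: {currency}")
    converted = amount_gbp * RATES[currency]
    # Round to two decimal places for display.
    return converted.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


sales_gbp = Decimal("100.00")
sales_usd = convert(sales_gbp, "USD")
```

Using `Decimal` rather than floating point avoids rounding surprises on monetary values.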

Added value

Unified Data Visibility

DSP's solution enabled Go-Pak to achieve a comprehensive, cross-company view of sales data by consolidating previously siloed information.

Improved Efficiency

The solution delivered significant improvements in data load times and query speeds, highlighted by the faster Power BI refreshes.

Data Tracking

DSP implemented SCD type 2 data modelling, allowing Go-Pak to store and access historical data changes for deeper analysis.

Why DSP?

  • We are a leading Microsoft Partner
  • DSP can help you navigate Power BI, enabling you to visualise and analyse data with greater speed, efficiency, and understanding
  • We can help your business reduce costs, reduce waste, and improve efficiency

Want to see what we can do for your business? 

Get in touch with our specialists today and start your journey with industry experts.