🎥 Testing Data Quality with Soda Core in Databricks

In this video I demonstrate how to perform data quality checks on a Delta table in Databricks using Soda Core. Soda Core is the open-source Python package developed by Soda. It can be compared to Great Expectations, but is much simpler in my opinion. I enjoy using Soda in my professional projects and will continue exploring this framework. ...

March 29, 2025

Hosting Great Expectations Data Docs on Azure Blob Storage

Resources Check out the complete code on GitHub. Browse the GX Data Doc on Azure Blob Storage. Use Case Last week I explored Soda as a data quality testing framework for my large enterprise client. This week I’m exploring a more mature alternative called Great Expectations or GX in short. GX generates neat HTML reports called Data Docs that give an overview of your data quality test results. The client wants to share these reports with the team - but not with the world! As the client is already using Azure, hosting the report files on Azure Blob Storage seems like a good solution. ...

February 20, 2025

Exploring Soda Data Quality Testing on Databricks

Use Case For my current engagement I’m tasked with developing an automated data quality framework for a large industrial enterprise in the renewable energy sector. The client has over a hundred independent SCADA systems from various vendors gathering energy production data. All this data has to flow in one central repository to be analyzed with Databricks. The client is obligated to ensure high data quality for contractual reporting to external parties. Failure to deliver incurs high financial penalties. ...

February 14, 2025

Developing a Qtile VPN Widget

I often work from WeWork, which is a public co-working space with a shared WIFI-network. Securing my connection For security and privacy reasons, I therefore work behind a VPN. I purchased an AirPVN subscription and configured it as an OpenVPN connection in NetworkManager1: NetworkManager’s terminal user interface nmtui This way, even if I’m working from Prague, my location will always be shown as Belgium2. This was important because my current enterprise client’s IT-security team kept receiving alarms that my location was changing all over Europe. ...

February 13, 2025

🎧 Smart Metals Podcast E13: From the Shop Floor to the Cloud: AI in Metal Manufacturing

In this episode of the Smart Metals Podcast Denis and I, dive into the topic of transferring shop floor data to the cloud and leveraging AI for predictive maintenance and other use-cases in the metals industry. We discuss essential components like the Unified Namespace (UNS) and Data-Centric AI, highlighting why smaller manufacturers shouldn’t shy away from cloud technology. We also break down common misconceptions about AI, particularly for SMBs, and explore the benefits of cloud services. You’ll hear a step-by-step approach to implementing predictive maintenance and get practical advice on how to start using AI-driven insights—without excessive costs or complexity. ...

January 31, 2025