Data Quality on Denis Gontcharov

Data Quality on Denis Gontcharov https://gontcharov.eu/tags/data-quality/ Recent content in Data Quality on Denis Gontcharov Denis Gontcharov https://gontcharov.eu/%3Clink%20or%20path%20of%20image%20for%20opengraph,%20twitter-cards%3E https://gontcharov.eu/%3Clink%20or%20path%20of%20image%20for%20opengraph,%20twitter-cards%3E Hugo -- 0.128.0 en Tue, 01 Apr 2025 16:08:03 +0200 🎧 Industrial Data Quality Podcast E1: Introduction https://gontcharov.eu/posts/podcast/e01-introduction/ Tue, 01 Apr 2025 16:08:03 +0200 https://gontcharov.eu/posts/podcast/e01-introduction/ Welcome to my podcast! In this very first episode I introduce the topics of this podcast and explain my background in data. Follow the show About Denis Gontcharov Hosting Great Expectations Data Docs on Azure Blob Storage https://gontcharov.eu/posts/great-expectations-azure/ Thu, 20 Feb 2025 18:17:42 +0100 https://gontcharov.eu/posts/great-expectations-azure/ Resources Check out the complete code on GitHub. Browse the GX Data Doc on Azure Blob Storage. Use Case Last week I explored Soda as a data quality testing framework for my large enterprise client. This week I’m exploring a more mature alternative called Great Expectations or GX in short. GX generates neat HTML reports called Data Docs that give an overview of your data quality test results. The client wants to share these reports with the team - but not with the world! Exploring Soda Data Quality Testing on Databricks https://gontcharov.eu/posts/exploring-soda-data-quality-framework/ Fri, 14 Feb 2025 10:08:53 +0100 https://gontcharov.eu/posts/exploring-soda-data-quality-framework/ Use Case For my current engagement I’m tasked with developing an automated data quality framework for a large industrial enterprise in the renewable energy sector. The client has over a hundred independent SCADA systems from various vendors gathering energy production data. All this data has to flow in one central repository to be analyzed with Databricks. The client is obligated to ensure high data quality for contractual reporting to external parties.