Building Scalable Data Platforms
Specializing in Apache Spark, Delta Lake, and Databricks to architect lakehouse solutions on Azure.
Latest Articles
Deep dives into data engineering topics
Streaming Failure Models: Why "It Didn't Crash" Is the Worst Outcome
Most Databricks streaming failures don't look dramatic. No cluster termination, no red wall of errors. Just a job that says RUNNING while your customers report nonsense.
Understanding Delta Table Partition Size Distribution Using the Delta Log
Learn how to inspect the Delta transaction log to understand your partition size distribution and make informed partitioning decisions.
Advanced Delta Lake Optimization Techniques
A deep dive into Z-ordering, data skipping, and compaction strategies to maximize Delta Lake performance.
Featured Projects
Architecture case studies and implementations
Enterprise Lakehouse Migration
Migrated a legacy data warehouse to a modern lakehouse architecture, reducing costs by 45% and improving query performance.
Real-Time Analytics Platform
Built a streaming data platform that processes 10M+ events per day with sub-second latency using Structured Streaming.
Let's Connect
Always open to connecting with fellow data engineers, sharing knowledge, and discussing the latest in data platform technologies.
Get in Touch →