P-01 · streaming
Real-time Fraud Detection
Sub-second fraud scoring on 1.2M events/s using Kafka, Flink and a feature store feeding online ML models.
Kafka Flink Snowflake
P-02 · migration
Lakehouse Migration
Migrated a 2PB legacy warehouse to a Delta Lakehouse on Databricks, cutting compute cost 38% and query times in half.
Spark Databricks Delta Lake
P-03 · platform
Self-serve Analytics
Built a dbt + Airflow + BigQuery platform with 300+ models and automated tests, giving 200 analysts trustworthy self-serve data.
dbt Airflow BigQuery
P-04 · cdc
CDC Streaming Sync
Change-data-capture pipeline syncing 80+ Postgres tables to Snowflake in near real time with Debezium and Kafka Connect.
Debezium Kafka Connect Postgres