My Work
Showcasing my journey as a Senior Data Engineer - from designing scalable ETL pipelines to architecting cloud-native data solutions on AWS.
Featured Projects
Automated Data Lake Architecture with Apache Hudi
Developed a scalable Data Lake on AWS to process high-frequency Change Data Capture (CDC) logs from MS SQL Server into a transactional storage layer.
View DetailsAWS CDC ETL Pipeline with Real-time Data Processing
Cloud-based data engineering solution using AWS services for real-time data migration from SQL Server to AWS with CDC capabilities.
View DetailsOracle to PostgreSQL Migration
Architected a high-performance migration pipeline moving ~1 billion records per load from an Oracle Data Warehouse to Aurora PostgreSQL using parallel DB-Link async sessions, partitioned tables, and S3-based hybrid-cloud ingestion.
View DetailsAll Projects
Automated Data Lake Architecture with Apache Hudi
Developed a scalable Data Lake on AWS to process high-frequency Change Data Capture (CDC) logs from MS SQL Server into a transactional storage layer.
AWS CDC ETL Pipeline with Real-time Data Processing
Cloud-based data engineering solution using AWS services for real-time data migration from SQL Server to AWS with CDC capabilities.
Oracle to PostgreSQL Migration
Architected a high-performance migration pipeline moving ~1 billion records per load from an Oracle Data Warehouse to Aurora PostgreSQL using parallel DB-Link async sessions, partitioned tables, and S3-based hybrid-cloud ingestion.
Technical Expertise
AWS Cloud
S3, Glue, Lambda, DMS, EMR, Kinesis, Step Functions
Apache Spark
PySpark, Spark SQL, DataFrame API, Optimizations
Real-time Streaming
Kafka, Kinesis, Apache Iceberg
Databases
PostgreSQL, MySQL, Oracle, DynamoDB
Interested in Working Together?
Let's discuss how I can help with your data engineering challenges.
Get in Touch