Data Engineer with 8+ years of experience specializing in Azure Cloud data engineering, with additional on-prem expertise. Experienced in developing robust data engineering pipelines and solutions, utilizing a Scrum-based methodology. Possesses strong technical and analytical skills, consistently delivering efficient data processing and integration solutions. Skilled in leveraging technologies such as Azure Databricks, Azure Data Factory, Snowflake, and PySpark for scalable data architectures.
8+ years Azure Data Engineer + Tech Lead with multi-cloud ETL ownership
SSIS → Azure & Snowflake ETL modernization
20% data processing efficiency increase via Azure Databricks
Multi-DB: Azure Cosmos DB + Azure Database for PostgreSQL + Snowflake + Teradata + Oracle
Azure Synapse Analytics + ADLS + Azure Functions + Logic Apps
CI/CD for Snowflake in Azure DevOps
Legacy: DataStage ETL + CA ERwin Data Modeler + DataStage performance tuning
Custom Snowflake roles + privileges + integrations
Managed overall implementation strategies for Azure data engineering projects
Achieved 20% data processing efficiency increase via end-to-end Databricks ETL pipelines
Developed custom PySpark scripts for raw data → analytics/reporting transformation
Led ETL modernization from SSIS to Azure & Snowflake with CI/CD
Secured environment variables + sensitive data via Azure Key Vault across projects
Utilized Azure Databricks for ML + predictive modeling for forecasting
Key outcomes:
Managed overall implementation strategy and environment setup for Azure services
Increased data processing efficiency by 20% through ETL pipeline implementation
Developed custom PySpark scripts for data transformation
Secured sensitive information using Azure Key Vault
Utilized Azure Databricks for machine learning and predictive modeling, improving business outcomes
Key outcomes:
Implemented validation and transformation logic using the PySpark DataFrame and SQL APIs
Created an Azure Key Vault-backed secret scope for secure integration
Applied performance optimization techniques in PySpark
Estimated cluster sizes and monitored and troubleshot Azure Databricks clusters
Key outcomes:
Coordinated with the client on overall technical project execution
Led modernization of ETL from SSIS to Azure & Snowflake
Implemented CI/CD for Snowflake in Azure DevOps
Helped create custom roles and integrations in Snowflake
Key outcomes:
Built ETL programs for data extraction, transformation, and loading
Performed data integration and validation tests for data warehousing tasks
Developed data models using CA ERwin Data Modeler
Tuned DataStage job performance to address bottlenecks
Sidhu
Azure Data Engineer