Gaugran  ·  Senior Multi-Cloud Data Engineer  ·  5+ yrs

Mid-Level
5+ years experienceremote
Available within 48 hrs

Proof of scale

Databricks Core Certification

About Gaugran

Gaurang is a Data Engineer with 5+ years of experience in designing and implementing scalable data pipelines and workflows. He has extensive expertise in cloud services, data processing, and machine learning integrations.

5+ years of commercial experience in

Skills(20)

AWSAzureApache AirflowDatabricksPythonSnowflakePostgreSQLPySparkTensorFlowdbtAWS GlueAzure Data Lake Storage (ADLS)AWS S3AWS WorkMailAWS LambdaNumpyPandasMatplotlibSeabornSklearn

Why hire Gaugran?

Production deploy authorityExpert in AWS and Azure

Designed and implemented robust end-to-end data pipelines across multiple domains.

Achieved 89% prediction accuracy in a Real Estate Price Prediction Model through optimization.

Automated data workflows using Apache Airflow to ensure scalability and reliability.

Implemented custom data observability systems and alerting mechanisms for early detection of data quality issues.

Project highlights(5)

Google Analytics to Cloud Analytics PipelineData Engineer

Overview: This project designed and implemented a robust end-to-end data pipeline to extract data from Google Analytics and transfer it to a cloud-based analytics platform for business insights. Responsibilities: Designed and implemented the end-to-end data pipeline using AWS Glue, Databricks, and Apache Airflow for orchestration. Leveraged Databricks for extracting large datasets and integrated with AWS S3 for cloud storage. Utilized Apache Airflow to automate scheduling and execution, optimizing resource usage. Applied PySpark for processing and transforming large-scale datasets, enabling real-time and batch capabilities. Implemented a multi-stage transformation process in Snowflake to deliver actionable insights.

AWS GlueDatabricksApache AirflowSnowflakePySpark

Key outcomes:

  • Designed and implemented a robust end-to-end data pipeline for Google Analytics data.

  • Ensured data accuracy and reliability through end-to-end data quality checks.

Product 360 Data Pipeline and Monitoring SystemData Engineer

Overview: This project involved developing and deploying a data pipeline monitoring system for Product 360. Responsibilities: Developed and deployed the data pipeline monitoring system using Databricks for processing and Snowflake for warehousing. Ingested and processed large-scale datasets from Azure Data Lake Storage (ADLS) and Azure Blob Storage using Databricks. Automated the pipeline with Apache Airflow to ensure scalable and reliable data flow to Snowflake. Implemented a custom data observability system to track data health, detecting anomalies and changes in patterns.

DatabricksSnowflakeAzure Data Lake Storage (ADLS)Apache AirflowPySpark

Key outcomes:

  • Deployed a data pipeline and monitoring system for Product 360.

  • Implemented a custom data observability system with anomaly detection.

Customer Data Integration and Consolidation PlatformData Engineer

Overview: This project focused on designing and implementing a customer data integration platform to consolidate data from multiple sources (CRM, e-commerce platforms) into a single source of truth. Responsibilities: Designed and implemented the platform to consolidate customer data from CRM and e-commerce platforms. Leveraged AWS Glue to automate the ETL process into AWS S3, ensuring data consistency. Employed Databricks and PySpark for large-scale processing and complex transformations.

AWS GlueAWS S3DatabricksSnowflakeApache AirflowPySpark

Key outcomes:

  • Designed and implemented a customer data integration platform for a single source of truth.

  • Automated ETL processes for customer data using AWS Glue into AWS S3.

Health Care Data Extraction and TransformationData Engineer

Overview: This project engineered a fully automated data extraction and transformation pipeline for health care systems. Responsibilities: Engineered a fully automated data pipeline leveraging AWS WorkMail, AWS S3, and AWS Lambda. Integrated AWS Lambda with AWS WorkMail to automate Excel file ingestion into an S3 bucket. Triggered Lambda functions for data cleansing and transformation.

AWS WorkMailAWS S3AWS LambdaPostgreSQL

Key outcomes:

  • Engineered a fully automated data extraction and transformation pipeline for healthcare systems.

  • Designed a fault-tolerant and scalable system capable of handling large data volumes.

Real Estate Price Prediction ModelData Scientist / Machine Learning Engineer

  • This project built an advanced real estate price prediction model with high accuracy using machine learning algorithms.
  • It aimed to provide stakeholders with easily interpretable predicted real estate prices.
  • Built the prediction model using linear regression, SVM, and decision trees.
  • Conducted extensive data analysis and feature engineering using Pandas, Numpy, and Seaborn for preprocessing.
  • Utilized Sklearn and TensorFlow to train, test, and validate various ML models.
  • Visualized key data insights and model results using Matplotlib and Seaborn.
  • Developed an optimization pipeline for model selection and hyperparameter tuning, achieving 89% prediction accuracy.
PythonNumpyPandasMatplotlibSeabornSklearnTensorFlow

Key outcomes:

  • Achieved 89% prediction accuracy for the real estate price prediction model.

  • Developed an optimization pipeline for model selection and hyperparameter tuning.

  • Enabled stakeholders to easily interpret predicted prices through effective visualizations.

Industry experience

Logistics & Supply Chain

Reported in resume

Ready to work with Gaugran?

Schedule an interview and onboard within 48 hours. No long hiring cycles.

At a Glance

Experience5+ years
Work moderemote
Starting from₹1.7 L/mo
Direct hirePossible
Start within48 hours
From₹1.7 L/ month

Single contract. No agency markup confusion.

Typically responds within 4 business hours.

5-day replacement guarantee
48-hour onboarding, single invoice
Direct chat — no recruiter middleman
Seniority signals
Owns production deploysGreenfield architectSystem ownerRecognised OSS contributor
VerifiedVetted by Witarist
Technical skills assessed & verified
Background & identity checked
English communication verified
Ready to onboard in 48 hours

Not sure if this is the right fit?

Tell us your requirements and we'll match you with the best candidates.

Gaugran

Data Engineer