Shubham Gagrani  ·  Senior AWS Spark Data Engineer  ·  5+ yrs

Mid-Level
Indore5+ years experienceremote
Available within 48 hrs

About Shubham

Shubham Gagrani is a seasoned Data Engineer with over 5.5 years of experience in developing and optimizing data pipelines. He has a proven track record of leveraging technologies such as Python, Apache Spark, and AWS to deliver scalable solutions. His expertise extends to cloud infrastructure management and AI/ML model development, making him a versatile asset for any data-driven organization.

Core expertise

Python
language
10/10
Apache Spark
other
9/10
AWS
cloud
8/10
Terraform
devops
8/10
Docker
devops
8/10

Additional skills(24)

PythonFlaskBootstrapMySQLTerraformDockerPySparkPandasSparkSelenium

Why hire Shubham?

Production deploy authorityLed construction of AI models

Designed and maintained diverse data pipelines handling large amounts of data.

Increased pipeline speeds by up to 10x and reduced errors by 20 percent.

Successfully set up and managed AWS services, implementing CI/CD pipelines.

Increased pipeline speeds by up to 10x.

Reduced errors by 20 percent in data extraction and storage processes.

Attracted 235 registered users to a SaaS web application.

Project highlights(5)

Fine-Tuned Stable Diffusion ModelAI Developer

Overview: Fine-tuned a Stable Diffusion model to generate artwork in the traditional Japanese Ukiyo-e style. Responsibilities: Developed an AI Model by fine-tuning a Stable Diffusion model on Japanese Ukiyo-e style images.

PythonHugging Face

Key outcomes:

  • Optimized training using 33 instance images and 1,000 regularization images.

SaaS Sound Generation ApplicationDeveloper

Overview: A web application to generate sound samples from text prompts, deployed on Render. Responsibilities: Developed and deployed a SaaS application using Flask, Bootstrap, MySQL, and replicate.ai for AI model inference.

PythonFlaskBootstrapMySQLreplicate.ai

Key outcomes:

  • Successfully attracted and managed 235 registered users.

  • Ensured reliable performance and security through secure payment processing.

E-commerce Data PipelineData Engineer

Overview: Zid is a cutting-edge e-commerce platform designed for SaaS applications. Responsibilities: Developed data pipelines using the medallion architecture for analytics. Maintained MySQL databases, automated tasks, and monitored AWS Glue jobs.

PythonMySQLAWS GlueTerraformDockerPySparkAirflowRedshiftAWS LambdaQuicksight

Key outcomes:

  • Ensured smooth operation of MySQL databases through maintenance and performance optimization.

  • Implemented automation strategies to streamline repetitive tasks.

Media Data AggregatorData Engineer / ML Engineer

Overview: Velocity Media Data Aggregator is an innovative project for extracting, integrating, and presenting public information about places in South Africa. Responsibilities: Developed and maintained data extraction scripts using Apify and stored data in AWS S3.

PythonAWS S3AWS LambdaPandasMySQLSparkPySparkApifyRDSStep Functions

Key outcomes:

  • Developed and executed comprehensive testing plans to ensure data accuracy.

  • Successfully set up and managed AWS services, implementing CI/CD pipelines.

Real Estate Ads AggregatorData Engineer

Overview: KI Immo is a real estate ads aggregator website that collects and normalizes ads from diverse sources. Responsibilities: Extracted data from 15+ diverse sources using SQL connections, Selenium, Airflow, Spark jobs, Kafka, and APIs.

PythonSQLSeleniumAirflowSparkPandasDockerKubernetesKafka

Key outcomes:

  • Increased pipeline speeds by up to 10x by transitioning to a different data extraction framework.

  • Reduced errors by 20 percent by applying a data quality check mechanism.

5+ years of industry experience

  • E-commerce Data PipelineData EngineerPython · MySQL · AWS Glue · Terraform +6
  • Media Data AggregatorData Engineer / ML EngineerPython · AWS S3 · AWS Lambda · Pandas +6
  • Real Estate Ads AggregatorData EngineerPython · SQL · Selenium · Airflow +5
  • SaaS Sound Generation ApplicationDeveloperPython · Flask · Bootstrap · MySQL +1
  • Fine-Tuned Stable Diffusion ModelAI DeveloperPython · Hugging Face
Real Estate1 project
  • Real Estate Ads AggregatorData EngineerPython · SQL · Selenium · Airflow +5

Ready to work with Shubham?

Onboard within 48 hours. No long hiring cycles, no recruiter middleman.

At a Glance

LocationIndore
Experience5+ years
Work moderemote
Direct hirePossible
Start within48 hours
From$1,725/ month

Single contract. Billed in USD.

Typically responds within 4 business hours.

5-day replacement guarantee
48-hour onboarding, single invoice
Direct chat — no recruiter middleman

Top Skills

Python
10/10
Apache Spark
9/10
AWS
8/10
Terraform
8/10
Docker
8/10
Seniority signals
Owns production deploysGreenfield architectSystem owner
VerifiedVetted by Witarist
Technical skills assessed & verified
Background & identity checked
English communication verified
Ready to onboard in 48 hours

Not sure if this is the right fit?

Tell us your requirements and we'll match you with the best candidates.

Shubham Gagrani

Python Developer