Deepali · Data Engineer · 5+ yrs

Mid-Level

India5+ years experienceremote

Available within 48 hrs

About Deepali

Deepali Shrivastava is an experienced Data Engineer with a strong background in developing ETL processes and optimizing data pipelines. With over 5.5 years of experience, she has demonstrated expertise in various Big Data technologies, including Hadoop, Spark, and AWS services. Deepali has a proven track record of driving down operational expenses and improving overall system performance through innovative solutions.

Core expertise

Apache Spark

backend

9/10

Hadoop

backend

9/10

SQL

language

9/10

AWS

cloud

8/10

Python

language

8/10

PySpark

language

8/10

Scala

language

8/10

Salesforce

tooling

8/10

Oracle

database

8/10

SQL Server

database

8/10

Additional skills(9)

AWSSalesforceHadoopPySparkRedshiftSQLAWS Step FunctionsSqoopSpark-Scala

Why hire Deepali?

Production deploy authorityMentored 5+ juniors

Proven ownership in migrating and optimizing data pipelines to AWS Step Functions.

Demonstrated expertise in Redshift cluster management, including encryption and WLM settings.

Consistent recognition for contributions and customer obsession with multiple awards.

Managed encryption across five critical Redshift clusters, minimizing business disruptions.

Successfully migrated all pipelines to Step Functions seamlessly without errors, ensuring uninterrupted functionality.

Spearheaded the migration of S3 files to Glacier for enhanced cost efficiency and S3 management.

Project highlights(4)

Data Pipeline Optimization – Data Engineer

Overview: Currently focused on creating and optimizing data pipelines for efficient data processing. Responsibilities: Created and optimized data pipelines, contributing to efficient data processing. Developed and tuned SQL queries to enhance performance in data processing. Managed and maintained Redshift databases to ensure optimal performance and reliability.

AWSRedshiftSQL

Key outcomes:

Created and optimized data pipelines for efficient data processing.
Developed and tuned SQL queries for performance enhancement.
Managed and maintained Redshift databases to ensure optimal performance.

Redshift Cluster Management – Data Engineer

Overview: Managed encryption across five critical Redshift clusters and orchestrated synchronization for production clusters. Responsibilities: Managed encryption across five critical Redshift clusters, collaborating to minimize business disruptions. Orchestrated synchronization for two production clusters, ensuring seamless encryption and smooth transition of WBR jobs. Proposed and implemented migration of all pipelines to Step Functions seamlessly, ensuring uninterrupted functionality.

AWSRedshiftAWS Step Functions

Key outcomes:

Managed encryption across five critical Redshift clusters minimizing business disruptions.
Successfully migrated all pipelines to Step Functions seamlessly without errors, ensuring uninterrupted functionality.

Salesforce Data Migration – Big Data Developer

Overview: Responsible for data extraction, transformation, and production job management within a Big Data environment. Responsibilities: Extracted and Imported data from Salesforce to DataLake using Teradata Hadoop Connector, ensuring seamless integration. Transformed data according to CDC logic, maintaining historical data using DataFrame in PySpark. Automated the Salesforce Migration component using UNIX shell scripting for efficiency.

SalesforceHadoopPySpark

Key outcomes:

Automated Salesforce Migration component using UNIX shell scripting for efficiency.
Optimized HiveQL queries for ORC transactional tables, enhancing query performance and efficiency.

Data Lake Ingestion Pipelines – Hadoop Developer

Overview: Implemented data ingestion pipelines from various sources into a DataLake using CDC logic. Responsibilities: Implemented data ingestion pipelines to extract data from various sources (SFDC, SAP, VISTAAR, EMPOWER, BRAZIL) and loaded it into the A3 DataLake using CDC logic. Utilized Sqoop for importing and exporting data between relational database systems (SQL Server, Oracle) and HDFS.

HadoopSqoopSpark-Scala

Key outcomes:

Implemented data ingestion pipelines from various sources using CDC logic.
Optimized Spark code performance by implementing best practices.

5+ years of industry experience

SaaS / B2BReported in resume

Ready to work with Deepali?

Onboard within 48 hours. No long hiring cycles, no recruiter middleman.

At a Glance

LocationIndia

Experience5+ years

Work moderemote

Direct hirePossible

Start within48 hours

From$1,725/ month

Single contract. Billed in USD.

Typically responds within 4 business hours.

5-day replacement guarantee

48-hour onboarding, single invoice

Direct chat — no recruiter middleman

Top Skills

Apache Spark

9/10

Hadoop

9/10

SQL

9/10

AWS

8/10

Python

8/10

Seniority signals

Owns production deploysSystem ownerCode reviewerMentor / leads juniorsRecognised OSS contributor

Vetted by Witarist

Technical skills assessed & verified

Background & identity checked

English communication verified

Ready to onboard in 48 hours

Not sure if this is the right fit?

Tell us your requirements and we'll match you with the best candidates.

Deepali

Big Data Engineer