Deepali  ·  Data Engineer  ·  5+ yrs

Mid-Level
India5+ years experienceremote
Available within 48 hrs

About Deepali

Deepali Shrivastava is an experienced Data Engineer with a strong background in developing ETL processes and optimizing data pipelines. With over 5.5 years of experience, she has demonstrated expertise in various Big Data technologies, including Hadoop, Spark, and AWS services. Deepali has a proven track record of driving down operational expenses and improving overall system performance through innovative solutions.

Core expertise

Apache Spark
backend
9/10
Hadoop
backend
9/10
SQ
SQL
language
9/10
AWS
cloud
8/10
Python
language
8/10
PySpark
language
8/10
Scala
language
8/10
Salesforce
tooling
8/10
Oracle
database
8/10
SQL Server
database
8/10

Additional skills(9)

AWSSalesforceHadoopPySparkRedshiftSQLAWS Step FunctionsSqoopSpark-Scala

Why hire Deepali?

Production deploy authorityMentored 5+ juniors

Proven ownership in migrating and optimizing data pipelines to AWS Step Functions.

Demonstrated expertise in Redshift cluster management, including encryption and WLM settings.

Consistent recognition for contributions and customer obsession with multiple awards.

Managed encryption across five critical Redshift clusters, minimizing business disruptions.

Successfully migrated all pipelines to Step Functions seamlessly without errors, ensuring uninterrupted functionality.

Spearheaded the migration of S3 files to Glacier for enhanced cost efficiency and S3 management.

Project highlights(4)

Data Pipeline OptimizationData Engineer

Overview: Currently focused on creating and optimizing data pipelines for efficient data processing. Responsibilities: Created and optimized data pipelines, contributing to efficient data processing. Developed and tuned SQL queries to enhance performance in data processing. Managed and maintained Redshift databases to ensure optimal performance and reliability.

AWSRedshiftSQL

Key outcomes:

  • Created and optimized data pipelines for efficient data processing.

  • Developed and tuned SQL queries for performance enhancement.

  • Managed and maintained Redshift databases to ensure optimal performance.

Redshift Cluster ManagementData Engineer

Overview: Managed encryption across five critical Redshift clusters and orchestrated synchronization for production clusters. Responsibilities: Managed encryption across five critical Redshift clusters, collaborating to minimize business disruptions. Orchestrated synchronization for two production clusters, ensuring seamless encryption and smooth transition of WBR jobs. Proposed and implemented migration of all pipelines to Step Functions seamlessly, ensuring uninterrupted functionality.

AWSRedshiftAWS Step Functions

Key outcomes:

  • Managed encryption across five critical Redshift clusters minimizing business disruptions.

  • Successfully migrated all pipelines to Step Functions seamlessly without errors, ensuring uninterrupted functionality.

Salesforce Data MigrationBig Data Developer

Overview: Responsible for data extraction, transformation, and production job management within a Big Data environment. Responsibilities: Extracted and Imported data from Salesforce to DataLake using Teradata Hadoop Connector, ensuring seamless integration. Transformed data according to CDC logic, maintaining historical data using DataFrame in PySpark. Automated the Salesforce Migration component using UNIX shell scripting for efficiency.

SalesforceHadoopPySpark

Key outcomes:

  • Automated Salesforce Migration component using UNIX shell scripting for efficiency.

  • Optimized HiveQL queries for ORC transactional tables, enhancing query performance and efficiency.

Data Lake Ingestion PipelinesHadoop Developer

Overview: Implemented data ingestion pipelines from various sources into a DataLake using CDC logic. Responsibilities: Implemented data ingestion pipelines to extract data from various sources (SFDC, SAP, VISTAAR, EMPOWER, BRAZIL) and loaded it into the A3 DataLake using CDC logic. Utilized Sqoop for importing and exporting data between relational database systems (SQL Server, Oracle) and HDFS.

HadoopSqoopSpark-Scala

Key outcomes:

  • Implemented data ingestion pipelines from various sources using CDC logic.

  • Optimized Spark code performance by implementing best practices.

5+ years of industry experience

SaaS / B2BReported in resume

Ready to work with Deepali?

Onboard within 48 hours. No long hiring cycles, no recruiter middleman.

At a Glance

LocationIndia
Experience5+ years
Work moderemote
Direct hirePossible
Start within48 hours
From$1,725/ month

Single contract. Billed in USD.

Typically responds within 4 business hours.

5-day replacement guarantee
48-hour onboarding, single invoice
Direct chat — no recruiter middleman

Top Skills

Apache Spark
9/10
Hadoop
9/10
SQL
9/10
AWS
8/10
Python
8/10
Seniority signals
Owns production deploysSystem ownerCode reviewerMentor / leads juniorsRecognised OSS contributor
VerifiedVetted by Witarist
Technical skills assessed & verified
Background & identity checked
English communication verified
Ready to onboard in 48 hours

Not sure if this is the right fit?

Tell us your requirements and we'll match you with the best candidates.

Deepali

Big Data Engineer