Imran  ·  Lead AWS Data Engineer  ·  9+ yrs

Senior
Dehradun9+ years experienceremote
Available within 48 hrs

About Imran

Imran Shaik leverages over 9 years of experience in software development, focusing on data engineering for the past 6 years. He has a proven track record of building and executing end-to-end ETL data pipelines using various technologies including Apache Spark and AWS. His expertise in orchestrating complex workflows with Airflow and integrating CI/CD tools like Jenkins enhances his ability to deliver efficient data solutions. Imran has worked extensively in the healthcare domain, contributing to projects that improve patient care and reduce costs.

Core expertise

AA
Apache Airflow
devops
9/10
Apache Spark
language
9/10
Python
language
9/10
SQ
SQL
language
9/10
AWS
cloud
8/10
Jenkins
devops
8/10

Additional skills(18)

Apache SparkAWSHivePySparkSparkSQL ServerOracleJavaAirflowPresto

Why hire Imran?

Production deploy authorityMentored juniorsBuilt from scratch

Ownership of end-to-end data pipeline development across multiple projects.

Developed a common big data platform on AWS for product data processing and analytics.

Automated unit testing platforms and contributed to CI/CD pipeline integration using GitHub/Jenkins.

Onboarded country-specific data from 15 countries to data lakes, improving clinical trial success rates.

Implemented Airflow for efficient job scheduling and managing complex data dependencies.

Successfully built and executed end-to-end ETL data pipelines using Apache Spark, Hive, and Presto.

Contributed to building scalable frameworks that reduce healthcare costs and improve patient care.

Project highlights(5)

Common DATA PlatformData Engineer

Overview: This project involved building a common big data platform on AWS to store and process the company's product data. Responsibilities: Built data pipelines using Apache Spark (Spark SQL and Data Frames). Developed a data input pipeline, storing raw JSON events/database data in an AWS S3 Data Lake. Utilized Pig, Hive, PySpark, and Presto to process data and create tables. Employed Airflow to schedule tasks and manage dependencies for ETL processing. Stored processed output in a data warehouse for analytics and model creation, and developed an automated platform for unit testing.

Apache SparkAWSAirflowHivePrestoPySpark

Key outcomes:

  • Built end-to-end data pipelines on AWS for product data processing.

  • Developed an automated platform for unit testing.

PIMS HealthcareData Engineer

Overview: PIMS is a scalable framework designed to enable WellPoint/Anthem to contract with Primary Care Physicians. Responsibilities: Understood Process Design Documents and project requirements for data processing. Analyzed and worked with BTEQ scripts and created customized HQLs. Developed Spark jobs using SparkSQL and DataFrames. Used Control-M for job scheduling and employed Informatica to pull data from multiple sources. Prepared and shared daily status updates with stakeholders and attended project scrum meetings.

SparkSQLControl-MInformaticaHive

Key outcomes:

  • Developed Spark jobs for data processing using SparkSQL and DataFrames.

ACOE Clinical TrialsData Engineer

Overview: The ACOE project focused on loading country-specific data into individual data lakes and automating syndicated analytics. Responsibilities: Understood Process Design Documents and gathered requirements for data pipeline construction. Extracted data from sources like SQL Server, flat files, and Oracle using Sqoop and Spark. Prepared customized HQLs and validated transformation logic. Scheduled jobs using an internal Job Scheduler and provided training to new team members.

Big DataHiveSQL ServerSqoopOracle

Key outcomes:

  • Onboarded data from 15 countries into data lakes.

AppScriptDeveloper

AppScript — platform for discovering + prescribing + tracking digital patient engagement tools + electronic prescribing of digital health apps + devices + content.

JavaSQL DeveloperSQL

Key outcomes:

  • Wrote test cases and prepared SQL logic for data copying and validation.

  • Involved in automation testing using Java.

  • Prepared data for web and mobile applications.

Mobile Intelligence (MI Touch, MI Online)Developer

Mobile Intelligence (MI Touch + MI Online) — web-enabled solution for life science companies to streamline product launches + promotional activities.

KOMODOPL/SQLOracleSQL

Key outcomes:

  • Loaded flat file data into the database using KOMODO.

  • Implemented Slowly Changing Dimensions (SCD).

  • Validated source-to-target transformation logic.

9+ years of industry experience

HealthTech3 projects
  • PIMS HealthcareData EngineerSpark · SQL · Control-M · Informatica +1
  • ACOE Clinical TrialsData EngineerBig Data · Hive · SQL Server · Sqoop +1
  • AppScriptDeveloperJava · SQL Developer · SQL

Ready to work with Imran?

Onboard within 48 hours. No long hiring cycles, no recruiter middleman.

At a Glance

LocationDehradun
Experience9+ years
Work moderemote
Direct hirePossible
Start within48 hours
From$2,156/ month

Single contract. Billed in USD.

Typically responds within 4 business hours.

5-day replacement guarantee
48-hour onboarding, single invoice
Direct chat — no recruiter middleman

Top Skills

Apache Airflow
9/10
Apache Spark
9/10
Python
9/10
SQL
9/10
AWS
8/10
Seniority signals
Owns production deploysGreenfield architectSystem ownerCode reviewerMentor / leads juniors
VerifiedVetted by Witarist
Technical skills assessed & verified
Background & identity checked
English communication verified
Ready to onboard in 48 hours

Not sure if this is the right fit?

Tell us your requirements and we'll match you with the best candidates.

Imran

Data Engineer