Naman  ·  Lead Snowflake Data Engineer  ·  9+ yrs

Senior
Indore9+ years experienceremote
Available within 48 hrs

About Naman

Naman Jain is a seasoned Data Engineer with extensive experience in designing and implementing ETL pipelines and data solutions. He has demonstrated leadership in managing critical data infrastructure and optimizing performance for large-scale data processing. His expertise spans across various technologies including Python, PySpark, and Snowflake, making him a valuable asset for any data-driven organization.

Core expertise

Python
language
9/10
PySpark
language
9/10
Snowflake
database
9/10
AI
Airflow
devops
8/10
Kubernetes
devops
8/10
Cassandra
database
8/10
AWS
cloud
8/10
Docker
devops
8/10

Additional skills(17)

PythonPySparkCassandraSnowflakeKubernetesDockerAWSAnsibleJenkinsKafka

Why hire Naman?

Production deploy authorityMentored 5+ juniors

Led development efforts for ETL pipelines handling over 1 billion rows of data.

Achieved a 40% reduction in query processing time through performance tuning in Snowflake.

Managed Kubernetes deployments for critical data infrastructure, ensuring high availability.

Developed a client upgrade system capable of upgrading over 350 clients weekly.

Configured automated ETL jobs with comprehensive monitoring and alert systems.

Successfully handled and ingested over 1 billion rows of data from telecom networks.

Reduced query processing time by 40% in Snowflake through performance tuning.

Developed and deployed a client upgrade system capable of upgrading over 350 clients/customers weekly using Ansible and Jenkins.

Project highlights(4)

Telecommunications Data ProcessingLead Developer

Overview: Developed an application mediation layer to ingest and transform huge volumes of data from telecom network sites for efficient storage and analytical use. Responsibilities: Developed ETL pipelines from scratch using Python, PySpark, Airflow, and Cassandra, and deployed them in Snowflake. Managed Kubernetes deployment for PySpark and Cassandra. Created data models and conducted POCs for data storage technologies (NoSQL or NewSQL). Configured ODI's scheduling framework for automated ETL jobs with monitoring and alert systems.

PythonPySparkAirflowCassandraSnowflakeKubernetes

Key outcomes:

  • Handled and ingested over 1 billion rows of data successfully.

SIEM Data IngestionSenior Data Developer

Overview: Contributed to a Security Incident and Event Management (SIEM) tool that collects logs from hundreds of sources for analytics to detect cyber-attacks and vulnerabilities. Responsibilities: Developed and containerized an AWS CloudTrail connector/ingester agent using Docker for easy deployment. Created Python-based connectors to fetch and ETL data from multiple input sources. Developed a client upgrade system using Ansible and Jenkins, capable of upgrading over 350 clients/customers weekly. Implemented performance tuning for ODI mappings and robust error handling for ETL processes.

PythonODIDockerAWSAnsibleJenkins

Key outcomes:

  • Capable of upgrading over 350 clients/customers in a single week.

Real-Time Data IntegrationSenior ETL Developer

Overview: Led a critical initiative to implement a real-time data integration solution using Oracle Data Integrator (ODI) and Oracle GoldenGate for synchronizing data between operational systems and a central data repository. Responsibilities: Designed and implemented a real-time ETL pipeline for streaming transactional data from multiple source systems (ERP, CRM) ensuring minimal latency. Incorporated data validation rules and quality checks to ensure data accuracy and consistency. Optimized ODI mappings and GoldenGate processes for high throughput and minimal latency, setting up continuous monitoring dashboards.

Oracle Data Integrator (ODI)Oracle GoldenGateSQLPL/SQL

Key outcomes:

  • Achieved a 40% reduction in query processing time through performance optimization.

Snowflake Data Warehouse DevelopmentData Engineer

Overview: Designed and implemented a scalable cloud-based data warehouse solution on Snowflake to support business intelligence and analytics. Responsibilities: Designed and deployed Snowflake data warehouse architecture, optimizing for cost and scalability. Developed and optimized ETL pipelines using Snowflake, Kafka, and Airflow for data ingestion and transformation. Created and maintained star and snowflake schemas for effective querying. Optimized query performance using clustering keys, materialized views, and partitioning, achieving a 40% reduction in query processing time.

SnowflakeKafkaAirflowAWS S3

Key outcomes:

  • Achieved a 40% reduction in query processing time through performance optimization.

9+ years of industry experience

SaaS / B2B1 project
  • Real-Time Data IntegrationSenior ETL DeveloperOracle Data Integrator (ODI) · Oracle GoldenGate · SQL · PL/SQL
Cybersecurity1 project
  • SIEM Data IngestionSenior Data DeveloperPython · ODI · Docker · AWS +2
Telecom1 project
  • Telecommunications Data ProcessingLead DeveloperPython · PySpark · Airflow · Cassandra +2

Ready to work with Naman?

Onboard within 48 hours. No long hiring cycles, no recruiter middleman.

At a Glance

LocationIndore
Experience9+ years
Work moderemote
Direct hirePossible
Start within48 hours
From$2,012/ month

Single contract. Billed in USD.

Typically responds within 4 business hours.

5-day replacement guarantee
48-hour onboarding, single invoice
Direct chat — no recruiter middleman

Top Skills

Python
9/10
PySpark
9/10
Snowflake
9/10
Airflow
8/10
Kubernetes
8/10
Seniority signals
Owns production deploysGreenfield architectSystem ownerCode reviewerMentor / leads juniors
VerifiedVetted by Witarist
Technical skills assessed & verified
Background & identity checked
English communication verified
Ready to onboard in 48 hours

Not sure if this is the right fit?

Tell us your requirements and we'll match you with the best candidates.

Naman

Data Engineer