Kiran Patil is an experienced Big Data Engineer with 5 years of experience in data engineering, specializing in AWS cloud solutions and Hadoop ecosystem. The candidate has a strong background in Spark development, data migration, and warehousing. He has a proven ability to design and implement robust data pipelines, perform data cleansing, ingestion, and transformation using modern big data technologies.
5 years Big Data Engineer with AWS + Hadoop expertise.
Hands-on with AWS S3 + Glue + Lambda + Redshift + EMR + IAM.
PySpark scripting for ETL pipelines.
Hadoop ecosystem — HDFS + Hive + Sqoop + YARN + MapReduce.
Performance — Hive partitioning + bucketing with Parquet/ORC formats.
Successfully implemented AWS-based data warehousing solutions for data ingestion + transformation.
Developed and optimized Spark programs for data cleansing + ingestion using PySpark.
Managed and utilized Hadoop ecosystem (HDFS, Hive, Sqoop, YARN) for robust data handling.
Designed and created Hive tables with partitioning + bucketing for efficient data management.
Performed data validation, quality checks, and processing using aggregation, joins, filters.
Key outcomes:
Successfully implemented an AWS-based data warehousing solution for data migration.
Established automated data pipelines for ingestion, cleaning, and transformation using AWS services.
Ensured data validation during ingestion to AWS S3 landing buckets.
Key outcomes:
Successfully ingested customer data from mainframe systems into a Data Warehouse.
Implemented data validation and quality checks to ensure data integrity.
Optimized Hive table performance through partitioning and bucketing.
Kiran Patil
Big data engineer