prateek_715

Prateek T

@prateek_715

Data Engineer

India
English, Hindi
About me
I am a Data Engineer with hands-on experience in PySpark, Kafka, Python, SQL, and the Hadoop ecosystem. Currently, I build large-scale data pipelines and ETL workflows at Infosys, focusing on medallion architecture and Spark optimization. I have a strong foundation in ML-powered data products and experience taking projects from EDA to deployed APIs....

Skills

Average response time: 1 hour

Check out my services

Formulas and macros
I will solve your Excel problems

Work experience

Infosys

Data Engineer

Infosys • Full-time

Sep 2025 - Present • 9 mos

Deployed on the Databricks platform; helped build production pipelines processing daily datasets of 2–9 GB (7–12 million rows). Designed schema transformations for a medallion architecture, engineered PySpark optimizations (partition pruning, shuffle hash joins, broadcast joins), and implemented data serialization tuning; these optimizations reduced job execution time by up to 20% in some pipelines. Led data quality validation, schema design improvements, and schema evolution to accommodate upstream data changes; worked cross-functionally with the team lead and senior engineers on parallelism optimization strategies.