
Ismael Zang
Data Engineer
Skills

Check out my services

Portfolio
Work experience
Data Engineer & CEO
IML MONEY
Apr 2022 - Present • 4 yrs 1 mo
UIF Framework
Tools: Python, SQL, Spark, SAP HANA, Azure Data Factory, Snowflake, APIs
Developed an ingestion platform that consolidates data from multiple sources for a major consumer goods company. Enhanced automation, reducing manual workload for business-as-usual teams by 30%. Unified diverse datasets so that teams across the organization can access and use them.
Big Data Engineer
Azira
Jul 2024 - Jan 2025 • 6 mos
Tourism Insights: Actionable Data for Strategic Marketing
Tools: Spark, AWS (EMR, S3, EC2), Python, Airflow
Built a data pipeline generating dynamic, client-specific tourism reports, driving a 15% annual revenue increase. Used APIs, EMR, S3, PySpark, Python, Bash, and Airflow to automate over 200 daily reports, improving accessibility and reducing manual work by 70%. Parameterized processes with YAML/JSON for multi-client customization.
Powerful Reporting Pipelines
Tools: Spark, Python, Bash, AWS (EMR, S3, EC2), Airflow
Developed Airflow pipelines processing hundreds of terabytes of data. Designed scalable AWS-based solutions for global enterprises. Optimized pipelines, reducing costs and operational overhead.
Migration from Hadoop to PySpark
Tools: Spark, Python, SQL, Hive, AWS (EMR, EC2, S3)
Migrated MapReduce models to Spark, improving performance and lowering compute costs. Applied advanced Spark optimizations (dynamic partition pruning, broadcasting, caching, hashing).
Junior Data Engineer
TCS
Nov 2023 - Jun 2024 • 7 mos
Organisation-wide Financial Reporting
Tools: Spark, Python, Scala, SQL, Azure (Databricks, Data Factory, VMs, ADLS Gen2)
Built a consolidated dashboard for CXOs/VPs with analytics from 7 business units. Integrated data from SQL Server instances, Excel files, and CSVs into ADLS Gen2 via Azure Data Factory. Used Databricks (PySpark, Scala, Spark SQL) for processing and loaded the results into Azure SQL Server. Created numerous dimension and fact tables, views, and stored procedures for dynamic Power BI reporting.
Data Migration Project
Tools: Python, Azure (Data Factory, VMs, ADLS Gen2, Self-Hosted IR)
Migrated 100+ GB of data from VMs to ADLS Gen2 using Azure services. Automated scheduling via Data Factory with a Self-Hosted IR. Designed robust ADLS Gen2 storage to enable scalable processing and reporting.