À propos de Phani
Anglais
Bilingue ou natif
Français
Capacité professionnelle complète
Expériences
- TheFork (Tripadvisor group)Senior Data Engineerjuillet 2023 - mars 2026 (2 ans et 8 mois)Paris, France[Tech stack : Python, Snowflake, AWS, Airflow, DBT, Power BI, Kubernetes, Docker, Terraform, Sifflet]• • Developed ETL, reverse ETL pipelines to integrate new data sources, destinations.• • Enhanced the data mesh implementation and improved the autonomy of partner teams.• • Optimized data pipelines and reduced the data processing time and costs.• • Designed algorithm to check for cycles between multiple airflow DAGs in the project, which is a part of CI checks now and prevents Airflow issues on production.• • Migrated manual SQL(Jinja) transformation jobs to DBT project. Integrated partner teams on to DBT project to make them more autonomous on their requirements.• • Improved S3 storage archival process and optimized costs.• • Integrated Sifflet into our ecosystem for data observability and integrated data quality checks as part of our data pipelines via airflow• • Optimized Snowflake costs using Bluesky, leading to 35 % cost reduction annually.• • Collaborated on POC of data lakehouse architecture with Apache Iceberg• • Integrated Airbyte into our tech stack to optimize data pipelines build, cost and autonomy of stakeholders, which enhances our self-service data platform ideology
- BREVO (ex – SENDINBLUE)Data Engineermai 2021 - juillet 2023 (2 ans et 2 mois)Paris, France[Tech stack : Python, Bigquery, GCP, Airflow, DBT, Power BI, Kubernetes, Docker, Kafka, Terraform, Datadog]• • Lead the cloud migration of pipelines from AWS to GCP. Revamped legacy pipelines to put in place ELT architecture• • Deployed modern data stack tools like DBT, Airflow, Meltano, etc. into our architecture and built data pipelines using them along with tools from GCP.• • Implemented Best practices in DBT, Airflow tools and in Bigquery for optimized and cost efficient Datawarehouse.• • Implemented 'Infrastructure as code' using Terraform to manage cloud resources efficiently.• • Mentored and guided team members and other teams in the company to implement their use cases with DBT and to take autonomy in building analytics dashboards• • Built both batch and streaming data pipelines utilizing tools Dataflow, pub/sub, Airflow and cloud functions to handle processing of more than 500 GB of data every day.• • Worked with Data Scientists to develop and implement machine learning based use cases.• • Developed CI CD pipelines for our GitHub code repositories to be autonomous in code validations and deploying our code in a clean and efficient way.
- AIRCALLData Engineeroctobre 2020 - avril 2021 (6 mois)Paris, France[Tech stack : Python, AWS, Airflow, DBT, Docker, Terraform, Rollbar]• • Orchestrated data pipelines using airflow• • Built, optimized DBT models and made improvements on data modelling in Redshift• • Configured Rollbartool for better management of errors in data pipelines. It enabled our team to actively track and resolve each error• • Optimized layering of docker containers of applications to improve caching and reduced build time on each deployment on AWS ECS.
Recommandations
Ces profils de freelance correspondent également à vos critères
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Formations
- Applied MScDSTI2021Applied MSc
- Bachelor of TechnologySASTRA University2016Bachelor of Technology
Certifications
- HashiCorp Certified: Terraform AssociateHashiCorp2021