À propos de Lucas
- Platform administration, development & FinOps.
- Integrating latest Databricks features at scale.
- Developers tooling & support & standardization.
- Large-scale data pipeline architecture.
Français
Bilingue ou natif
Anglais
Capacité professionnelle complète
Expériences
- DecathlonLead Data EngineerGRANDE DISTRIBUTIONjanvier 2023 - Aujourd'hui (3 ans et 5 mois)Paris, FranceOperating as a Lead Data Engineer of a team of 6 Data Engineers within the central Data Platform team, supporting and scaling a Databricks-based ecosystem on AWS used by 3,500+ data professionals.
- Provide advanced support to Data Engineers, BI Engineers, and Data Scientists (debugging, performance optimization, architecture guidance).
- Contributed to the design, governance, and reliability of the data platform.
- Designed and rolled out reusable project templates (Spark, dbt, Airflow, AWS Lambda, EKS) using Cookiecutter, Cruft, Poetry, GitHub Actions, and SonarCloud, enabling standardized and industrialized data development across teams.
- Improved Databricks infrastructure standardization using Terraform at scale through Hashicorp Entreprise, improving reproducibility, governance and evolution of platform resources.
- Led the technical migration from AWS Glue to Databricks Unity Catalog, improving data governance, security, and cross-team data accessibility.
- Implemented platform monitoring, alerting, and FinOps practices to perfect usage and control costs.
Technical Stack: Databricks, AWS, Apache Spark, dbt, Apache Airflow, AWS Glue, Unity Catalog, Spark Declarative Pipelines, Lakeflow Jobs, AWS EKS, AWS Lambda, Github Actions, Data Contracts. - Autorité des marchés financiers (AMF) – FranceSenior Data EngineerBANQUE & ASSURANCESjuillet 2021 - janvier 2023 (1 an et 6 mois)Paris, FranceJoined the cross-functional DataFab team within the Data & Market Surveillance Department, offering technical ability to support data analysts and data preparation workflows.
- Improved the computation time of a cumulative hypergeometric distribution using Apache Spark (Scala) from 7h to 45min by forking the class HypergeometricDistribution from Apache Commons Math, significantly improving performance of the pipeline.
- Reduced daily data ingestion time into HBase via Phoenix from 6 hours to 1 hour through performance tuning and process optimization.
- Integrated analyst-developed market surveillance and alerting tools into a unified software framework to enhance maintainability and scalability, including advanced analytics functions (machine learning models), enabling easier adoption by analyst teams.
- Designed and implemented data pipelines for publishing open datasets to Data.gouv, ensuring compliance with open data standards.
- Defined data architecture for integrating annual financial reports of French companies into the AMF data platform.
Technical Stack: Apache Spark, Apache Hive, Hadoop ecosystem, Python, Scala, Java - European Security Market Authority (ESMA)Big Data EngineerBANQUE & ASSURANCESavril 2020 - juin 2021 (1 an et 2 mois)Paris, FranceProvided consulting and optimization ability for data processing workflows related to European banking regulations within the Supervision & Data Analytics Systems team. Operated in an international environment with English as the primary working language.
- Designed and implemented data ingestion and transformation pipelines to convert XML regulatory files into structured CSV formats for EMIR, SFTR, and SECR (European Union financial regulations).
- Defined deployment methodologies and built automated CI/CD pipelines for data processing workflows.
- Developed, supported and monitored automated data pipelines using Talend, MySQL, and TIBCO Spotfire.
- Built a scalable XML-to-CSV mapping framework in Java using Altova MapForce, standardizing transformation logic across regulatory datasets.
- Optimized processing of a 19TB Oracle Database table via partitioning, indexing, and stored procedures, and reduced query latency by implementing caching strategies in TIBCO Data Virtualization.
- Improved performance and reliability of recurring analytical scripts used by ESMA analysts, implementing monitoring and optimization best practices.
Technical Stack: Oracle Database, Talend, Altova MapForce, TIBCO Spotfire, TIBCO Data Virtualization, Python, Java, SQL.
Recommandations
Soyez le premier à recommander Lucas
Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.
Ces profils de freelance correspondent également à vos critères
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Formations
- Diplôme d'ingénieur, Technologies de l'informationIMT NORD EUROPE2016Diplôme d'ingénieur, Technologies de l'information
- Diplôme d'ingénieur, Technologies de l'informationСанкт-Петербургский государственный университет2015Diplôme d'ingénieur, Technologies de l'information