You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Guillaume GeoffroyGG

Guillaume Geoffroy

Data Engineer | Databricks | PySpark | CI/CD Azure

750 €/jour
Lausanne, CH
3-7 ans

Délai de réponse moyen : 1h

À propos de Guillaume

Data Engineer PySpark & Cloud | Scalable Data Pipelines

I help companies design, build, and optimize robust and scalable data pipelines in cloud environments.

Specialized in PySpark, Databricks, and Airflow, I support end-to-end data platform projects from architecture to production deployment.


Core expertise:

ETL/ELT pipelines (PySpark, Databricks, Airflow)
Cloud data platforms (Azure, AWS, GCP)
Data lakehouse architecture (Delta Lake)
CI/CD, Terraform, and DevOps practices
Data quality & pipeline monitoring
Spark performance optimization
Github Copilot Integration

Experience:
5 years working on large-scale data projects in energy, aerospace, and industrial environments.


Focus:

Reliable, scalable, production-ready data systems with strong engineering standards.

Available remotely or in Switzerland / France.
  • Français

    Bilingue ou natif

  • Anglais

    Capacité professionnelle complète

  • Espagnol

    Capacité professionnelle limitée

Accepte de travailler sur site
Lausanne (jusqu’à 50 km), Geneva (jusqu’à 50 km)

Expériences

  • Engie - V.I.E
    Data Engineer
    ENERGIE
    juin 2025 - Aujourd'hui (1 an)
    Brussels, Belgium

    Designed and managed data pipelines for energy consumption and billing data.


    Developed a Python library to structure ETL workflows, including complex PySpark transformations on time-series data. Worked on VSCode with Github Copilot.

    Industrialized and managed Databricks jobs, with scheduling through Apache Airflow and data storage on S3 using Delta Lake format.

    Orchestrated CI/CD with Azure DevOps and IaC deployments with Terraform across Databricks environments (dev, preprod, prod).

    Integrated GitHub Copilot into the development workflow for code generation, refactoring, and pull request review support.

    Built a Data Quality framework within the library, implementing checks for duplicates, overlaps, and completeness. Used Docker Image for unit testing / functional testing.

    Performed data analysis and developed dashboards with Databricks.
    Databricks Docker PySpark Azure DevOps Terraform
  • Terra Systema
    CDD Data Scientist
    AGROALIMENTAIRE
    mai 2024 - juillet 2024 (2 mois)
    Molsheim, France

    Analyzed weather sensor data to anticipate late frost events.

    Led the project autonomously, coordinating with multiple stakeholders.

    Analyzed time-series data from weather sensors and developed solutions on Linux using Python (Pandas, Matplotlib, TensorFlow) and MySQL.

    Designed a Proof of Concept and built a Deep Learning model (CNN/LSTM) to
    estimate dew point at parcel level.
    Python MySQL Deep Learning
  • Cs Group
    CDI Data Engineer
    AÉRONAUTIQUE & AÉROSPATIALE
    juin 2021 - avril 2023 (1 an et 10 mois)
    Toulouse, France

    Predicted aircraft failures for Airbus and airline operators.

    Filtered, analyzed, and visualized multi-source aircraft sensor data, including model development and alert monitoring.

    Developed a Python library dedicated to model development, built on complex
    PySpark transformations.

    Industrialized Big Data models using internal DevOps tools within a continuous
    integration framework.

    Used the internal CodeWorkbook ETL for model prototyping and validation.
    Python Spark ETL GitHub DevOps

Recommandations

Soyez le premier à recommander Guillaume

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • Final-year exchange
    Université Laval
    2020
    Final-year exchange, Specialization in Machine Learning and Advanced Python
  • Engineering degree
    SUPMICROTECH-ENSMM
    2020
    Computer Sciences

Compétences

Catégories