You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Guillaume GeoffroyGG

Guillaume Geoffroy

Data Engineer | Databricks | PySpark | CI/CD Azure

€750/day
Lausanne, CH
3-7 years

Average response time: 1 hour

About Guillaume

Data Engineer PySpark & Cloud | Scalable Data Pipelines

I help companies design, build, and optimize robust and scalable data pipelines in cloud environments.

Specialized in PySpark, Databricks, and Airflow, I support end-to-end data platform projects from architecture to production deployment.


Core expertise:

ETL/ELT pipelines (PySpark, Databricks, Airflow)
Cloud data platforms (Azure, AWS, GCP)
Data lakehouse architecture (Delta Lake)
CI/CD, Terraform, and DevOps practices
Data quality & pipeline monitoring
Spark performance optimization
Github Copilot Integration

Experience:
5 years working on large-scale data projects in energy, aerospace, and industrial environments.


Focus:

Reliable, scalable, production-ready data systems with strong engineering standards.

Available remotely or in Switzerland / France.
  • French

    Native or bilingual

  • English

    Fluent

  • Spanish

    Conversational

Can work on-site
Lausanne (up to 50km), Geneva (up to 50km)

Experience

  • Engie - V.I.E
    Data Engineer
    ENERGY AND UTILITIES
    June 2025 - Today (1 year)
    Brussels, Belgium

    Designed and managed data pipelines for energy consumption and billing data.


    Developed a Python library to structure ETL workflows, including complex PySpark transformations on time-series data. Worked on VSCode with Github Copilot.

    Industrialized and managed Databricks jobs, with scheduling through Apache Airflow and data storage on S3 using Delta Lake format.

    Orchestrated CI/CD with Azure DevOps and IaC deployments with Terraform across Databricks environments (dev, preprod, prod).

    Integrated GitHub Copilot into the development workflow for code generation, refactoring, and pull request review support.

    Built a Data Quality framework within the library, implementing checks for duplicates, overlaps, and completeness. Used Docker Image for unit testing / functional testing.

    Performed data analysis and developed dashboards with Databricks.
    Databricks Docker PySpark Azure DevOps Terraform
  • Terra Systema
    CDD Data Scientist
    AGRICULTURE
    May 2024 - July 2024 (2 months)
    Molsheim, France

    Analyzed weather sensor data to anticipate late frost events.

    Led the project autonomously, coordinating with multiple stakeholders.

    Analyzed time-series data from weather sensors and developed solutions on Linux using Python (Pandas, Matplotlib, TensorFlow) and MySQL.

    Designed a Proof of Concept and built a Deep Learning model (CNN/LSTM) to
    estimate dew point at parcel level.
    Python MySQL Deep Learning
  • Cs Group
    CDI Data Engineer
    AVIATION AND AEROSPACE
    June 2021 - April 2023 (1 year and 10 months)
    Toulouse, France

    Predicted aircraft failures for Airbus and airline operators.

    Filtered, analyzed, and visualized multi-source aircraft sensor data, including model development and alert monitoring.

    Developed a Python library dedicated to model development, built on complex
    PySpark transformations.

    Industrialized Big Data models using internal DevOps tools within a continuous
    integration framework.

    Used the internal CodeWorkbook ETL for model prototyping and validation.
    Python Spark ETL GitHub DevOps

Recommendations

Be the first to recommend Guillaume

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Final-year exchange
    Université Laval
    2020
    Final-year exchange, Specialization in Machine Learning and Advanced Python
  • Engineering degree
    SUPMICROTECH-ENSMM
    2020
    Computer Sciences

Skill set

Categories