You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Bertrand CabotBC

Bertrand Cabot

AI/HPC Engineer

€900/day
Orsay, FR
8-15 years

Average response time: 1 hour

About Bertrand

J'aide les équipes IA à maîtriser leurs systèmes : optimisation, benchmark, qualification.
Entraînement LLM, pipelines d'inférence, calcul scientifique, comparaison de stacks ou de hardware : la question centrale est toujours la même — est-ce que le système fait ce qu'il devrait, et comment le prouver chiffres en main ?
Mon approche combine deux expertises rares en France :
→ Optimisation GPU bas niveau : profilage PyTorch / Nsight, identification des régimes (math-bound, memory-bound, overhead-bound, SFU-bound), tuning FSDP / DeepSpeed / vLLM, parallélismes 3D, kernels Triton.
→ Benchmark et qualification système : conception et exécution de campagnes de tests sur infrastructure IA (training, inférence, storage, multi-nœuds), définition des métriques (TTFT, ITL, throughput, scaling efficiency), comparaisons de stacks (PyTorch vs JAX, vLLM vs TensorRT-LLM, FSDP vs DeepSpeed), évaluation LLM (Lighteval, MLPerf-style).
Mon parcours : 5 ans au CNRS/IDRIS sur le supercalculateur Jean Zay (H100/A100/V100, InfiniBand, Lustre, SLURM). Support expert sur des projets nationaux : pretraining BLOOM, OpenLLM France, Scribe, leaderboard INESIA, collaboration Hugging Face. 20 ans d'ingénierie dont 10+ en Deep Learning et HPC.
Ce sur quoi on peut travailler ensemble :
→ Audit performance GPU (5-10 jours) : diagnostic complet, identification des goulots, plan d'optimisation chiffré
→ Optimisation entraînement / inférence LLM (mission longue) : FSDP, DeepSpeed, 3D parallelism, vLLM, TensorRT-LLM
→ Campagne de benchmark / qualification système (mission longue) : conception du test plan, exécution sur cluster, rapports techniques pour audience NVIDIA / clients / décideurs
→ Évaluation LLM et IA de confiance : méthodes d'évaluation, hallucination analysis, benchmarks reproductibles, contribution à des leaderboards
→ Formation / transfert de compétences : Deep Learning distribué, profiling GPU, usage HPC
  • French

    Native or bilingual

  • English

    Fluent

Can work on-site
Orsay (up to 50km)

Experience

  • IDRIS
    | AI/HPC Support Engineer
    June 2020 - December 2025 (5 years and 6 months)
    91400 Orsay, France
    • ● Provided advanced consulting and optimization support for AI workloads running on the Jean Zay supercomputer (850 compute nodes, SLURM scheduling, multi-generation GPU fleet: V100, A100, H100, B200; high-speed interconnects: Omni-Path 100, InfiniBand, NVLink).
    • ● Assisted large-scale flagship projects running on Jean Zay, including BIG Science (2022) – BLOOM pretraining, OpenLLM (2024–2025), and Scribe (2025).
    • ● Designed, led, and delivered the "Optimized Deep Learning on Jean Zay" training program (2022–2025), covering Dataloader optimization, GPU computing fundamentals, DDP, FSDP, 3D Parallelism (TP/PP/DP), and DeepSpeed.
    • ● Active contributor and presenter on the FIDLE YouTube channel, a CNRS-backed academic reference for self-learning Deep Learning (CNRS, MIAI, EFELIA).
    • ● Developed the National Coordination / INESIA Leaderboard for AI Summit 2025, in collaboration with Hugging Face teams.
    • ● Authored and published a poster for the GENCI cluster user community and conference audience. (2025): "Democratizing LLMFine-Tuning"
    • ● Contributed to the long-term sustainability of the ChallengeData platform operated by Stéphane Mallat, supporting data science competitions
    Pytorch vllm LLM Training profiling HPC
  • ASTEK (SNCF Innovation Program)
    Data Scientist / Research Engineer
    September 2017 - February 2020 (2 years and 5 months)
    • ● Processed and analyzed > 1 TB of geometric data from IRIS320 measurement trains.
    • ● Built predictive ML models for track wear and equipment fault detection.
    • ● Deployed ML pipelines for railway maintenance optimization.
    Machine learning Big Data
  • BEKK Studio
    Computer Vision & Mobile AI Developer
    July 2015 - September 2017 (2 years and 2 months)
    • ● Developed Android applications using Computer Vision, NLP, and Deep Learning.
    • ● Created experimental reinforcement learning trading bots and video AI apps with TensorFlow Lite.
    Computer Vision Deep Learning

Recommendations

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Engineering Degree
    ISEP (Institut Supérieur d'Électronique de Paris)
    2005
    Engineering Degree
  • Deep Reinforcement Learning Engineer
    2016
    Deep Reinforcement Learning Engineer

Skill set

Categories