Description

J'aide les équipes IA à maîtriser leurs systèmes : optimisation, benchmark, qualification.

Entraînement LLM, pipelines d'inférence, calcul scientifique, comparaison de stacks ou de hardware : la question centrale est toujours la même — est-ce que le système fait ce qu'il devrait, et comment le prouver chiffres en main ?

Mon approche combine deux expertises rares en France :

→ Optimisation GPU bas niveau : profilage PyTorch / Nsight, identification des régimes (math-bound, memory-bound, overhead-bound, SFU-bound), tuning FSDP / DeepSpeed / vLLM, parallélismes 3D, kernels Triton.

→ Benchmark et qualification système : conception et exécution de campagnes de tests sur infrastructure IA (training, inférence, storage, multi-nœuds), définition des métriques (TTFT, ITL, throughput, scaling efficiency), comparaisons de stacks (PyTorch vs JAX, vLLM vs TensorRT-LLM, FSDP vs DeepSpeed), évaluation LLM (Lighteval, MLPerf-style).

Mon parcours : 5 ans au CNRS/IDRIS sur le supercalculateur Jean Zay (H100/A100/V100, InfiniBand, Lustre, SLURM). Support expert sur des projets nationaux : pretraining BLOOM, OpenLLM France, Scribe, leaderboard INESIA, collaboration Hugging Face. 20 ans d'ingénierie dont 10+ en Deep Learning et HPC.

Ce sur quoi on peut travailler ensemble :

→ Audit performance GPU (5-10 jours) : diagnostic complet, identification des goulots, plan d'optimisation chiffré

→ Optimisation entraînement / inférence LLM (mission longue) : FSDP, DeepSpeed, 3D parallelism, vLLM, TensorRT-LLM

→ Campagne de benchmark / qualification système (mission longue) : conception du test plan, exécution sur cluster, rapports techniques pour audience NVIDIA / clients / décideurs

→ Évaluation LLM et IA de confiance : méthodes d'évaluation, hallucination analysis, benchmarks reproductibles, contribution à des leaderboards

→ Formation / transfert de compétences : Deep Learning distribué, profiling GPU, usage HPC

Languages

French
Native or bilingual
English
Fluent

Workplace preferences

Can work on-site

Orsay (up to 50km)

IDRIS
| AI/HPC Support Engineer
June 2020 - December 2025 (5 years and 6 months)
91400 Orsay, France
• ● Provided advanced consulting and optimization support for AI workloads running on the Jean Zay supercomputer (850 compute nodes, SLURM scheduling, multi-generation GPU fleet: V100, A100, H100, B200; high-speed interconnects: Omni-Path 100, InfiniBand, NVLink).
• ● Assisted large-scale flagship projects running on Jean Zay, including BIG Science (2022) – BLOOM pretraining, OpenLLM (2024–2025), and Scribe (2025).
• ● Designed, led, and delivered the "Optimized Deep Learning on Jean Zay" training program (2022–2025), covering Dataloader optimization, GPU computing fundamentals, DDP, FSDP, 3D Parallelism (TP/PP/DP), and DeepSpeed.
• ● Active contributor and presenter on the FIDLE YouTube channel, a CNRS-backed academic reference for self-learning Deep Learning (CNRS, MIAI, EFELIA).
• ● Developed the National Coordination / INESIA Leaderboard for AI Summit 2025, in collaboration with Hugging Face teams.
• ● Authored and published a poster for the GENCI cluster user community and conference audience. (2025): "Democratizing LLMFine-Tuning"
• ● Contributed to the long-term sustainability of the ChallengeData platform operated by Stéphane Mallat, supporting data science competitions
Pytorch vllm LLM Training profiling HPC
ASTEK (SNCF Innovation Program)
Data Scientist / Research Engineer
September 2017 - February 2020 (2 years and 5 months)
• ● Processed and analyzed > 1 TB of geometric data from IRIS320 measurement trains.
• ● Built predictive ML models for track wear and equipment fault detection.
• ● Deployed ML pipelines for railway maintenance optimization.
Machine learning Big Data
BEKK Studio
Computer Vision & Mobile AI Developer
July 2015 - September 2017 (2 years and 2 months)
• ● Developed Android applications using Computer Vision, NLP, and Deep Learning.
• ● Created experimental reinforcement learning trading bots and video AI apps with TensorFlow Lite.
Computer Vision Deep Learning

Check out Bertrand's experience

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

Baptiste Duhen

Fullstack developer

4.6

(4)

Amed Hamou

Senior Lead Developer

(2)

Audrey Champion

Web developer

4.3

(3)

Signup to reveal

Engineering Degree
ISEP (Institut Supérieur d'Électronique de Paris)
2005
Engineering Degree
Deep Reinforcement Learning Engineer
2016
Deep Reinforcement Learning Engineer

Check out Bertrand's education

Bertrand Cabot

AI/HPC Engineer

About Bertrand

Experience

Recommendations

These freelancer profiles also match your criteria

Education

Skill set

Categories