Meet Hugo
Data‑driven Technical Program Manager driving AI response quality and scalable program frameworks for Copilot.
Senior TPM Manager at Microsoft AI
Former Technical Lead of Azure AI Benchmarking Team
Author of technical articles with 150K+ cumulative reads, featured in VentureBeat, HPCwire, and Signal65
MS, Applied Physics, Quantum Institute, Yale University
Resume (PDF)
Portfolio
⚡ Industry-Leading Inference Performance
- Delivered 865,000 tokens/sec and 1.1M tokens/s of real-world inference throughput with respectively the NVIDIA NVL72 GB200 GPUs and the NVIDIA NVL72 GB300 GPUs Azure software stack - highlighted by Satya Nadella (CEO, Microsoft) and Jensen Huang (CEO, NVIDIA) at Microsoft Build 2025 and Microsoft Ignite 2025.
- Achieved top-ranking MLPerf Inference submissions across multiple hardware architectures (GB200, H200, H100, A100, A10).
🧠 Record-Breaking AI Training at Scale
- Led the first supercomputer-scale (10k+ H100 GPUs) submission in MLPerf Training demonstrating end-to-end scaling of transformer models - highlighted by Satya Nadella (CEO, Microsoft) at Microsoft Ignite 2023.
- Reduced the cost performance of AI LLM Training by 28% with NVIDIA H200 GPUs - highlighted by Satya Nadella (CEO, Microsoft) at Microsoft Ignite 2024.
🌐 Pioneering AI Developements in the Cloud
- Executed the first-ever public proof of concept for training Large Language Models (GPT-3, 530B parameters) at scale in the cloud using NVIDIA NeMo framework - catalyzing the global rush for AI infrastructure, including OpenAI's move to Azure - highlighted by Satya Nadella (CEO, Microsoft) at Microsoft Build 2022.
- Proved on-par performance between cloud and on-premises infrastructures across 10 foundation models — <5% variance at scale.
🔍 Industry Benchmarking Standards
- Spearheaded the Azure AI Benchmarking Guide, a widely-referenced validation suite setting performance baselines of infrastructures (NVIDIA and AMD hardware) for AI production workloads.
Current open positions
Latest content
Latest publication: Breaking the Million-Token Barrier: The Technical Achievement of Azure ND GB300 v6
Azure High Performance Computing Blog - Nov 3, 2025
Latest video: Latest Infrastructure Trends for Inference
Hallway Track interview series
