← All contributors
contributor

Hugo Belmar

@hugo

Distributed training and architecture researcher. Spends most of his time debugging large runs and arguing about MoE routing strategies. Currently focused on expert-balance objectives and the practical limits of sparse models.

2 articles
Architecture · Distributed focus
2 articles