HuggingFace ml-intern: Autonomous Post-Training Agent Pushes GPQA 10% → 32% in Under 10 Hours
HuggingFace's ml-intern autonomously runs the full ML research-to-training loop, lifting a 1.7B-parameter model's GPQA score from 10% to 32% in under 10 hours and outperforming Codex on HealthBench by 60%.
April 23, 2026