Anthropic's BioMysteryBench: Claude Solves 30% of Expert-Stumping Problems
Anthropic's BioMysteryBench tested Claude on 99 bioinformatics problems; latest models solved ~30% of expert-stumping cases in open-ended research.
April 30, 2026