Benchmark Sets

Monthly Current Benchmark

May 28, 2026 roster

Monthly comparison set that adds Claude Opus 4.8 to the established model roster.

21 shared rounds 5 models Met current threshold

Jul 8, 2026 roster

Weekly comparison set that starts when Grok 4.5 joins the weekly benchmark roster.

8 shared rounds 7 models Met 6+ threshold

Jun 9, 2026 roster

Weekly comparison set that starts when Claude Fable 5 joins the weekly benchmark roster.

15 shared rounds 6 models Met 6+ threshold

May 28, 2026 roster

Weekly comparison set that adds Claude Opus 4.8 to the established model roster.

33 shared rounds 5 models Met 6+ threshold

May 24, 2026 roster

Original weekly comparison set for the first four CapitalBench models.

35 shared rounds 4 models Met 6+ threshold

Monthly Qualified

May 10, 2026 roster

Original monthly comparison set for the first four CapitalBench models.

24 shared rounds 4 models Met 3+ threshold

Weekly Forming

Jul 21, 2026 roster

Weekly comparison set automatically opened when the Jul 21 official roster first required a new equal-run benchmark group across 7 models.

3 shared rounds 7 models 3/6 3 more to qualify

Monthly Forming

Jun 9, 2026 roster

Monthly comparison set that starts when Claude Fable 5 joins the monthly benchmark roster.

2 shared rounds 6 models 2/3 1 more to qualify

Weekly Waiting

Jul 24, 2026 roster

Weekly comparison set automatically opened when the Jul 24 official roster first required a new equal-run benchmark group across 8 models.

0 shared rounds 8 models 0/6 6 more to qualify

Jul 24, 2026 roster

Monthly comparison set automatically opened when the Jul 24 official roster first required a new equal-run benchmark group across 8 models.

0 shared rounds 8 models 0/3 3 more to qualify

Jul 21, 2026 roster

Monthly comparison set automatically opened when the Jul 21 official roster first required a new equal-run benchmark group across 7 models.

0 shared rounds 7 models 0/3 3 more to qualify

Jul 10, 2026 roster

Monthly comparison set automatically opened when the Jul 10 official roster first required a new equal-run benchmark group across 8 models.

0 shared rounds 8 models 0/3 3 more to qualify

Jul 8, 2026 roster

Monthly comparison set that starts when Grok 4.5 joins the monthly benchmark roster.

0 shared rounds 7 models 0/3 3 more to qualify