CapitalBench Score
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Fair model comparisons
Equal-run rankings for fixed model rosters using only shared completed rounds.
Older sets keep accumulating shared rounds. New official rosters are opened automatically only when no existing set already contains the models in that run. Sets qualify at 6 weekly shared rounds or 3 monthly shared rounds.
Weekly comparison set that adds Claude Opus 4.8 to the established model roster.
Original weekly comparison set for the first four CapitalBench models.
Original monthly comparison set for the first four CapitalBench models.
Weekly comparison set that starts when Claude Fable 5 joins the weekly benchmark roster.
Monthly comparison set that starts when Claude Fable 5 joins the monthly benchmark roster.
Monthly comparison set that adds Claude Opus 4.8 to the established model roster.
No benchmark sets match this filter.
The index above is the source of truth for all sets. These charts show the current weekly set and the best available monthly set.
Every ranked model in this set completed the same 6 weekly rounds.
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Average portfolio return across the same finished rounds.
Every ranked model in this set completed the same 1 monthly rounds.
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Average portfolio return across the same finished rounds.