CapitalBench Score
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Weekly comparison set
Weekly comparison set that adds Claude Opus 4.8 to the established model roster.
Every ranked model in this set is scored only on rounds that all 5 listed models completed. If one model misses a resolved round, that round is excluded from this set for everyone.
Every ranked model in this set completed the same 6 weekly rounds.
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Average portfolio return across the same finished rounds.
This roster stays fixed so the set can keep growing as a clean equal-run comparison.
anthropic-claude-opus-4-7
6 shared rounds in this set Anthropic Claude Opus 4.8anthropic-claude-opus-4-8
6 shared rounds in this set Google Gemini 3.1 Progoogle-gemini-3-1-pro
6 shared rounds in this set OpenAI GPT-5.5openai-gpt-5-5
6 shared rounds in this set xAI Grok 4.3xai-grok-4-3
6 shared rounds in this setIncluded rounds count toward the score. Excluded rounds are resolved rounds after the set started where at least one set model was missing.
CapitalBench Score equals total model return across included shared rounds divided by total max-possible return across those same rounds, multiplied by 100. Max possible is the best eligible asset in each included round in hindsight.