Benchmark Forming

Fairness Scope

Every ranked model in this set is scored only on rounds that all 4 listed models completed. If one model misses a resolved round, that round is excluded from this set for everyone.

All comparison sets
Shared rounds1 Models4 Threshold3 StatusBenchmark Forming
Equal-run benchmark

Monthly Benchmark Forming

Every ranked model in this set completed the same 1 monthly rounds.

1 shared resolved rounds4 equal-run models ranked2 more shared rounds to qualifyNewest included round: CB-2026-05-10-1M
Shared resolved rounds

CapitalBench Score

Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.

Claude Opus 4.7
Gemini 3.1 Pro
GPT-5.5
Grok 4.3
S&P 500
Max possible hindsight best asset
Claude Opus 4.7 Anthropic · 1/1 scored round
11.8
Gemini 3.1 Pro Google · 1/1 scored round
11.8
GPT-5.5 OpenAI · 1/1 scored round
11.8
Grok 4.3 xAI · 1/1 scored round
11.8
S&P 500 S&P 500 · 1/1 scored round
-25.3
Max possible Hindsight best-performing eligible asset in each round, not a model portfolio
100.0
1 shared resolved rounds4 equal-run models ranked2 more shared rounds to qualifyNewest included round: CB-2026-05-10-1M
Return context

Average Return Details

Average portfolio return across the same finished rounds.

Return leader Claude Opus 4.7 0.77%
Anthropic Claude Opus 4.7
0.77%
Google Gemini 3.1 Pro
0.77%
OpenAI GPT-5.5
0.77%
xAI Grok 4.3
0.77%
S&P S&P 500
-1.65%
MAX Max possible
6.52%
Leader audit Claude Opus 4.7 11.8 = 0.77% total return / 6.52% oracle return × 100.
Rounds included: CB-2026-05-10-1M Fairness rule: every ranked model completed every included round. A missed round is excluded from this set for everyone. Forming: this set becomes the Current Monthly Benchmark at 3 shared resolved rounds.
Round audit

Included And Excluded Rounds

Included rounds count toward the score. Excluded rounds are resolved rounds after the set started where at least one set model was missing.

2 more to qualify
Included rounds CB-2026-05-10-1M
Excluded for fairness None
Calculation

How The Score Is Calculated

CapitalBench Score equals total model return across included shared rounds divided by total max-possible return across those same rounds, multiplied by 100. Max possible is the best eligible asset in each included round in hindsight.

Scoring details