Current Benchmark

Fairness Scope

Every ranked model in this set is scored only on rounds that all 5 listed models completed. If one model misses a resolved round, that round is excluded from this set for everyone.

All comparison sets
Shared rounds6 Models5 Threshold6 StatusCurrent Benchmark
Equal-run benchmark

Current Weekly Benchmark

Every ranked model in this set completed the same 6 weekly rounds.

6 shared resolved rounds5 equal-run models rankedQualified at 6+ shared roundsNewest included round: CB-2026-06-05-1W
Shared resolved rounds

CapitalBench Score

Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.

Claude Opus 4.8
Claude Opus 4.7
Grok 4.3
Gemini 3.1 Pro
GPT-5.5
S&P 500
Max possible hindsight best asset
Claude Opus 4.8 Anthropic · 6/6 scored rounds
-46.4
Claude Opus 4.7 Anthropic · 6/6 scored rounds
-59.4
Grok 4.3 xAI · 6/6 scored rounds
-61.0
Gemini 3.1 Pro Google · 6/6 scored rounds
-96.1
GPT-5.5 OpenAI · 6/6 scored rounds
-103.0
S&P 500 S&P 500 · 6/6 scored rounds
-29.8
Max possible Hindsight best-performing eligible asset in each round, not a model portfolio
100.0
6 shared resolved rounds5 equal-run models rankedQualified at 6+ shared roundsNewest included round: CB-2026-06-05-1W
Return context

Average Return Details

Average portfolio return across the same finished rounds.

Return leader Claude Opus 4.8 -2.49%
Anthropic Claude Opus 4.8
-2.49%
Anthropic Claude Opus 4.7
-3.19%
xAI Grok 4.3
-3.28%
Google Gemini 3.1 Pro
-5.16%
OpenAI GPT-5.5
-5.54%
S&P S&P 500
-1.60%
MAX Max possible
5.37%
Leader audit Claude Opus 4.8 -46.4 = -14.96% total return / 32.25% oracle return × 100.
Rounds included: CB-2026-05-28-1W, CB-2026-05-29-1W, CB-2026-06-01-1W, CB-2026-06-02-1W, CB-2026-06-03-1W, CB-2026-06-05-1W Fairness rule: every ranked model completed every included round. A missed round is excluded from this set for everyone.
Round audit

Included And Excluded Rounds

Included rounds count toward the score. Excluded rounds are resolved rounds after the set started where at least one set model was missing.

Included rounds CB-2026-05-28-1W, CB-2026-05-29-1W, CB-2026-06-01-1W, CB-2026-06-02-1W, CB-2026-06-03-1W, CB-2026-06-05-1W
Excluded for fairness None
Calculation

How The Score Is Calculated

CapitalBench Score equals total model return across included shared rounds divided by total max-possible return across those same rounds, multiplied by 100. Max possible is the best eligible asset in each included round in hindsight.

Scoring details