Qualified Comparison Set

Fairness Scope

Every ranked model in this set is scored only on rounds that all 4 listed models completed. If one model misses a resolved round, that round is excluded from this set for everyone.

Shared rounds: 8 · Models: 4 · Threshold: 6 · Status: Qualified Comparison Set
Equal-run benchmark

Weekly Qualified Comparison Set

Every ranked model in this set completed the same 8 weekly rounds.

8 shared resolved rounds · 4 equal-run models ranked · Qualified at 6+ shared rounds · Newest included round: CB-2026-06-05-1W
Shared resolved rounds

CapitalBench Score

Max possible = best eligible asset in each included round. Every ranked model has the same included rounds.

Grok 4.3 · xAI · 8/8 scored rounds · -19.3
Claude Opus 4.7 · Anthropic · 8/8 scored rounds · -19.6
GPT-5.5 · OpenAI · 8/8 scored rounds · -40.0
Gemini 3.1 Pro · Google · 8/8 scored rounds · -40.4
S&P 500 · 8/8 scored rounds · -12.3
Max possible · Hindsight best-performing eligible asset in each round, not a model portfolio · 100.0
Return context

Average Return Details

Average portfolio return across the same finished rounds.

Return leader: Grok 4.3, -1.37%
Grok 4.3 · xAI · -1.37%
Claude Opus 4.7 · Anthropic · -1.39%
GPT-5.5 · OpenAI · -2.83%
Gemini 3.1 Pro · Google · -2.86%
S&P 500 · -0.87%
Max possible · 7.08%
Leader audit: Grok 4.3 score of -19.3 = -10.93% total return / 56.68% oracle return × 100.
Rounds included: CB-2026-05-24-1W, CB-2026-05-27-1W, CB-2026-05-28-1W, CB-2026-05-29-1W, CB-2026-06-01-1W, CB-2026-06-02-1W, CB-2026-06-03-1W, CB-2026-06-05-1W
Fairness rule: every ranked model completed every included round. A missed round is excluded from this set for everyone.
Round audit

Included And Excluded Rounds

Included rounds count toward the score. Excluded rounds are rounds that resolved after the set started and that at least one model in the set did not complete.

Included rounds: CB-2026-05-24-1W, CB-2026-05-27-1W, CB-2026-05-28-1W, CB-2026-05-29-1W, CB-2026-06-01-1W, CB-2026-06-02-1W, CB-2026-06-03-1W, CB-2026-06-05-1W
Excluded for fairness: None
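
The include/exclude rule can be sketched in Python as a simple partition of resolved rounds. This is a minimal illustration, not CapitalBench's actual implementation: the names `partition_rounds`, `is_qualified`, and the `completed` mapping are hypothetical, and only the round IDs, the model names, and the 6-round threshold are taken from this page.

```python
from typing import Dict, List, Set, Tuple


def partition_rounds(
    resolved_rounds: List[str],
    completed: Dict[str, Set[str]],
) -> Tuple[List[str], List[str]]:
    """Split resolved rounds into (included, excluded).

    A round is included only if every model in the set completed it;
    otherwise it is excluded for all models.
    """
    included: List[str] = []
    excluded: List[str] = []
    for round_id in resolved_rounds:
        if all(round_id in rounds for rounds in completed.values()):
            included.append(round_id)
        else:
            excluded.append(round_id)
    return included, excluded


def is_qualified(included: List[str], threshold: int = 6) -> bool:
    """Assumed rule: a set qualifies at `threshold` or more shared rounds."""
    return len(included) >= threshold


# The eight round IDs listed on this page.
ROUNDS = [
    "CB-2026-05-24-1W", "CB-2026-05-27-1W", "CB-2026-05-28-1W",
    "CB-2026-05-29-1W", "CB-2026-06-01-1W", "CB-2026-06-02-1W",
    "CB-2026-06-03-1W", "CB-2026-06-05-1W",
]
MODELS = ["Grok 4.3", "Claude Opus 4.7", "GPT-5.5", "Gemini 3.1 Pro"]

# Every model completed every round, as the page states.
completed = {model: set(ROUNDS) for model in MODELS}
included, excluded = partition_rounds(ROUNDS, completed)
print(len(included), excluded, is_qualified(included))  # 8 [] True
```

Under this sketch, a single model missing a single round would move that round to the excluded list for all four models, which is the behavior the fairness rule describes.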
Calculation

How The Score Is Calculated

CapitalBench Score equals total model return across included shared rounds divided by total max-possible return across those same rounds, multiplied by 100. Max possible is the best eligible asset in each included round in hindsight.
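
As a worked check, the formula can be written as a short Python function and applied to the leader-audit figures on this page. The function name `capitalbench_score` is hypothetical, and the average-return line assumes that the average is the total return divided by the number of included rounds, which matches the published figure but is not stated explicitly in the source.

```python
def capitalbench_score(total_model_return: float, total_max_return: float) -> float:
    """Score = total model return / total max-possible return * 100."""
    if total_max_return == 0:
        raise ValueError("total max-possible return must be non-zero")
    return total_model_return / total_max_return * 100


# Totals from the leader audit on this page (percent, across 8 shared rounds).
grok_total = -10.93
oracle_total = 56.68
n_rounds = 8

score = capitalbench_score(grok_total, oracle_total)
print(round(score, 1))  # -19.3

# Assumption: average return = total return / number of included rounds.
print(round(grok_total / n_rounds, 2))  # -1.37
```

A portfolio that matched the hindsight-best asset in every round would score 100.0, which is why the max-possible row is fixed at that value.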
