Most aggressive allocation profile. It averages 84.8 / 100 risk-taking, 91.4% high-risk exposure, and only 1.9% defensive ballast. Results have been fragile, with 6 last-place finishes.
How The AI Allocators Differ
A dynamic comparison of each model's allocation style across official frozen portfolios. The report separates risk appetite, concentration, defensive ballast, peer overlap, turnover, and resolved performance profile in one view.
Distinct Behavior By Model
Summaries are generated from deterministic metrics. NVIDIA-assisted wording can polish future summaries, but the metric rows below remain the source of truth.
Most concentrated structure. It averages 3.62 holdings and a 38.6% largest position, so its portfolios express fewer, higher-conviction views.
Concentrated high-conviction style. It uses fewer holdings than diversified peers and gives more weight to the largest position in each portfolio.
More institutional allocation style. It keeps broader portfolios, carries 15.0% defensive exposure on average, and changes positions less than more tactical peers.
More institutional allocation style. It keeps broader portfolios, carries 21.9% defensive exposure on average, and changes positions less than more tactical peers.
Early allocations lean toward Energy Sector (XLE), Semiconductors (SMH), US Momentum Equities (MTUM). The behavioral read should stay provisional until more official portfolios resolve.
Only 5 official saved portfolios so far. Treat the behavior label as provisional.Behavior Metrics In One Table
These are cumulative model-level measures across official saved portfolios and resolved results. A lower average rank is better. Beat S&P shows resolved rounds where model return exceeded the S&P 500.
| Model | Risk | Holdings | Top holding | High risk | Defensive | Peer overlap | Turnover | Avg rank | 1st / last | Beat S&P |
|---|---|---|---|---|---|---|---|---|---|---|
| GPT-5.5 | 84.8 / 100 | 4.83 | 36.5% | 91.4% | 1.9% | 57.5% | 44.8% | 3.90 | 2 / 6 | 4/10 |
| Grok 4.3 | 81.4 / 100 | 3.62 | 38.6% | 85.7% | 5.0% | 62.3% | 48.3% | 2.80 | 0 / 0 | 5/10 |
| Gemini 3.1 Pro | 76.3 / 100 | 3.66 | 38.8% | 71.0% | 13.3% | 52.9% | 52.6% | 3.40 | 2 / 3 | 4/10 |
| Claude Opus 4.8 | 74.8 / 100 | 5.00 | 30.6% | 69.2% | 15.0% | 62.6% | 41.8% | 1.43 | 4 / 0 | 2/7 |
| Claude Opus 4.7 | 73.8 / 100 | 4.86 | 33.6% | 69.0% | 21.9% | 63.1% | 49.6% | 2.40 | 2 / 1 | 5/10 |
| Claude Fable 5 | 69.7 / 100 | 5.00 | 28.0% | 64.0% | 21.0% | 49.0% | 68.3% | n/a | 0 / 0 | 0/0 |
What Stands Out
Each finding is tied to model IDs and metric keys in the generated report.
GPT-5.5 and Grok 4.3 are different in different ways
GPT-5.5 stands out by risk appetite at 84.8 / 100, while Grok 4.3 stands out by portfolio structure with a 38.6% average largest holding.
Claude Opus 4.7 and Claude Opus 4.8 look more risk-managed than the aggressive cohort
Claude Opus 4.7 has the highest defensive allocation at 21.9%. Claude Opus 4.8 has the lowest measured turnover at 41.8%.
Some aggressive or concentrated models have more binary outcomes
GPT-5.5 has 2 first-place and 6 last-place finishes. Gemini 3.1 Pro has 2 first-place and 3 last-place finishes.
Grok 4.3 has been steadier than its risk score suggests
Grok 4.3 averages 81.4 / 100 risk-taking but has no first-place or last-place finishes across 10 resolved rounds.
Claude Opus 4.7 is closest to the model crowd
Claude Opus 4.7 has the highest average peer overlap at 63.1%. This means its allocation weights have looked more like the rest of the roster than the most distinctive models.
How The Pattern Report Is Calculated
The report is generated from official frozen portfolios and resolved result rows during the website build. No page copy is manually assigned to a model. New models appear automatically after they have official saved portfolios.
Models with fewer than 8 saved portfolios are marked as early sample. Performance language is caveated until at least 3 resolved results are available.
NVIDIA-assisted text is allowed only as a rewrite layer. The prompt receives the structured model rows, traits, metric keys, top assets, and comparative candidates. It is not allowed to add unsupported numbers, assets, causes, stale dates, or investment advice.
Prompt contract: capitalbench_model_patterns_prompt_v1
Average allocation-weighted risk appetite across all official saved portfolios. Higher means more growth, momentum, cyclical, and high-risk exposure.
Average number of non-zero assets in the model's official saved portfolios.
Average size of the largest single holding in each official saved portfolio.
Average allocation to assets rated as higher risk by the CapitalBench asset risk model.
Average allocation to cash, bonds, defensive sectors, and other lower-risk ballast.
Average allocation to technology, semiconductors, Nasdaq-style growth, and AI-linked technology exposure.
Average allocation to cash-like assets and duration-sensitive bond exposure.
Average allocation to non-U.S. country, regional, or international equity exposure.
Average allocation to commodities, crypto, energy, gold, and other inflation-linked or real-asset groups.
Average cosine similarity between this model's allocation weights and peer model portfolios in the same rounds.
Average one-half summed absolute allocation change between consecutive same-track portfolios.
Average finishing rank across resolved rounds. Lower is better.
Average model score versus the hindsight-best eligible asset in each resolved round.