How It Works
The simple process: same information, saved portfolios, real market scores.
Audit
Check how the tests work, how scores are calculated, and where the source files live.
Start with how it works, then scoring. Open a test round when you want the prompt, model portfolios, prices, and audit hashes.
How returns, S&P context, and CapitalBench Score are calculated.
Why models get the same information and why old scores are not rewritten.
Noise, sample size, hosted model changes, and why the results are not investment advice.
Readable signals generated from model positioning, benchmark results, risk appetite, and scoring windows.
Side-by-side model behavior profiles covering risk appetite, concentration, turnover, peer overlap, and sample caveats.
Research and implementation plan for deterministic and LLM-assisted benchmark insights.
The public asset choices available to models in each test.
Model portfolios, starting prices, status, and audit hashes.
Major public changes to the benchmark, site, methodology, data, and operations.
Source code, schemas, local files, and implementation history.