Understanding the Overestimation Index
Δ shows when self-ratings and reviewer scores drift apart.
Formula
Δ = self-rating − scored performance
Capture self-ratings immediately after each run and subtract the reviewer score to reveal confidence gaps.
Thresholds
- OK (<5) · Healthy calibration.
- Watch (5–15) · Coach soon, gather more evidence.
- High (>15) · Freeze runs and reset expectations.
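The formula and threshold bands above can be sketched in a few lines. This is a minimal illustration, not an official API: the function names and the handling of exact boundary values (5 and 15) are assumptions.

```python
def overestimation_delta(self_rating: float, reviewer_score: float) -> float:
    """Overestimation Index: Δ = self-rating − scored performance."""
    return self_rating - reviewer_score

def band(delta: float) -> str:
    """Map Δ onto the threshold bands; boundary handling here is an assumption."""
    if delta < 5:
        return "OK"       # Healthy calibration.
    if delta <= 15:
        return "Watch"    # Coach soon, gather more evidence.
    return "High"         # Freeze runs and reset expectations.

print(band(overestimation_delta(7, 6)))  # → OK
```

A positive Δ means the person rated themselves above the reviewer's score; a negative Δ means underconfidence.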
Δ quick reference
Track Δ per run so confidence stays aligned with reviewer scores.
- PM self-rates 7, reviewer 6 → Δ = +1 (OK).
- SWE self-rates 9, reviewer 5 → Δ = +4 (just under Watch; coach before the gap widens).
How to improve
- Use timed, double-run tasks (manual vs. AI) to expose real lift.
- Log micro-TLX after every run to surface fatigue.
- Document prompt tweaks and reviewer time so comparisons stay honest.
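One way to keep the habits above honest is a per-run log that captures the self-rating, reviewer score, micro-TLX, reviewer time, and prompt tweaks in one record. A minimal sketch; the `RunLog` name and every field name are illustrative assumptions, not a fixed schema.

```python
from dataclasses import dataclass, field

@dataclass
class RunLog:
    """One record per run; field names are illustrative, not a fixed schema."""
    task: str
    mode: str                     # "manual" or "ai"
    self_rating: float            # captured immediately after the run
    reviewer_score: float
    micro_tlx: int                # post-run workload rating
    reviewer_minutes: float       # verification time spent by the reviewer
    prompt_tweaks: list[str] = field(default_factory=list)

    @property
    def delta(self) -> float:
        """Δ = self-rating − scored performance."""
        return self.self_rating - self.reviewer_score

run = RunLog("refactor-auth", "ai", 9, 5, 14, 30, ["added output format hint"])
print(run.delta)  # → 4
```

Logging both modes of a double-run task as two `RunLog` records makes the manual-vs-AI lift directly comparable.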
Calibrate
Run a Fair Trial: counterbalance order, fix the timebox, and keep the rubric identical.
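Counterbalancing order means alternating which condition each participant sees first, so order effects cancel out across the group. A minimal sketch; the alternating even/odd assignment rule and the function name are assumptions.

```python
def fair_trial_orders(participants: list[str]) -> dict[str, tuple[str, str]]:
    """Alternate manual-first and AI-first so order effects cancel across the group."""
    orders = [("manual", "ai"), ("ai", "manual")]
    return {p: orders[i % 2] for i, p in enumerate(participants)}

print(fair_trial_orders(["p1", "p2", "p3"]))
# p1 runs manual first, p2 runs AI first, p3 runs manual first
```

The timebox and rubric stay fixed across both runs; only the order varies.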
Constrain
Use documented prompt scaffolds and note every tweak so variance stays observable.
Cross-check
Compare reviewer verification time + defects against the AI Ethics and workload guides.
Next Steps
- Try the Interactive Demo · Experience a real evaluation with sample tasks.
- SWE Quickstart Guide · Role-specific guide for your first week.
Or explore our methodology to understand the science behind the measurements.
Open Beta
Help steer the Open Beta with real Δ and TLX tiles.
Run the analyzer demo, share methodology notes with your team, and send us benchmarks so the release ships with proof, not hype.