Trust From Regret¶
We frame “trust” as calibrated expectation of low future regret enabled by near‑zero‑latency internal feedback and bounded updates.
Definition (Operational)¶
Trust rises when recent decisions show: (i) low regret at fixed λ, (ii) good deferral quality on ambiguous cases, (iii) calibrated uncertainty, (iv) stable corrections after contradiction signals, and (v) constraint compliance.
We measure these via the Control‑of‑Error Index (CEI) and Control KPIs.
Instantaneous Feedback → Faster Correction¶
Compitum emits endogenous, judge‑free signals at decision time: feasibility, boundary ambiguity (gap/entropy/uncertainty), and drift/trust‑radius state.
A Lyapunov‑inspired trust‑region caps update step sizes, turning contradiction signals into gentle, stable adjustments of the learned SPD metric.
This reduces correction latency and noise vs. delayed judge schemes, supporting lower future regret.
Evidence (What to Report)¶
Fixed‑WTP performance: regret and win‑rate vs. best baseline at λ ∈ {0.1, 1.0}; bootstrap CIs.
CEI components (from existing CSVs/certificates):
Deferral quality (boundary vs. high‑regret): AP, AUROC.
Calibration: Spearman ρ(uncertainty, |regret|) and reliability curve.
Stability: Spearman ρ(shrink in trust‑radius vs. future regret decrease).
Compliance: feasible rate ≈ 1.
Control KPIs: trust‑radius event counts (shrink/expand/steady), r summary stats, shrink→improve correlation.
Helper Commands¶
CEI report
python tools\analysis\cei_report.py ^
--input data\rb_clean\eval_results\<latest-compitum-csv>.csv ^
--out-json reports\cei_report.json ^
--out-md reports\cei_report.md
Reliability curve
python tools\analysis\reliability_curve.py ^
--input data\rb_clean\eval_results\<latest-compitum-csv>.csv ^
--bins 10 ^
--out-csv reports\reliability_curve.csv ^
--out-md reports\reliability_curve.md ^
--out-png reports\reliability_curve.png
Control KPIs
python tools\analysis\control_kpis.py ^
--certs reports\certificates.jsonl ^
--eval data\rb_clean\eval_results\<latest-compitum-csv>.csv ^
--out-json reports\control_kpis.json ^
--out-md reports\control_kpis.md
Notes¶
“Lyapunov‑inspired” emphasizes bounded, stabilizing updates without claiming a formal proof.
Approximate shadow prices are report‑only diagnostics; selection is feasibility‑first argmax U.
Coherence prior is bounded (clipping, small β_s); conclusions robust to modest β_s changes.