feat(ai-206): add policy-aware score calibration for AI appeal outputs by khaadish · Pull Request #1 · khaadish/soroban-dev-console

khaadish · 2026-06-24T11:54:04Z

Summary

Separates raw model confidence from policy thresholds so fairness, review timing, and risk tolerances can be tuned without rewriting the AI pipeline.

Changes

ScoreCalibrationService — explicit inputs (rawScore, CalibrationPolicy) and outputs (band, action, needsHumanReview, appliedPolicy)
needsHumanReview=true whenever confidence < humanReviewThreshold — hard review boundary
biasCorrectionFactor enables fairness tuning without touching model code
Types exported from @devconsole/api-contracts
Operator reference doc at docs/ai-206-score-calibration.md

Acceptance Criteria

Explicit inputs/outputs, no hidden heuristics
All thresholds in CalibrationPolicy — measurable and safe to tune
needsHumanReview + appliedPolicy enable human inspection and audit replay

Closes Ibinola#423

- ScoreCalibrationService decouples raw model confidence from policy thresholds - CalibrationPolicy exposes approveThreshold, rejectThreshold, humanReviewThreshold, biasCorrectionFactor - needsHumanReview=true whenever confidence < humanReviewThreshold (explicit review boundary) - appliedPolicy + rawScore preserved in every output for audit/replay - Corresponding types exported from @devconsole/api-contracts - Spec covers approve/escalate/reject bands, bias correction, clamping, and traceability Closes Ibinola#423

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ai-206): add policy-aware score calibration for AI appeal outputs#1

feat(ai-206): add policy-aware score calibration for AI appeal outputs#1
khaadish wants to merge 1 commit into
mainfrom
feat/ai-206-score-calibration

khaadish commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

khaadish commented Jun 24, 2026

Summary

Changes

Acceptance Criteria

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant