2.6 KiB
2.6 KiB
Conflict Quality Checklist
Use this checklist during Phase 4 of the Self-Review SOP.
Sampling Strategy
- Total conflicts in scan: ___
- Sample size (minimum 10): ___
- Sampling approach: Random / Stratified by extractor / Stratified by file
Conflict Review Template
For each sampled conflict:
Conflict #1
- File:
- Line:
- Claim:
- Conflicting Authority:
- Conflict Score:
- Verdict: Block / Flag / Pass
Assessment:
- True Positive: Real conflict that matters
- False Positive: Not actually a conflict
- True but Not Actionable: Correct but user can't/won't fix
Rationale:
Conflict #2
- File:
- Line:
- Claim:
- Conflicting Authority:
- Conflict Score:
- Verdict: Block / Flag / Pass
Assessment:
- True Positive
- False Positive
- True but Not Actionable
Rationale:
(Repeat for 10+ conflicts)
Precision Calculation
| Category | Count |
|---|---|
| True Positives | |
| False Positives | |
| True but Not Actionable | |
| Total Sampled |
Conflict Precision: True Positives / Total = ___ / ___ = ___
Status: PASS (>= 0.80) / FAIL (< 0.80)
False Positive Pattern Analysis
Group false positives by cause:
| Pattern | Count | Examples | Root Cause |
|---|---|---|---|
| Wrong concept path matching | |||
| Stale authority data | |||
| Context not considered | |||
| Overly broad regex | |||
| Test/fixture misidentified | |||
| Other: ___ |
Extractor Quality
| Extractor | Conflicts Sampled | True Positive Rate | Issues |
|---|---|---|---|
High-Value True Positives
List conflicts that provided genuine value:
- File: ___ — Why: ___
- File: ___ — Why: ___
- File: ___ — Why: ___
Problematic False Positives
List conflicts that wasted time or caused confusion:
- File: ___ — Problem: ___
- File: ___ — Problem: ___
- File: ___ — Problem: ___
Authority Quality Issues
| Authority Source | Issue | Impact |
|---|---|---|
Recommendations from Conflict Review
Based on the conflict quality analysis:
- Suppress these patterns: ___
- Fix these extractors: ___
- Update these authorities: ___
- Tune these thresholds: ___
Outcome
- Sample size >= 10
- All conflicts assessed
- Precision calculated
- False positive patterns documented
- Problematic extractors identified
- Recommendations generated