# Baseline: YYYY-MM-DD **Prompt Version:** X.Y.Z **Model:** gemini-2.0-flash **Fixture Count:** N --- ## Overall Metrics | Metric | Value | Target | Status | |--------|-------|--------|--------| | Precision | X.XX | 0.80 | | | Recall | X.XX | 0.75 | | | F1 | X.XX | 0.77 | | | Parse Success | X.XX% | 95% | | ## Per-Category Breakdown | Category | Fixtures | Passed | Failed | Precision | Recall | F1 | |----------|----------|--------|--------|-----------|--------|-----| | tls | N | N | N | X.XX | X.XX | X.XX | | jwt | N | N | N | X.XX | X.XX | X.XX | | secrets | N | N | N | X.XX | X.XX | X.XX | | auth | N | N | N | X.XX | X.XX | X.XX | | negative | N | N | N | X.XX | X.XX | X.XX | | edge | N | N | N | X.XX | X.XX | X.XX | ## Failed Fixtures | ID | Category | Issue | Root Cause | |----|----------|-------|------------| | | | | | ## Changes Since Last Baseline - Change 1 - Change 2 ## Known Issues - [ ] Issue 1 - [ ] Issue 2 ## Next Optimization Targets 1. Target 1 2. Target 2 3. Target 3 --- ## Raw Results ```json // Paste JSON output here for reference ```