stemedb/applications/aphoria/tests/llm_fixtures/manifest.toml
jordan 157dbbb9eb feat: Complete Aphoria Phase 8-9 + UAT suite (90/90 tests passing)
## Phase 8: Enterprise Extractor Improvements 
- 14 security extractors (TLS, JWT, SQL injection, XSS, etc.)
- 10 framework-specific extractors (Spring, Django, Rails, etc.)
- Config file security detection (YAML, TOML)

## Phase 9: Autonomous Extractor Generation 
- Shadow mode executor with TP/FP tracking
- Graduation pipeline with confidence thresholds
- Auto-rollback on regression detection
- Cross-project pattern syncing

## UAT Suite Complete (14 scripts, 90 tests)
- test-core-detection.sh (6 tests)
- test-declarative-extractors.sh (5 tests)
- test-domain-frameworks.sh (5 tests)
- test-domain-unreal.sh (3 tests)
- test-llm-extraction.sh (6 tests)
- test-eval-harness.sh (5 tests)
- test-cross-language.sh (3 tests)
- test-precommit-performance.sh (4 tests)
- test-output-formats.sh (8 tests)
- test-drift-detection.sh (6 tests)
- test-exit-codes.sh (12 tests)
+ 3 more scripts

## Other Changes
- Updated roadmap to mark Phase 8-9 complete
- Added .gitignore entries for build artifacts
- Updated pre-commit: 800 line limit, exclude tests/data/cmd

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-06 22:50:55 -07:00

38 lines
850 B
TOML

[corpus]
version = "1.0.0"
created = "2025-02-05"
description = "Golden fixtures for Aphoria LLM prompt evaluation"
[categories.tls]
fixtures = 2
description = "TLS/SSL certificate and protocol validation"
[categories.negative]
fixtures = 2
description = "Safe patterns that should NOT trigger findings"
[categories.edge]
fixtures = 1
description = "Edge cases and boundary conditions"
[categories.jwt]
fixtures = 2
description = "JWT token security configurations"
[categories.secrets]
fixtures = 2
description = "Hardcoded secrets and credential detection"
[categories.auth]
fixtures = 1
description = "Authentication bypass and debug modes"
[baseline]
precision = 0.9285714285714286
recall = 1.0
f1 = 0.962962962962963
total_fixtures = 10
prompt_version = "1.0.0"
model = "gemini-2.0-flash"
measured_at = "2026-02-06T07:50:55.245991+00:00"