stemedb/phase7-uat.md at bb0c33f8d3062dad0cc3cc655cc491c47215ae9e

jordan b3e8a9a058 feat: Multi-application expansion with chaos testing and community UI

Major additions:
- Community Next.js app (port 18187) for browsing claims with API docs
- stemedb-chaos crate: Fault injection, chaos testing, CRDT properties
- Latent ingestion system: Reddit/FDA ingesters with ADK-Go agents
- Disputed claims handling: Manual review workflows and validation
- Aphoria security scanner: New extractors (SQL injection, command
  injection, weak crypto, TLS version), policy-based ignores, UAT reports
- Docker infrastructure: Dockerfile, docker-compose.yml for full stack
- VulnBank demo: Intentionally vulnerable multi-language test corpus

SDK & API enhancements:
- Source registry handlers for tracking data provenance
- Metrics endpoint
- Skeptic filtering improvements

Code quality:
- Split 14 large files (>500 lines) into focused modules
- All files now under 500-line limit per project guidelines

Documentation:
- Chaos testing guide, circuit breakers, observability docs
- Phase 7 UAT documentation updates
- Martin Kleppmann technical writer agent

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Area	Tests	Status
Circuit Breakers (7D)	25	PASS
Trust Graph Store (7B)	23	PASS
Trust Rank Store	22	PASS
PoW types (7A)	19	PASS
Domain Trust Store (7B)	18	PASS
Admission Store (7A)	16	PASS
Content Defense (quality) (7C)	13	PASS
Similarity Index (MinHash/LSH) (7C)	12	PASS
Quarantine Store (7C)	9	PASS
Trust Tier types (7A)	8	PASS
API Admission integration	6	PASS
Content Defense Layer (7C)	5	PASS
Total Phase 7	176	ALL PASS

Tier	Trust Range	Quota Multiplier	Hourly Limit
Untrusted	0.0-0.3	0.1x	1,000/hr
Limited	0.3-0.5	0.5x	5,000/hr
Verified	0.5-0.7	1.0x	10,000/hr
Trusted	0.7-0.9	2.0x	20,000/hr
Authority	0.9-1.0	10.0x	100,000/hr

Content Type	Expected Quality	Should Quarantine?
Normal assertion: "Aspirin:treats:Headache"	>0.6	No
Low entropy: "aaaa:bbbb:cccc"	<0.4	Yes
Structured data with JSON	>0.7 (bonus)	No
Untrusted agent + high confidence	<0.5 (penalty)	Yes

Content Pair	Jaccard Similarity	Expected
"Aspirin:treats:Headache" vs same	1.0	Duplicate
"Aspirin:treats:Headache" vs "Aspirin:treats:Migraine"	~0.7	Unique
"Aspirin treats headaches" vs "Aspirin:treats:Headache"	~0.85	Unique
"Asprin:treats:Headach" (typos) vs "Aspirin:treats:Headache"	~0.92	Duplicate

8.8 KiB

Raw Blame History

Phase 7 UAT: The Shield

Summary

Test Coverage (Verified)

Realistic Usage Scenarios

Scenario 1: New Agent Onboarding

Scenario 2: Trust Tier Quotas

Scenario 3: EigenTrust Sybil Resistance

Scenario 4: Content Quality Filtering

Scenario 5: Quarantine Admin Workflow

Integration Points to Verify

Scenario 6: Near-Duplicate Detection (MinHash/LSH)

Scenario 7: Circuit Breaker State Machine (7D)

Known Limitations

Commands to Run

Success Criteria

8.8 KiB Raw Blame History Unescape Escape

Phase 7 UAT: The Shield

Summary

Test Coverage (Verified)

Realistic Usage Scenarios

Scenario 1: New Agent Onboarding

Scenario 2: Trust Tier Quotas

Scenario 3: EigenTrust Sybil Resistance

Scenario 4: Content Quality Filtering

Scenario 5: Quarantine Admin Workflow

Integration Points to Verify

Scenario 6: Near-Duplicate Detection (MinHash/LSH)

Scenario 7: Circuit Breaker State Machine (7D)

Known Limitations

Commands to Run

Success Criteria

Related Documentation

8.8 KiB

Raw Blame History