Commit Graph

2 Commits

Author SHA1 Message Date
jml
7facac08a2 feat(aphoria): add enhanced bulk claim import with validation and reporting
Replaces tedious shell scripts with TOML-based bulk import:
- 340 lines bash → 200 lines TOML → 1 command
- 15 minutes → <1 second execution time
- 0% → 100% error detection before writes

Features:
- Pre-import validation (ID format, tiers, required fields, duplicates)
- Detailed reporting (table and JSON formats)
- Template generation (--template)
- Validation-only mode (--validate-only)
- Merge strategies (skip_existing, overwrite, fail_on_duplicate)

Documentation:
- Comprehensive guide: docs/guides/bulk-claim-import.md
- Updated README with quick start
- Example files with inline documentation

Validation catches:
- Invalid claim IDs (must be kebab-case)
- Unknown authority tiers
- Empty required fields
- Duplicate IDs within import file
- Duplicate concept paths (warnings)

Error reporting:
- Shows ALL errors before any writes (not just first failure)
- Clear context: claim index, ID, field, and error message
- Warnings for non-blocking issues

Testing:
- All clippy checks pass
- Production build succeeds
- Validated template generation, validation-only, dry-run, import, merge strategies

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 05:31:04 +00:00
jml
bb0c33f8d3 fix(api): enable querying of CLI-created community corpus items
## Problem
CLI-created community corpus items (tier 3) were stored correctly but
invisible via API queries. Two issues blocked discoverability:

1. **Prefix mismatch**: API hardcoded 'community://pattern/' for
   aggregated patterns, but CLI creates 'community://rust/http/...' URIs
2. **Query parameter parsing**: Axum's default parser doesn't support
   bracket notation (?sources[]=value) used by the dashboard

Result: 0/22 CLI-created items were queryable.

## Solution

### Fix 1: Broaden Community Prefix
- Changed: 'community://pattern/' → 'community://' in corpus handler
- Impact: Now matches both aggregated patterns AND CLI-created items
- Backward compatible: Broader prefix includes narrower results

### Fix 2: Add QsQuery Extractor
- Added: serde_qs dependency + custom QsQuery extractor
- Supports: Bracket notation for array parameters (?sources[]=a&sources[]=b)
- Compatible: Works with JavaScript URLSearchParams standard
- Tested: 3 new unit tests for extractor behavior

## Verification
-  All 22 CLI-created community items now queryable (was 0)
-  Source filtering works: community (22), RFC (2), vendor (5)
-  Multi-source queries work: ?sources[]=community&sources[]=rfc → 24
-  All 89 API tests pass + 3 new extractor tests
-  Clippy clean (0 warnings)
-  No regressions in existing functionality

## Files Changed
- crates/stemedb-api/Cargo.toml: Add serde_qs dependency
- crates/stemedb-api/src/extractors.rs: New QsQuery extractor (117 lines)
- crates/stemedb-api/src/handlers/aphoria/corpus.rs: Use QsQuery, broaden prefix
- crates/stemedb-api/src/lib.rs: Export extractors module

Also includes: Scale-adaptive thresholds, wiki corpus extraction,
documentation updates, and dashboard UI improvements from prior work.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 15:54:35 +00:00