tidaldb

Author	SHA1	Message	Date
jordan	192c473f55	feat: complete Milestone 5 — full-text search, RRF fusion, and creator search - M5p1: BM25 text indexing via Tantivy with background syncer (0.26ms @ 10K docs) - M5p2: RRF fusion layer combining BM25 + ANN scores (46µs @ 1K candidates) - M5p3: unified Search query API (8-stage pipeline, BM25 + vector + ranking) - M5p4: creator text + vector indexing and creator search executor (< 20ms @ 200 creators) - Refactor db/mod.rs into focused sub-modules (creators, items, sessions, signals, etc.) - Decompose monolithic files into directory modules (query/executor, ranking/diversity, etc.) - Split brute.rs → brute/mod.rs + brute/tests.rs; extract search executor helpers - Add benches: fusion, search, session, text_index - Add M5 UAT test suites (m5_uat, m5_search, m5p4_creator_search, text_index) - Update blog posts, roadmap, content strategy, and M5 planning docs - Add tmp/ and .claude/worktrees/ to .gitignore Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-21 23:53:16 -07:00
jordan	39ada28c6e	feat: complete Milestones 2–4 — RETRIEVE query, vector index, ranking profiles, diversity, entity system, sessions M2: RETRIEVE query pipeline with 5-stage execution (candidate → filter → score → diversify → limit), usearch HNSW vector index, bitmap/range/universe filters, ranking profiles with signal scoring, MMR diversity enforcement, and m2_uat integration tests. M3: Entity system with typed metadata, relationship graph (follows/blocks/interactions), creator entities, session tracking, and m3_uat integration tests. M4: Advanced ranking with builtin functions (freshness, trending, controversy, wilson), ranking executor with explain mode, query executor integration, benchmarks for query/ranking/vector/filters/diversity, and m4_uat integration tests. Includes: 9 new blog posts, marketing site updates, updated roadmap, and updated vision doc. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 16:24:48 -07:00
jordan	4f076c927d	feat: M0p1 runtime skeleton, M0p2 tooling & diagnostics, m1p4 signal ledger ## M0p1 — Embeddable Runtime Skeleton (329 tests) - TidalDb with builder(), health_check(), close(), and Drop-based cleanup - TidalDbBuilder fluent API: ephemeral(), with_data_dir(), wal_dir(), cache_dir() - Config, StorageMode, ConfigError types; Config(ConfigError) variant on LumenError - Paths: single source of truth for directory layout (wal, items, users, creators, cache) - TempTidalHome: test isolation helper gated behind #[cfg(test)] / test-utils feature - 8 integration tests: tests/sandboxed_storage.rs ## M0p2 — Tooling & Diagnostics (349 tests) - Workspace root Cargo.toml (members: ["tidal", "tidalctl"]) - tidal/build.rs: BUILD_HASH from GIT_HASH with option_env!() fallback to "dev" - MetricsState: always-compiled Arc-shared atomics (uptime, health_ok) - MetricsHandle (metrics feature): hand-rolled TcpListener HTTP, zero new deps - GET /healthz → {"status":"ok","uptime_secs":N} - GET /metrics → Prometheus text (tidaldb_uptime_seconds, health_ok, info) - TidalDbBuilder.enable_metrics(addr) starts background metrics thread - tidalctl binary: status + paths commands, manual std::env::args() parsing - 7 metrics integration tests, 9 tidalctl CLI tests ## m1p4 Signal Ledger (in-progress) - SignalLedger: DashMap<(EntityId, SignalTypeId), EntitySignalEntry>, WAL-first writes - HotSignalState: #[repr(C, align(64))], lock-free CAS decay, out-of-order handling - BucketedCounter: 60 per-minute + 168 per-hour circular buffers, trigger-based rotation - CheckpointMeta + serialize/restore: 983-byte fixed records, atomic WriteBatch - Property tests: running score matches analytical to 1e-6, decay monotonic, non-negative - Proptest regression: signals/warm.txt ## Documentation and planning - ROADMAP: m0p1 COMPLETE (329), m0p2 COMPLETE (349), product track milestones - PRODUCT_ROADMAP: P0-P4 product milestone track (personal briefing beachhead) - Milestone planning docs: milestone-0 (phases 1-3), milestone-p (phases 1-5) - docs/research/tidaldb_tooling_and_diagnostics.md - ARCHITECTURE.md, CLAUDE.md, VISION.md updates ## Site - Blog: every-platform-builds-the-same-6-systems.mdx (new) - Blog: why-tidaldb.mdx (updated) - next.config.ts, layout.tsx, blog/page.tsx updates Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 20:32:00 -07:00
jordan	29400d48db	feat: implement Milestone 1 phases 1-3 — schema, WAL, and storage layer Implements the foundation of tidalDB's data pipeline: Phase 1 – Schema primitives - EntityId newtype (u64, big-endian ordering) - SignalTypeDefinition with pre-computed decay λ, deduped/sorted windows - SchemaBuilder with full constraint validation (duplicates, identifiers, half-life, windows, velocity) - LumenError wrapping all subsystems with required From impls Phase 2 – Write-Ahead Log - Length-prefixed, BLAKE3-protected entry format - Group-commit writer (batch up to 100 events / 10 ms) - Double-buffered content-hash deduplication - Checkpoint, truncation, and crash-recovery with full replay - Integration, property, and UAT tests (incl. 5,500-event deterministic UAT) - Proptest coverage scaled to 10 000 events/run (was ≤500) to meet acceptance criterion; cases reduced 100→10 to keep runtime comparable Phase 3 – Storage engine - StorageEngine trait (get/put/delete/scan/batch/flush) - Key encoding: [EntityId][0x00][Tag][suffix] with ordering/prefix helpers - InMemoryBackend (BTreeMap + RwLock) - FjallStorage with three isolated keyspaces and atomic batch helper - Property tests for key ordering and round-trip correctness Also adds planning docs for phases 4-5, research docs, architecture overview, and roadmap updates. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 16:43:24 -07:00
jordan	413b712c0a	chore: initialize tidalDB repository with schema foundation and standards - Schema phase 1 (tasks 01-02): EntityId, EntityKind, Timestamp, Score, SignalTypeDef, DecayModel, Window, WindowSet — all with property tests and benchmarks scaffolding - Stub modules for storage, signals, query, ranking - Full documentation suite: VISION, USE_CASES, SEQUENCE, API, CODING_GUIDELINES, ai-lookup, research docs, specs, roadmap, planning docs - Marketing site (Next.js) with blog infrastructure - .claude/ agents and skills for the tidalDB development workflow - Foundation standards enforced: thiserror + tracing declared as dependencies, clippy::unwrap_used = deny added to lint config - .gitignore hardened: .next/, node_modules/, .env, secrets, logs Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 12:52:20 -07:00
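
The Milestone 1 and signal-ledger commits above mention a pre-computed decay λ derived from a half-life, out-of-order handling, and a running score that matches the analytical value. The following is a minimal sketch of that idea, not the repository's code: `DecayingScore`, its fields, and the use of `f64` seconds for timestamps are all hypothetical, and the lock-free CAS update described in the log is deliberately left out.

```rust
use std::f64::consts::LN_2;

/// Hypothetical single-threaded stand-in for an exponentially decayed signal score.
struct DecayingScore {
    lambda: f64,  // decay rate, pre-computed once as ln(2) / half_life
    score: f64,   // running score as of `last_ts`
    last_ts: f64, // timestamp (seconds) of the newest event seen so far
}

impl DecayingScore {
    fn new(half_life_secs: f64) -> Self {
        Self { lambda: LN_2 / half_life_secs, score: 0.0, last_ts: 0.0 }
    }

    /// Fold one event of the given weight into the running score.
    fn record(&mut self, weight: f64, ts: f64) {
        if ts >= self.last_ts {
            // In-order event: decay the running score forward, then add the weight.
            let dt = ts - self.last_ts;
            self.score = self.score * (-self.lambda * dt).exp() + weight;
            self.last_ts = ts;
        } else {
            // Out-of-order event: decay the late weight up to `last_ts` instead.
            let dt = self.last_ts - ts;
            self.score += weight * (-self.lambda * dt).exp();
        }
    }

    /// Score as it would read at time `ts` (assumed to be >= `last_ts`).
    fn value_at(&self, ts: f64) -> f64 {
        self.score * (-self.lambda * (ts - self.last_ts)).exp()
    }
}

fn main() {
    let mut s = DecayingScore::new(10.0);
    s.record(1.0, 0.0);
    // After exactly one half-life the contribution has halved.
    println!("{:.6}", s.value_at(10.0));
}
```

Because the late event is decayed by the same factor it would have accumulated had it arrived on time, the running score equals the closed-form sum of `weight * exp(-λ * (now - ts))` over all events regardless of arrival order.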

5 Commits
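
The Milestone 5 commit describes an RRF fusion layer that combines BM25 and ANN results. As a rough illustration only, the sketch below implements standard Reciprocal Rank Fusion over ranked lists of ids. The function name `rrf_fuse`, the `u64` id type, and the constant `k = 60` are assumptions made for this example; the log does not say which constant or types tidalDB uses.

```rust
use std::collections::HashMap;

/// Reciprocal Rank Fusion: each list contributes 1 / (k + rank) per document,
/// where rank is the document's 1-based position in that list.
fn rrf_fuse(rankings: &[Vec<u64>], k: f64) -> Vec<(u64, f64)> {
    let mut scores: HashMap<u64, f64> = HashMap::new();
    for ranking in rankings {
        for (idx, &doc) in ranking.iter().enumerate() {
            let rank = idx as f64 + 1.0;
            *scores.entry(doc).or_insert(0.0) += 1.0 / (k + rank);
        }
    }
    let mut fused: Vec<(u64, f64)> = scores.into_iter().collect();
    // Highest fused score first; ties broken by ascending id for determinism.
    fused.sort_by(|a, b| b.1.total_cmp(&a.1).then(a.0.cmp(&b.0)));
    fused
}

fn main() {
    // Hypothetical result lists: one from a text (BM25) search, one from a vector (ANN) search.
    let bm25 = vec![1, 2, 3];
    let ann = vec![3, 1, 4];
    for (doc, score) in rrf_fuse(&[bm25, ann], 60.0) {
        println!("{doc}\t{score:.5}");
    }
}
```

RRF uses only rank positions, so the BM25 and vector-similarity scores never need to be normalized onto a common scale before fusing.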