stemedb/position.md
jordan 1ce4004807 feat: Complete Phase 2 (The Cortex) - query, lens, and API layers
2026-02-01 13:22:44 -07:00


# StemeDB: Market Position & Thesis
> **The Pitch:** The winners of the AI era won't just be the smartest models. They will also be the infrastructure that makes those models safe to use.
>
> **The Metaphor:** Google/OpenAI are building the **CPU** (Intelligence). We are building the **Hard Drive** (State).
---
## 1. The Core Thesis: The "Amnesiac Genius" Problem
We are witnessing a "Processor War." OpenAI, Google, and Anthropic are spending billions to build models with higher IQ.
* **The Result:** Agents that can reason brilliantly for 10 minutes.
* **The Flaw:** Once the context window closes, the reasoning vanishes. They are "Super-Geniuses with Amnesia."

Enterprises cannot run critical operations (Finance, Law, Science) on ephemeral thought. They need **Durable Execution**. They need a system that remembers *why* a decision was made, *who* made it, and *what evidence* supported it.

**StemeDB is the File System for the AI Scientist.**

---
## 2. The Competitive Landscape: Why Us?
Investors ask: *"Why won't Google or OpenAI just build this?"*

**Answer:** Because their business models and architectures prohibit it.
### vs. Google Gemini (The Monolith)
* **Their Play:** "The Source Graph." Fuses AI with Google Drive/Docs.
* **The Flaw:** It is a **Walled Garden**. It only works if your data is in Google.
* **Our Moat:** **Neutrality.** Modern enterprises use a "Tower of Babel" stack (Salesforce, Notion, GitHub). Google will never optimize for data living in Microsoft or Atlassian silos. StemeDB is the "Switzerland of Memory"—model-agnostic and platform-neutral.
### vs. OpenAI o3 (The Black Box)
* **Their Play:** "Internalized Chain of Thought." The model "thinks" before it answers.
* **The Flaw:** The reasoning is **Ephemeral & Opaque**. You cannot audit the thought process after the chat ends.
* **Our Moat:** **Auditability.** We sell "Signed Assertions." If an agent executes a trade, we provide the cryptographic proof of exactly which document it read. OpenAI provides the decision; we provide the compliance trail.
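
As a rough illustration of what a signed assertion could look like, the Go sketch below hashes the document an agent read, embeds that hash in an assertion, and signs it with Ed25519 from the standard library. The `Assertion` and `SignedAssertion` types, their fields, and every helper name are assumptions made for this example, not StemeDB's actual schema or API.

```go
package main

import (
	"crypto/ed25519"
	"crypto/sha256"
	"encoding/hex"
	"encoding/json"
	"fmt"
)

// Assertion is a hypothetical record of a claim an agent made and the
// evidence it relied on. Field names are illustrative only.
type Assertion struct {
	AgentID      string `json:"agent_id"`
	Claim        string `json:"claim"`
	EvidenceHash string `json:"evidence_hash"` // SHA-256 of the source document
}

// SignedAssertion pairs an assertion with the signer's public key and signature.
type SignedAssertion struct {
	Assertion Assertion
	PublicKey ed25519.PublicKey
	Signature []byte
}

// hashDocument returns the hex-encoded SHA-256 digest of a document's bytes.
func hashDocument(doc []byte) string {
	sum := sha256.Sum256(doc)
	return hex.EncodeToString(sum[:])
}

// signAssertion serializes the assertion as JSON and signs those bytes.
func signAssertion(priv ed25519.PrivateKey, a Assertion) (SignedAssertion, error) {
	payload, err := json.Marshal(a)
	if err != nil {
		return SignedAssertion{}, err
	}
	return SignedAssertion{
		Assertion: a,
		PublicKey: priv.Public().(ed25519.PublicKey),
		Signature: ed25519.Sign(priv, payload),
	}, nil
}

// verifyAssertion checks the signature and that the assertion cites doc.
func verifyAssertion(s SignedAssertion, doc []byte) bool {
	payload, err := json.Marshal(s.Assertion)
	if err != nil {
		return false
	}
	return ed25519.Verify(s.PublicKey, payload, s.Signature) &&
		s.Assertion.EvidenceHash == hashDocument(doc)
}

// auditDemo signs an assertion over original, then reports whether it
// verifies against original and against a second candidate document.
func auditDemo(original, candidate []byte) (okOriginal, okCandidate bool, err error) {
	_, priv, err := ed25519.GenerateKey(nil) // nil selects crypto/rand
	if err != nil {
		return false, false, err
	}
	signed, err := signAssertion(priv, Assertion{
		AgentID:      "agent-7",
		Claim:        "execute trade",
		EvidenceHash: hashDocument(original),
	})
	if err != nil {
		return false, false, err
	}
	return verifyAssertion(signed, original), verifyAssertion(signed, candidate), nil
}

func main() {
	okOriginal, okAltered, err := auditDemo(
		[]byte("Q3 budget: final"), []byte("Q3 budget: draft"))
	if err != nil {
		panic(err)
	}
	fmt.Println("verifies against the document that was read:", okOriginal)
	fmt.Println("verifies against a different document:", okAltered)
}
```

An auditor holding the document and the public key can rerun `verifyAssertion` long after the chat has ended; any change to the document or to the claim makes verification fail.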
### vs. Vector Databases (The Legacy)
* **Their Play:** "Semantic Search." Optimize for Cosine Similarity.
* **The Flaw:** **Context Pollution.** To a Vector DB, `Budget_Draft_v1` and `Budget_Final_v2` look nearly identical. It retrieves both, so agents can end up reasoning from outdated data.
* **Our Moat:** **Validity.** We optimize for Truth, not Similarity. Our "Epoch" and "TrustRank" systems automatically filter out superseded or low-confidence data.
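
The difference between ranking by similarity and filtering by validity can be sketched in a few lines of Go. This is a minimal illustration under stated assumptions: the `Candidate` type, its `Trust` and `SupersededBy` fields, and the `minTrust` threshold are hypothetical stand-ins for the "Epoch" and "TrustRank" systems, whose real data model is not described here.

```go
package main

import (
	"fmt"
	"sort"
)

// Candidate is a hypothetical retrieval hit. All fields are assumptions
// made for this sketch.
type Candidate struct {
	ID           string
	Similarity   float64 // cosine similarity to the query
	Trust        float64 // source trust score in [0, 1]
	SupersededBy string  // ID of the record that replaces this one, or ""
}

// filterValid drops superseded and low-trust candidates, then ranks the
// remainder by similarity, highest first.
func filterValid(hits []Candidate, minTrust float64) []Candidate {
	valid := make([]Candidate, 0, len(hits))
	for _, h := range hits {
		if h.SupersededBy != "" || h.Trust < minTrust {
			continue
		}
		valid = append(valid, h)
	}
	sort.SliceStable(valid, func(i, j int) bool {
		return valid[i].Similarity > valid[j].Similarity
	})
	return valid
}

func main() {
	hits := []Candidate{
		// The draft scores highest on similarity but has been superseded.
		{ID: "Budget_Draft_v1", Similarity: 0.97, Trust: 0.9, SupersededBy: "Budget_Final_v2"},
		{ID: "Budget_Final_v2", Similarity: 0.96, Trust: 0.9},
		{ID: "Forum_Rumor", Similarity: 0.80, Trust: 0.2},
	}
	for _, h := range filterValid(hits, 0.5) {
		fmt.Println(h.ID) // prints only "Budget_Final_v2"
	}
}
```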
---
## 3. The "Next Giant" Defense
*"Won't Open Source developers (LangChain) or Search engines (Perplexity) build this?"*

**No. They are solving different layers.**
* **Open Source (LangChain/AutoGPT):** They are building the **Manager**. They focus on orchestration (spinning up 5 agents). They treat memory as a plugin, not a platform.
* **Perplexity:** They organize the **Web**. Their architecture is optimized for 200B public URLs. StemeDB organizes **Work**—private, forked realities ("What if we acquire Company X?") that must never touch a public index.
---
## 4. The Product Strategy: "Invisible Infrastructure"
We do not ask developers to "manage a database." We ask them to "annotate their work."
1. **The Hook:** "Stop writing retry loops."
    * Our SDK (`steme.BindJob`) automatically handles state recovery for long-running agents. The "Database" is just a side effect of keeping the agent alive.
2. **The Stickiness:** "The Un-Gaslightable Dashboard."
    * Once a team sees a timeline of *exactly* when their agent hallucinated vs. when a human intervened, they cannot go back to black-box logs.
3. **The Flywheel:** "The Simulator."
    * We convert their logs into training data. The more they use StemeDB, the smarter their custom agents become.
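
The state-recovery idea behind "Stop writing retry loops" can be sketched as checkpoint-and-resume. The signature of `steme.BindJob` is not given here, so the Go example below does not call it; `runJob`, `Checkpoint`, and the in-memory `Store` are hypothetical stand-ins that only illustrate the pattern of saving progress after each step and resuming from the last saved step.

```go
package main

import (
	"errors"
	"fmt"
)

// Checkpoint is a hypothetical record of how far a job has progressed.
type Checkpoint struct {
	NextStep int
}

// Store maps job IDs to checkpoints. An in-memory map stands in for a
// durable backend in this sketch.
type Store map[string]Checkpoint

// runJob executes steps in order and saves a checkpoint after each success.
// A later call with the same jobID resumes at the saved step instead of
// starting over.
func runJob(store Store, jobID string, steps []func() error) error {
	cp := store[jobID] // zero value means "start at step 0"
	for i := cp.NextStep; i < len(steps); i++ {
		if err := steps[i](); err != nil {
			return fmt.Errorf("step %d: %w", i, err)
		}
		store[jobID] = Checkpoint{NextStep: i + 1}
	}
	return nil
}

// simulate runs a three-step job whose second step fails once, then runs
// the job again. It returns how many times each step executed.
func simulate() [3]int {
	var runs [3]int
	failOnce := true
	steps := []func() error{
		func() error { runs[0]++; return nil },
		func() error {
			runs[1]++
			if failOnce {
				failOnce = false
				return errors.New("transient failure")
			}
			return nil
		},
		func() error { runs[2]++; return nil },
	}
	store := Store{}
	if err := runJob(store, "job-1", steps); err == nil {
		panic("expected the first attempt to fail at step 1")
	}
	if err := runJob(store, "job-1", steps); err != nil {
		panic(err)
	}
	return runs
}

func main() {
	// Step 0 runs once, step 1 runs twice (fail, then succeed), step 2 once.
	fmt.Println(simulate()) // prints [1 2 1]
}
```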
---
## 5. Summary
We are not fighting the Model War. We are selling shovels in the Gold Rush.
* Let OpenAI build the **Genius** (The Agent).
* Let Google build the **Office** (The Workspace).
* We will build the **Filing Cabinet** (The Truth).

History shows that the System of Record (Oracle, Snowflake) is often more valuable than the Application Layer.