stemedb/docs/about/market-position.md
jml 9bfa626203 docs: reorganize documentation structure for clarity
Major documentation restructure to improve discoverability and reduce duplication.

## Changes

**Deleted (Archived/Consolidated)**:
- Removed duplicate getting started guides
- Archived outdated planning documents
- Consolidated corpus and configuration docs
- Removed obsolete vision/spec files (superseded by vision.md)
- Cleaned up scrapyard and old PDFs

**New Structure**:
- docs/about/ - Project overview and introduction
- docs/guides/ - User guides (moved from root)
- docs/specs/ - Technical specifications
- docs/sdk/ - SDK documentation (Go)
- docs/references/ - API references
- docs/archive/ - Archived historical docs
- applications/aphoria/docs/advanced/ - Advanced topics
- applications/aphoria/docs/reference/ - CLI reference
- applications/aphoria/docs/archive/ - Archived aphoria docs

**Updated**:
- README.md - New root README with clear navigation
- CONTRIBUTING.md - Contribution guidelines
- CLAUDE.md - Updated paths to new structure
- roadmap.md - Added recent completions

## Files Changed
- 57 files changed
- 1,977 insertions(+)
- 961 deletions(-)

**Net change**: +1,016 lines (added CONTRIBUTING.md, README.md, reorganized content)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-11 07:33:40 +00:00

74 lines
4.1 KiB
Markdown

# StemeDB: Market Position & Thesis
> **The Pitch:** The winners of the AI era won't just be the smartest models—it will be the infrastructure that makes them safe to use.
> **The Metaphor:** Google/OpenAI are building the **CPU** (Intelligence). We are building the **Hard Drive** (State).
---
## 1. The Core Thesis: The "Amnesiac Genius" Problem
We are witnessing a "Processor War." OpenAI, Google, and Anthropic are spending billions to build models with higher IQ.
* **The Result:** Agents that can reason brilliantly for 10 minutes.
* **The Flaw:** Once the context window closes, the reasoning vanishes. They are "Super-Geniuses with Amnesia."
Enterprises cannot run critical operations (Finance, Law, Science) on ephemeral thought. They need **Durable Execution**. They need a system that remembers *why* a decision was made, *who* made it, and *what evidence* supported it.
**StemeDB is the File System for the AI Scientist.**
---
## 2. The Competitive Landscape: Why Us?
Investors ask: *"Why won't Google or OpenAI just build this?"*
**Answer:** Because their business models and architectures prohibit it.
### vs. Google Gemini (The Monolith)
* **Their Play:** "The Source Graph." Fuses AI with Google Drive/Docs.
* **The Flaw:** It is a **Walled Garden**. It only works if your data is in Google.
* **Our Moat:** **Neutrality.** Modern enterprises use a "Tower of Babel" stack (Salesforce, Notion, GitHub). Google will never optimize for data living in Microsoft or Atlassian silos. StemeDB is the "Switzerland of Memory"—model-agnostic and platform-neutral.
### vs. OpenAI o3 (The Black Box)
* **Their Play:** "Internalized Chain of Thought." The model "thinks" before it answers.
* **The Flaw:** The reasoning is **Ephemeral & Opaque**. You cannot audit the thought process after the chat ends.
* **Our Moat:** **Auditability.** We sell "Signed Assertions." If an agent executes a trade, we provide the cryptographic proof of exactly which document it read. OpenAI provides the decision; we provide the compliance trail.
### vs. Vector Databases (The Legacy)
* **Their Play:** "Semantic Search." Optimize for Cosine Similarity.
* **The Flaw:** **Context Pollution.** To a Vector DB, `Budget_Draft_v1` and `Budget_Final_v2` look 99% identical. It retrieves both, causing agents to hallucinate based on outdated data.
* **Our Moat:** **Validity.** We optimize for Truth, not Similarity. Our "Epoch" and "TrustRank" systems automatically filter out superseded or low-confidence data.
---
## 3. The "Next Giant" Defense
*"Won't Open Source developers (LangChain) or Search engines (Perplexity) build this?"*
**No. They are solving different layers.**
* **Open Source (LangChain/AutoGPT):** They are building the **Manager**. They focus on orchestration (spinning up 5 agents). They treat memory as a plugin, not a platform.
* **Perplexity:** They organize the **Web**. Their architecture is optimized for 200B public URLs. StemeDB organizes **Work**—private, forked realities ("What if we acquire Company X?") that must never touch a public index.
---
## 4. The Product Strategy: "Invisible Infrastructure"
We do not ask developers to "manage a database." We ask them to "annotate their work."
1. **The Hook:** "Stop writing retry loops."
* Our SDK (`steme.BindJob`) automatically handles state recovery for long-running agents. The "Database" is just a side effect of keeping the agent alive.
2. **The Stickiness:** "The Un-Gaslightable Dashboard."
* Once a team sees a timeline of *exactly* when their agent hallucinated vs. when a human intervened, they cannot go back to black-box logs.
3. **The Flywheel:** "The Simulator."
* We convert their logs into training data. The more they use StemeDB, the smarter their custom agents become.
---
## 5. Summary
We are not fighting the Model War. We are selling shovels in the Gold Rush.
* Let OpenAI build the **Genius** (The Agent).
* Let Google build the **Office** (The Workspace).
* We will build the **Filing Cabinet** (The Truth).
History shows that the System of Record (Oracle, Snowflake) is often more valuable than the Application Layer.