tidaldb/social-scale.md at main

jordan 213b8efcca feat: complete M6-M7 + Enterprise Readiness milestones; split oversized source files per CODING_GUIDELINES §9

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-02-23 22:41:16 -07:00

3.1 KiB

Raw Permalink Blame History

CoEngagementIndex Eviction

Correctness Tests

All tests in tidal/tests/m7p3_social_scale.rs pass:

Test	Invariant Verified
`eviction_correctness_at_2x_capacity`	`edge_count <= capacity` after every insert at 2× load
`high_weight_edge_survival_under_eviction`	High-weight edges (score=100) survive low-weight flood
`co_engagement_memory_bounded_at_2x_insertions`	10 users × 40 items stays within capacity=200
`no_self_loops_after_eviction`	No `(a, a)` edges created under eviction pressure
`top_candidates_consistent_after_eviction`	`top_candidates()` works correctly after eviction

Eviction Benchmarks

Capacity	Mean Eviction Latency	Notes
10K edges	~2ms	O(N log N) sort over 10K entries
50K edges	~12ms	O(N log N) sort over 50K entries
100K edges	~26ms	O(N log N) sort over 100K entries

Benchmark: cargo bench --manifest-path tidal/Cargo.toml --bench social -- co_engagement_eviction

Note: Eviction is amortised. With USER_RECENT_CAPACITY=50, each record_positive adds at most 50 new edges. At default capacity=50K, eviction fires at most once every ~50K/50=1000 positive engagements — approximately once per active user session.

Setup

100 followed creators
500 followers per creator
10K items per creator = 1M items total

Benchmark Results

Depth	Mean Latency	p99 Latency	Notes
Depth-1 (followed creator items)	~3ms	~8ms	Direct bitmap union over 100 creators
Depth-2 (co-followers' seen items)	~25ms	~45ms	Fan-out: 100 creators × 500 followers

Target: p99 < 50ms — ✅ achieved

Run: cargo bench --manifest-path tidal/Cargo.toml --bench social -- social_graph_1m

Fan-out Cap

The social graph filter uses DEPTH2_FAN_OUT_CAP to bound the number of co-followers processed. Without this cap, depth-2 at 500 followers × 100 creators = 50K co-follower lookups. With the cap, this is bounded to prevent p99 blowup.

Cross-Session Preference Merge

Algorithm

PreferenceVectors::update_with_custom_rate(user_id, &interaction, lr):

pref = (1 - lr) * pref + lr * interaction
pref = L2_normalize(pref)

For 128D: 128 multiply-adds + L2 normalize = ~256 FP ops + sqrt.

Benchmark Results

Operation	Mean Latency	Target
Single 128D EMA (100K users)	~3µs	< 1ms ✅
10-session batch (100K users)	~28µs	< 10ms ✅

Both targets easily met. The bottleneck is DashMap lookup (~2µs overhead), not the EMA computation itself (~1µs for 128D).

Run: cargo bench --manifest-path tidal/Cargo.toml --bench social -- preference_merge

Adaptive Learning Rate

The update() method (used in production) uses an adaptive learning rate that decays as:

alpha = base_alpha / (1 + ln(update_count + 1))

This means users with many updates converge slower (preferences are more stable). First-session users get alpha = 0.1; users with 100+ sessions get alpha ≈ 0.02.

3.1 KiB Raw Permalink Blame History Unescape Escape

Social Scale Performance Analysis

CoEngagementIndex Eviction

Correctness Tests

Eviction Benchmarks

Social Graph Filter at 1M Items

Setup

Benchmark Results

Fan-out Cap

Cross-Session Preference Merge

Algorithm

Benchmark Results

Adaptive Learning Rate

3.1 KiB

Raw Permalink Blame History