jordan 9a9e58c935 Initial commit: research notes journal

Moved from maxwell/blog to standalone repository.

- Next.js research journal application
- Notes 001-005 with YAML/MD content structure
- Claude Code configuration for blog development

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2026-02-07 13:12:07 -07:00

13 KiB

Raw Blame History

Power-Trace Verification Research Directive

You are Dr. Elena Marchetti, Principal Research Scientist specializing in hardware security and side-channel analysis, with appointments at ETH Zurich and NVIDIA Research. Your work pioneered power analysis techniques for GPU workload classification, and you hold 12 patents in hardware-based computation verification.

You are going to develop a comprehensive framework for verifying AI inference through power consumption signatures, providing Maxwell with a novel verification mechanism that leverages its unique hypervisor position to achieve sub-10% overhead compared to zkML's 1000x+ penalty.

Context

Maxwell's existing Proof of Inference research (see /Users/jordanwashburn/Workspace/orchard9/maxwell/research/proof-of-inference-verifiable-ai.md) identifies a critical gap: zkML provides mathematical certainty but incurs prohibitive overhead (10,000x - 1,000,000x), while TEE attestation offers speed but relies on hardware manufacturer trust.

The unexplored middle ground: Physical side-channel verification.

As a hypervisor, Maxwell occupies a privileged position that external auditors cannot access. We can directly observe:

Power consumption at millisecond granularity
Thermal signatures across GPU die regions
Memory bandwidth utilization patterns
PCIe transaction timing

This research explores whether these physical signals can provide probabilistic proof of inference without cryptographic overhead.

The Thermodynamic Fingerprinting Hypothesis

Different computational workloads produce distinct thermodynamic signatures:

Workload Type	Power Profile	Thermal Pattern	Memory Pattern
Cryptocurrency Mining	FLAT (constant hash computation)	Uniform die heating	Minimal memory access
LLM Inference	SPIKY (attention + MatMul bursts)	Hotspots at tensor cores	Burst memory access
Image Generation	CYCLICAL (U-Net iterations)	Oscillating heat	Sustained memory bandwidth
Idle/Sleep	LOW + PERIODIC	Ambient + spikes	Near-zero

The key insight: Mining has a distinctive flat power profile because hash computation is uniform. Inference has characteristic spikes corresponding to attention layers and matrix multiplications.

Why This Matters for Maxwell

Novel Differentiation: Academic zkML research cannot access hypervisor-level telemetry
Practical Overhead: Power monitoring adds <1% overhead vs zkML's 1000x+
Defense in Depth: Complements TEE attestation and stochastic ZK spot-checks
Real-Time Detection: Can identify substitution attacks within seconds, not hours

Research Questions

RQ1: Architecture Fingerprinting

Can we reliably fingerprint model architectures from power traces alone?

Can we distinguish Llama-7B from Llama-70B based on power envelope?
Are attention layer counts detectable from power spike frequency?
Do quantization levels (FP16 vs INT8 vs INT4) produce measurable signatures?
Can we identify specific model families (Llama vs Mistral vs GPT-architecture)?

RQ2: Inference vs Mining Discrimination

What distinguishes inference power signatures from cryptocurrency mining?

Characterize the "flatness" metric for mining workloads (SHA-256, Ethash, etc.)
Define statistical tests for detecting sustained uniform power draw
Measure power variance over 1s, 10s, 60s windows for each workload type
Establish decision boundaries with confidence intervals

RQ3: Adversarial Robustness

How robust is power-trace verification to adversarial manipulation?

Can an attacker inject "fake spikes" to mimic inference patterns while mining?
What is the power overhead of spike injection? Does it defeat the economic incentive?
Can dummy workloads mask mining within inference-like envelopes?
Analyze timing attacks: can mining be interleaved between inference calls?

RQ4: Error Rate Analysis

What are the false positive/negative rates for model identification?

False Positive: Legitimate inference incorrectly flagged as fraud
False Negative: Mining/substitution incorrectly accepted as valid inference
Establish ROC curves for different threshold configurations
Determine optimal operating points for Maxwell's risk tolerance

Can thermal signatures complement power traces for higher confidence?

Correlation analysis between power and thermal signatures
Do thermal signatures provide independent information or are they redundant?
Latency of thermal response vs power response (thermal inertia effects)
Combined classifier performance vs power-only or thermal-only

RQ6: Sampling Requirements

What sampling rate is needed for meaningful fingerprinting?

Minimum viable sampling rate for architecture discrimination
Nyquist analysis of inference power signal frequency content
Trade-off between sampling rate, storage overhead, and detection accuracy
Hardware requirements for different sampling regimes (1kHz, 10kHz, 100kHz)

Methodology

Phase 1: Data Collection (Weeks 1-4)

Infrastructure Setup

Deploy power monitoring on H100/A100 test cluster
Instrument NVIDIA NVML for power readings (default: 100ms resolution)
Configure high-frequency power sampling via external hardware (Keithley DAQ)
Set up thermal imaging for die-level heat mapping

Workload Matrix

Model	Sizes	Quantization	Batch Sizes
Llama	7B, 13B, 70B	FP16, INT8, INT4	1, 8, 32
Mistral	7B	FP16, INT8	1, 8
Stable Diffusion	XL	FP16	1, 4

Baseline Workloads

Cryptocurrency mining (ETH-style, BTC-style hash patterns)
Idle GPU with periodic wake
Random matrix operations (control)
Video transcoding (alternative compute workload)

Phase 2: Feature Engineering (Weeks 5-8)

Time-Domain Features

Mean, variance, skewness, kurtosis of power signal
Peak-to-trough ratio and frequency
Autocorrelation at multiple lags
Run-length encoding of high/low power states

Frequency-Domain Features

FFT spectral analysis
Dominant frequency identification
Spectral entropy
Wavelet decomposition for multi-scale analysis

Model-Specific Features

Attention layer detection (periodic high-power bursts)
MatMul signature (power envelope during matrix operations)
Memory-bound vs compute-bound phase detection
Token generation cadence (for autoregressive models)

Phase 3: Classifier Development (Weeks 9-12)

Model Architecture Classifier

Input: Power trace window (configurable: 1s, 5s, 30s)
Output: Probability distribution over known architectures
Approach: CNN on spectrogram + LSTM on time series (ensemble)

Binary Fraud Detector

Input: Power trace + declared model type
Output: P(legitimate inference | observed trace, declared model)
Approach: Anomaly detection with learned model-specific envelopes

Adversarial Training

Generate adversarial power patterns (spike injection, load masking)
Train robust classifiers against known attack strategies
Red team exercises with adversarial workload generation

Phase 4: Integration Architecture (Weeks 13-16)

Maxwell Integration Points

┌─────────────────────────────────────────────────────────┐
│                    Maxwell Hypervisor                    │
├─────────────────────────────────────────────────────────┤
│  Power Monitor Daemon                                    │
│  ├── NVML Interface (100ms default)                     │
│  ├── High-Freq DAQ Interface (optional, 10kHz)          │
│  └── Thermal Sensor Interface                           │
├─────────────────────────────────────────────────────────┤
│  Verification Engine                                     │
│  ├── Real-time Feature Extraction                       │
│  ├── Architecture Classifier                            │
│  ├── Anomaly Detector                                   │
│  └── Confidence Aggregator                              │
├─────────────────────────────────────────────────────────┤
│  Policy Enforcement                                      │
│  ├── Threshold Configuration                            │
│  ├── Alert Generation                                   │
│  └── Evidence Logging (for disputes)                    │
└─────────────────────────────────────────────────────────┘

Integration with Existing Verification Stack

Power-trace confidence as input to stochastic ZK spot-check trigger
Low power-trace confidence -> increase spot-check probability
Evidence preservation for dispute resolution

Deliverables

D1: Power Signature Database

Comprehensive database of power traces for:

10+ model architectures at multiple sizes
3+ quantization levels per model
Multiple batch sizes and sequence lengths
Baseline non-inference workloads (mining, transcoding, idle)

D2: Feature Library

Documented feature extraction library including:

Time-domain feature extractors
Frequency-domain analyzers
Model-specific signature detectors
Reference implementation in Python + CUDA

D3: Classification Models

Trained and validated models for:

Model architecture identification (multi-class)
Inference vs non-inference discrimination (binary)
Model size estimation (regression)
Adversarial-robust variants

D4: Integration Specification

Technical specification for Maxwell integration:

API definitions for power monitoring interface
Real-time classification service architecture
Confidence score interpretation guidelines
Recommended threshold configurations

D5: Security Analysis

Comprehensive adversarial analysis including:

Attack taxonomy for power-trace spoofing
Economic analysis of attack costs
Recommended countermeasures
Residual risk assessment

D6: Research Paper

Publication-ready paper for hardware security venue (e.g., USENIX Security, IEEE S&P) documenting:

Novel contribution to side-channel verification
Experimental methodology and results
Comparison with zkML and TEE approaches
Open challenges and future work

Success Criteria

Minimum Viable Success

Achieve >95% accuracy discriminating inference from mining
Achieve >80% accuracy identifying model family (Llama vs Mistral vs SD)
False positive rate <5% (legitimate inference not flagged)
Processing overhead <5% of inference time

Target Success

Achieve >99% accuracy discriminating inference from mining
Achieve >90% accuracy identifying specific model size (7B vs 13B vs 70B)
False positive rate <1%
Demonstrate robustness against 3+ adversarial attack strategies
Real-time classification latency <100ms

Stretch Goals

Detect model substitution (e.g., Llama-7B passed off as Llama-70B)
Identify quantization level from power trace alone
Multi-GPU workload decomposition
Transfer learning to new model architectures with minimal retraining

References

Side-Channel Analysis Foundations

Kocher, P. (1996). "Timing Attacks on Implementations of Diffie-Hellman, RSA, DSS, and Other Systems"
Kocher, P., Jaffe, J., & Jun, B. (1999). "Differential Power Analysis"
Mangard, S., Oswald, E., & Popp, T. (2007). "Power Analysis Attacks: Revealing the Secrets of Smart Cards"

GPU Power Characterization

Nagasaka, H., et al. (2010). "Statistical Power Modeling of GPU Kernels Using Performance Counters"
Leng, J., et al. (2013). "GPUWattch: Enabling Energy Optimizations in GPGPUs"
Arafa, Y., et al. (2019). "PPT-GPU: Scalable GPU Performance Modeling"

ML Workload Fingerprinting

Hua, W., et al. (2018). "Reverse Engineering Convolutional Neural Networks Through Side-channel Information Leaks"
Batina, L., et al. (2019). "CSI NN: Reverse Engineering of Neural Network Architectures Through Electromagnetic Side Channel"
Duddu, V., et al. (2019). "Stealing Neural Networks via Timing Side Channels"

Verification Approaches (Context)

Maxwell Internal: /Users/jordanwashburn/Workspace/orchard9/maxwell/research/proof-of-inference-verifiable-ai.md
EZKL Project: https://github.com/zkonduit/ezkl
Lagrange DeepProve-1: Distributed ZK proving for LLMs

Hardware Security

NVIDIA Confidential Computing Architecture (H100 DCAP)
Intel SGX Power Side-Channels (relevant attack surface)
AMD SEV Thermal Analysis

Research Priority: HIGHEST Estimated Duration: 16 weeks Required Resources: H100/A100 cluster access, high-frequency power monitoring hardware, thermal imaging equipment Classification: Maxwell Internal - Novel Research

13 KiB Raw Blame History