What the ~$3.0K/mo production floor buys (pilot scale)
| Component | $ / mo | What it covers |
|---|---|---|
| Compute — AKS / Container Apps | ~$600 | Hosts the ~12 .NET services; orchestration in-process over the event spine |
| Azure Data Explorer (ADX) | ~$1,200 | Production time-series cluster — every event + the LLM / decision audit trail |
| Event Hubs (Standard) | ~$100 | The event spine — POS, traffic, inventory, sensors, video metadata |
| Cosmos DB (provisioned) | ~$250 | Operational records, agent memory, tenant / store context |
| Cache for Redis (Standard C2) | ~$115 | Hot state, cooldowns, the statistical anomaly baselines |
| Azure AI Search (S1) | ~$250 | Vector index for RAG — store & enterprise knowledge base |
| Blob storage | ~$80 | Canonical audit — full LLM prompts / responses retained (SOC 2) |
| Platform overhead | ~$400 | Monitoring, Key Vault, identity, networking, egress, contingency |
| Production floor | ~$3.0K | ≈ $36K / yr at pilot |
What the floor carries — at near-zero marginal cost
The event spine already ingests POS, inventory, people-counters, sensors, weather, supply-chain & staffing. These structured streams are tiny next to 12M video frames/store·mo — video is ~90% of per-store cost, so the retail data rides essentially free on the same infrastructure.
Scales step-wise, not linearly
Floor grows by tier bumps (AI Search, ADX, Event Hubs Premium) as you add stores — so the per-store share of the floor keeps falling.
Not in this estimate — budget separately
Total cost of ownership — scaling with store count
| 1 storepilot | 5 storesexpansion | 10 storespilot target | → 100fleet · curve | |
|---|---|---|---|---|
| Platform floor /mo (slide 1) | $3.0K | $3.3K | $4.0K | $18K |
| Per-store variable (~$1.7K/store·mo) | $1.7K | $8.4K | $16.7K | $167K |
| Total / month | ~$4.7K | ~$11.7K | ~$20.7K | ~$185K |
| Budget / year plan on this | ≈ $56K | ≈ $140K | ≈ $250K | ≈ $2.2M |
| ↓ Optimized target / year | ≈ $44K | ≈ $81K | ≈ $130K | ≈ $1.0M |
Per-store cost stack — conservative (~$1.7K/mo)
Optimization headroom — upside, not budgeted
The 3-stage cascade (clip batching → Gemini Flash + edge-CV) is ~half — held as upside until a footage spike validates Flash recall. Cost can also rise: 24/7 cameras ~2×, higher frame rate ~6×.