Gap: Performance Benchmarks & SLOs

Status: 🔴 Not Started
Priority: Important (Pre-Scale)
Estimated Effort: 2-3 days

What's Missing

Concrete performance targets and benchmarks, including:

  • Response Time Targets

    • p50, p95, p99 targets per endpoint category
    • Read vs. write operation targets
    • Complex operation targets (PDF generation, segment evaluation)
  • Throughput Targets

    • Requests per second by endpoint
    • Concurrent user capacity
    • Database queries per second
  • Capacity Planning

    • Users per organization limits
    • Appointments per day capacity
    • Form submissions per day capacity
    • Concurrent videocalls supported
  • Database Benchmarks

    • Query performance baselines
    • Connection pool sizing formulas
    • Index coverage targets
    • Query timeout justifications
  • Scaling Triggers

    • When to add read replicas (concrete metrics)
    • When to scale vertically (CPU/memory thresholds)
    • When to move to Phase 2/3/4 infrastructure
  • Load Testing Results

    • k6 test scenarios and results
    • Bottleneck identification
    • Breaking point analysis
    • Recommendations from load tests
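
As a starting point for the response time items above, the following is a minimal sketch of how p50/p95/p99 could be derived from raw latency samples and compared against targets. It assumes the nearest-rank percentile definition (one of several in common use) and latencies in milliseconds. The function names are hypothetical, and the only target shown is the < 500ms p95 example already noted under Current State; real per-category targets are still to be defined.

```python
def percentile(samples_ms, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are less than or equal to it."""
    if not samples_ms:
        raise ValueError("at least one sample is required")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range (0, 100]")
    ordered = sorted(samples_ms)
    # Integer ceiling of pct * n / 100 avoids floating-point rounding surprises.
    rank = (pct * len(ordered) + 99) // 100  # 1-based rank
    return ordered[rank - 1]


def summarize(samples_ms):
    """Return the p50, p95 and p99 latency for one endpoint category."""
    return {f"p{p}": percentile(samples_ms, p) for p in (50, 95, 99)}


def violations(summary, targets_ms):
    """List the percentiles that miss their target.

    A target of 500 for "p95" is read as "p95 must be below 500 ms",
    so a value equal to the limit counts as a violation.
    """
    return [name for name, limit in targets_ms.items() if summary[name] >= limit]


if __name__ == "__main__":
    # Placeholder samples for illustration only, not measured data.
    samples = [120, 135, 150, 180, 210, 240, 300, 420, 480, 650]
    summary = summarize(samples)
    print(summary, violations(summary, {"p95": 500}))
```

Running the same summary per endpoint category (read, write, complex operations) would give the baseline table that the benchmarks call for.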

Why Important

  1. SLOs - Cannot guarantee 99.5% uptime without targets
  2. Capacity Planning - Need to know when to scale
  3. Performance Regression - Need baselines to detect slowdowns
  4. Customer Expectations - White-label / negotiated-contract customers need performance guarantees

Current State

  • Monitoring targets exist (e.g., < 500ms p95)
  • Scaling phases documented
  • No concrete benchmarks
  • No load testing results

To close this gap, create a docs/performance/ directory with:

  • README.md - Performance overview
  • benchmarks.md - Baseline performance metrics
  • slos.md - Service level objectives
  • load-testing.md - Load testing methodology and results
  • capacity-planning.md - Capacity planning formulas
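
The capacity planning and connection pool sizing formulas are not yet specified. One common starting point, sketched below under stated assumptions, is Little's Law: the average number of in-flight queries equals the query arrival rate multiplied by the average query duration. The headroom multiplier, the function names, and the example inputs are all hypothetical and would need to be replaced with measured values once benchmarks exist.

```python
import math


def required_connections(queries_per_second, avg_query_ms, headroom=1.5):
    """Estimate total database connections needed via Little's Law.

    in_flight = arrival rate (queries/s) * mean time per query (s)
    The headroom factor (an assumption, not a measured value) leaves
    room for bursts above the average load.
    """
    if queries_per_second < 0 or avg_query_ms < 0:
        raise ValueError("rate and duration must be non-negative")
    if headroom < 1:
        raise ValueError("headroom must be at least 1")
    in_flight = queries_per_second * avg_query_ms / 1000
    return max(1, math.ceil(in_flight * headroom))


def per_instance_pool_size(total_connections, app_instances):
    """Split the total connection budget evenly across app instances,
    rounding up so the combined pools cover the estimate."""
    if app_instances < 1:
        raise ValueError("at least one app instance is required")
    return math.ceil(total_connections / app_instances)


if __name__ == "__main__":
    # Placeholder inputs for illustration only.
    total = required_connections(queries_per_second=200, avg_query_ms=50)
    print(total, per_instance_pool_size(total, app_instances=4))
```

An estimate like this is only a lower bound on what the pool needs. The database's own connection limit and the results of load testing should decide the final figure.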

Success Criteria

  • [ ] Baseline benchmarks established for all endpoint categories
  • [ ] p50/p95/p99 targets defined
  • [ ] Throughput targets documented
  • [ ] Load testing scenarios executed
  • [ ] Scaling triggers quantified
  • [ ] Capacity planning formulas documented