Cache hierarchies: CDN → gateway → app → DB

A layered cache turns latency into throughput. Start at the edge (CDN), then the gateway, then your app (in‑memory/Redis), and finally database query/result caches.

Design principles

Metrics to watch