Edge Decisioning
What it is. The capability to execute personalization or activation logic at CDN edge nodes — geographically distributed Points of Presence (PoPs) operated by edge cloud platforms — rather than making a round-trip to a central application server or data center. The goal is sub-100ms signal-to-activation latency at the point where the user's request first contacts the platform's infrastructure.
Two implementation tiers.
The appropriate tier depends on whether the decisioning logic can be pre-encoded as rules or requires live ML model execution:
-
Rule-based edge compute (vendor.fastly Compute@Edge, Cloudflare Workers, AWS Lambda@Edge): Execute WebAssembly-compiled business logic at the nearest PoP. Fastly Compute runs code in microseconds with no cold starts; supports JavaScript, Rust, Go, and C++. Integrates with Key Value Store (instant data access for pre-pushed profile attributes) and Fanout (real-time personalized messaging at scale). Appropriate when decisioning rules are deterministic — segment flag lookups, A/B test assignment, content routing based on pre-positioned audience attributes. Cannot execute an ML model live; model outputs must be pre-computed and pushed to edge key-value stores before the request arrives.
-
Edge AI inference (vendor.akamai Inference Cloud): Execute transformer-class ML models directly at CDN PoPs. Sub-millisecond inference latency (NVIDIA-backed, launched October 2025). Appropriate when decisioning logic is ML-based and cannot be pre-computed — intent classification on the live request context, next-best-message selection from a generative model, real-time fraud scoring at checkout before the page renders.
What this capability requires (and does not provide). Edge decisioning is the execution layer only. The CDW or packaged CDP must pre-compute profile attributes and segment membership and push them to edge key-value stores (for rule-based execution) or edge inference caches (for AI inference). Neither vendor.fastly nor vendor.akamai is a CDP — they do not own identity resolution, segmentation, or profile management.
Where it fits in the latency tier map. Edge decisioning serves latency-tier.intra-page (sub-100ms, page-render personalization) and latency-tier.inter-page (sub-10s, next-page decisioning based on prior-page signals). It is not the correct layer for intra-session (seconds to minutes) or inter-session use cases — those belong in platform-bundled or CDW-native decisioning. See concept.real-time-decisioning Layer 1 taxonomy.
Organizational fit signal. Relevant to organizations with web-first customer experiences and documented sub-100ms latency requirements. Mobile-app-only organizations have no page-render boundary and should route to in-app messaging or push channels instead. Teams without a backend data capability able to pre-position attributes at the edge will not realize the capability's value — the CDW pre-compute pipeline is a prerequisite.