Akamai acquires LayerX, delivering end-to-end security and real-time AI usage control to any browser. Get details

Back Products Close

Cloud Computing

Cybersecurity

Content Delivery

See all products

Our Infrastructure

Global Services

Back Cloud Computing Close

Artificial intelligence (AI)

Akamai Inference Cloud

Storage

Object Storage

Block Storage

Backups

Databases

Managed Databases

compute

GPU

CPU

Kubernetes

App Platform

Accelerated Compute

Serverless

Akamai Functions

Networking

Cloud Firewall

DNS Manager

NodeBalancers

Private Networking

View cloud pricing

Explore plans and pricing that fit your needs — from small projects to global-scale deployments.

See pricing

Get started with Akamai Cloud

Sign up today and unlock cloud computing, edge, and AI tools built for your business.

Sign up

See all Cloud Computing

Back Cybersecurity Close

app and api security

API Security

App & API Protector

Firewall for AI

Client-Side Protection & Compliance

Bot & Agent Control

Account Protector

Content Protector

Bot Manager

AI Brand Presence

Segmentation

Akamai Guardicore Segmentation

zero trust security

Akamai Workforce Protector (formerly LayerX)

Secure Internet Access

Enterprise Application Access

Akamai MFA

Identity, Credential, and Access Management

infrastructure security

Edge DNS

Prolexic

IP Accelerator

DNS Posture Management

Brand Guardian

Get started with Security

Protect the applications that drive your business — every day, every time.

Contact Sales

See all Cybersecurity

Back Content Delivery Close

Application performance

Ion

API Acceleration

IP Accelerator

Media Delivery

Adaptive Media Delivery

Download Delivery

Edge Applications

EdgeWorkers

EdgeKV

Image & Video Manager

Media Services Live

Cloudlets

Cloud Wrapper

Global Traffic Management

Monitoring, reporting and testing

Data Stream

mPulse

CloudTest

Get started with Content Delivery

Trust the agility and scale of Akamai to help you flawlessly deliver extraordinary digital experiences.

Contact Sales

See all Content Delivery

Back Solutions Close

Cloud Computing

Serverless

Media

SaaS

Gaming

See all Cloud Computing

security

Frontier AI Security Risks

Akamai Application Protection Platform

Cybersecurity Compliance

Ransomware Protection

Secure Apps and APIs

DNS Delivery and Security

Zero Trust

DDoS Protection

Bot & Agent Control

Identity, Credential and Access Management

See all Cybersecurity

content delivery

App and API Performance

Media Delivery

See all Content Delivery

industry solutions

Media and Entertainment

Retail, Travel, and Hospitality

Financial Services

Healthcare and Life Sciences

Public Sector

Defense

Games

Online Sports Betting and iGaming

Service Providers

See all Industry Solutions

Back Pricing Close

Security and Delivery

Get started

Contact Sales

Free trials

Cloud pricing

GLOBAL PRICING

North America pricing

Europe pricing

Asia Pacific pricing

South America pricing

SPECIFIC LOCAL PRICING

Jakarta pricing

See all pricing

Cloud pricing

Try Akamai Cloud with US$100 in credits*

Deploy faster with global cloud infrastructure — no surprise bills, no lock-in, and transparent pricing across every data center.

Try now

*See Promotion Redemption Rules & Conditions

Back Developers Close

Cloud developers

Developer hub

Akamai GitHub repo

docs and guides

Cloud docs

Guides and tutorials

cloud marketplace

Developer apps

Get started with Akamai Cloud

Sign up today and unlock cloud computing, edge and AI tools built for your business.

Sign up

Back Resources Close

What’s new

Akamai blog

Events and workshops

Learning

White papers, ebooks, videos, product briefs

Customer stories

Training and certifications

Cybersecurity Research

Akamai Security Intelligence Group (SIG)

State of Internet (SOTI) reports

Partners

Partner with Akamai to innovate, scale, and grow your advantage

Channel Partners

Partner Portal

Partner Stories

Technology Partners

Technology Partners Directory

Log in

Back Log in Close

Cloud Manager
Manage your cloud computing services

Back Log in Close

Control Center
Manage your security and delivery services
- Docs
- Sales
- Support
- Under Attack ?
English
Back Language Close
- English
- Deutsch
- Español
- Français
- Italiano
- Português
- 中文
- 日本語
- 한국어

Create account

Under Attack?

Akamai Cloud

Akamai Security and Delivery

Connect with our Sales team to discuss your business needs and find the right solutions.

Contact Sales

The Cloud Giants Are Architecting an Agentic Future They Can’t Run

Jun 26, 2026

Jon Alexander

Written by

Jon Alexander

Jon Alexander is Senior Vice President of Product for the Cloud Technology Group at Akamai. He is responsible for the strategy, roadmap, and success of the cloud computing and delivery products. Jon joined Akamai in 2017 and led various product teams inside Akamai, starting within the media division. Previously, he worked in several roles focused on building large-scale internet infrastructure. Jon spent 10 years running the media business at one of the world’s largest telecommunications carriers and has led product teams at start-ups as they defined, launched, and grew new solutions. He is passionate about fostering innovation and building customer-centric product teams. He holds a Master of Arts degree and a Master of Engineering degree from Cambridge University.

The tech industry is currently selling an architectural illusion: the belief that highly transactional, fluid agentic workflows can be successfully deployed over a legacy, centralized footprint designed for batch processing. We are optimizing for the wrong variable.

While the market obsesses over token-generation speeds — the silicon math — it is ignoring the reality of physical geography. If you want to scale intelligence, you have to stop dragging data back to the core. It’s time to decouple the geographic location of centralized model training from the distributed network required for edge execution.

Why centralization is failing AI workflows

For two decades, cloud strategy obeyed a single law: consolidate. We consolidated branch offices and replaced local servers to feed massive, centralized data centers. It was a rational response to static content delivery, but as AI shifts from lab training and simple chatbots to real-world agentic inference, the physics of compute have flipped. Centralization is no longer an optimization; it is an anchor.

The industry is currently treating AI inference as if it were just another cloud native workload to be crammed into existing hyperclusters. It is not. Inference is an entirely different species of compute. By continuing to route real-world AI traffic through centralized design patterns, enterprises are actively building on top of an architecture that was explicitly engineered for yesterday's problems.

The architectural shift is unavoidable. Inference demands a distributed fabric that meets applications where decisions actually happen — not 2,000 miles away in a proprietary availability zone.

Extraction vs. execution: A bottleneck built by design

Centralized architectures have one job: brute force batch processing and training foundation models with trillions of weights. If you're running an overnight job, a dense GPU cluster in the core is fine. For real-time automation, that model is a non-starter.

The issue is forcing a system built for extraction to power interactive intelligence. Modern clouds pull data from the edge — users, factories, and retail floors — and haul it back to a hub. Hyperscalers engineered data gravity to lock you in, making it free to enter but expensive to leave.

This extractive model worked for training. Training is a gravity problem; it needs localized data and massive power. Inference, however, is a distribution problem. Agentic inference is the operational moment when the model interacts with a live business process. It is transactional, unpredictable, and context dependent.

When you force agentic workflows into a centralized architecture, you are asking a collection system to function as an execution system. The result is massive misallocation of compute resources. Moving contextual data across a network to make a microdecision that should happen only feet away is, to put it bluntly, architectural malpractice.

The myth of the “local” cloud

Hyperscalers know that their footprint is the bottleneck. Their “solution”? Regional zones and local cloud extensions. They want you to buy more proprietary racks to extend their legacy control plane. It’s a static, physical answer to a fluid software problem.

Infrastructure must move with the workload. An autonomous inventory system or a fleet of vehicles can't wait for a round-trip to Virginia. Proximity to the decision point is now a make-or-break factor for inference, yet nearly half of organizations remain anchored to single cloud regions.

Legacy platforms force a binary choice: Accept the structural penalties of distance and transit costs, or undertake the complex, manual nightmare of replicating your stack across fixed zones:

If you choose centralization, you accept network congestion and latency.
If you choose manual replication, you wind up overprovisioning expensive, specialized silicon across dozens of locations just to handle occasional spikes.
Both paths are a direct consequence of a rigid, legacy design philosophy that ignores the speed of light.

Decoupling architecture from geography

The solution to this structural deadlock is not abandoning the centralized cloud entirely — it is changing how we orchestrate workloads across it.

In a true continuum, architecture is decoupled from geography. The infrastructure acts as a single, fluid fabric that spans from heavy core data centers to regional clusters, all the way to the true network edge. Instead of dragging data back to the compute, the infrastructure intelligently pushes the intelligence out to the data.

This distributed approach changes how we allocate resources:

Asymmetric compute matching: Stop forcing every AI workload onto a GPU. Keep heavy reasoning at the core, but execute orchestration and tool-driven actions on the ubiquitous CPUs already at the edge.
Fluid workload routing: Orchestrate based on network state and user proximity, not pre-determined availability zones. If the user moves, the compute moves.
Zero-penalty data locality: Process data where it makes sense. Bypass synthetic egress walls designed to trap your data. Application performance must dictate architecture, not metered toll booths.

The history of network architecture is the history of moving resources closer to the user. The need to distribute logic is straining centralized cloud architecture just as the need to distribute content once brought the web to its knees. In both cases, the architectural solution was distribution.

Yet, we cannot architect a distributed future while benchmarking success by centralized standards. If you assess an AI platform solely on GPU count or token speed, you are failing at the math.

Stop measuring the wrong things: Three nonnegotiables for agentic scale

Before deploying real-world inference, demand these three baseline technical requirements:

Hardware-located portability: Can your stack deploy across edge nodes and multiple clouds without a total rewrite? If your tools are tethered to one provider's control plane, you've built for lock-in, not speed.
Edge elasticity: Don't pay for fixed GPU capacity that sits idle. A scalable architecture handles spikes dynamically by using a hybrid of CPU and distributed resources.
Transit autonomy: Eliminate financial penalties for moving data between zones. Predictable movement ensures that architecture follows performance, not vendor-dictated tax brackets.

Beyond the limits of the core

The centralized cloud served us well for the last 20 years. It will continue to serve us for the next 20. But it won’t be at the forefront where the needs of inference stretch beyond its geographic and Moore's law limits. We are long past the point in time when we can treat centralization and distribution as an either/or choice.

The future is being built on systems that intelligently route compute to where compute actually needs to live and where it creates the most value, not just where it’s easiest to host.

Read the report

Jun 26, 2026

Jon Alexander

Written by

Jon Alexander

View cloud pricing

Get started with Akamai Cloud

Get started with Security

Get started with Content Delivery

Try Akamai Cloud with US$100 in credits*

Get started with Akamai Cloud

Partners

Akamai Cloud

Akamai Security and Delivery

The Cloud Giants Are Architecting an Agentic Future They Can’t Run

Why centralization is failing AI workflows

Extraction vs. execution: A bottleneck built by design

The myth of the “local” cloud

Decoupling architecture from geography

Stop measuring the wrong things: Three nonnegotiables for agentic scale

Beyond the limits of the core

Related Blog Posts