Those who have built on Cloudflare Workers know the wall: the moment code needs more than 128 MB of RAM or a real Linux toolchain, deployment gets ugly. With Containers GA since April 13, 2026, Cloudflare is moving that wall. Web devs who previously jumped to EC2, Fly.io, or Render get an edge layer in between, without Kubernetes clusters and without data center contracts.
May 10, 2026
Key Takeaways
- Active CPU pricing as a real lever: containers are billed only while the CPU actually burns cycles. Idle containers cost memory plus storage, not compute. For burst workloads, this is a different pricing model than AWS Fargate or Fly.io.
- Hostnames, Docker Hub, SSH: Workers address containers via service bindings, images come from Docker Hub or a registry, SSH works for live debugging. The stack feels like classic Linux, not like an edge sandbox.
- Gap between Workers and EC2: For headless browsers, ffmpeg, Pandoc, or small inference endpoints, Workers were too small and EC2 too heavy. Containers fill exactly this middle layer and reduce the stack jump to a single provider.
Related: Mini PCs displace 1U servers: Edge in the data center 2026 / Cloudflare Containers documentation
What Containers GA really brings
What is a Cloudflare Container? A Cloudflare Container is a short-lived Linux container that runs in the Cloudflare edge network and is addressed by a worker via service bindings. It lies functionally between a Workers function and a classic cloud VM and covers workloads that require more runtime, more memory, or a complete Linux toolchain.
The GA release brings three major improvements over the public beta. Active CPU pricing bills only the CPU cycles actually consumed, not wall-clock time. Concurrency limits allow thousands of simultaneously running containers per account. And service bindings address containers by hostname instead of IP, which removes DNS logic and discovery code from the worker.
In addition, there is Docker Hub support for direct image pulls, SSH for live debugging, and sandboxes as a sister product for AI agent workloads with persistent filesystem sessions. If you have a worker that needs a headless browser for screenshots or a Pandoc pipeline for PDFs, you now call a container via service binding instead of interposing an external API provider.
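As a sketch of what the service-binding call can look like from the Worker side, here is a minimal example. The binding name `PANDOC`, the `/convert` route, and the internal hostname are illustrative assumptions, not taken from the official docs:

```typescript
// Hypothetical Worker that forwards conversion requests to a container
// reached via a service binding. Binding name "PANDOC", the /convert
// route, and "pandoc.internal" are illustrative assumptions.

export interface Env {
  // The container binding exposes fetch() like any other Worker binding.
  PANDOC: { fetch(request: Request): Promise<Response> };
}

// Pure helper: rewrite the public URL so the container sees a stable
// internal hostname instead of the public one.
export function toContainerUrl(publicUrl: string, containerHost = "pandoc.internal"): string {
  const url = new URL(publicUrl);
  url.hostname = containerHost;
  url.protocol = "http:";
  url.port = "";
  return url.toString();
}

export default {
  async fetch(request: Request, env: Env): Promise<Response> {
    const url = new URL(request.url);
    if (url.pathname === "/convert") {
      // Hand the request to the container; latency is worker-to-container,
      // not worker-to-external-API.
      return env.PANDOC.fetch(new Request(toContainerUrl(request.url), request));
    }
    return new Response("not found", { status: 404 });
  },
};
```

The point of the helper is that discovery logic disappears: the Worker rewrites a hostname instead of resolving IPs or maintaining an endpoint list.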
Where the Edge Advantage Comes into Play, and Where It Doesn’t
Containers don’t run in every edge location; instead, they are started in the nearest available region when needed. For interactive web apps with DACH users, this usually means Frankfurt or Düsseldorf. Latencies between worker and container are thus in the single-digit millisecond range, eliminating the need to jump to a classic backend in Frankfurt or Dublin. The official platform documentation explicitly lists the supported regions and limits.
Where this really makes a difference is in image and PDF processing. A worker calling a container with ImageMagick saves a hop to a Lambda or render service, plus its cold start. Similarly, for headless browser workloads: Playwright in a container next to the worker delivers screenshots in 700 to 1,200 ms total latency, whereas the detour via an external browserless endpoint often takes twice as long.
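The container side of such an image pipeline can be a plain Node HTTP service that shells out to ImageMagick. The port, the query parameters, and the fixed file paths below are illustrative assumptions, not from the Cloudflare docs:

```typescript
// Sketch of the service running *inside* the container: a plain Node
// HTTP server shelling out to ImageMagick's `convert`. Port, query
// parameters, and file paths are illustrative assumptions.
import { createServer } from "node:http";
import { execFile } from "node:child_process";

// Pure helper: build the `convert` argument list for a resize.
export function buildConvertArgs(src: string, dest: string, width: number, height: number): string[] {
  return [src, "-resize", `${width}x${height}`, dest];
}

export function startServer(port = 8080) {
  return createServer((req, res) => {
    const url = new URL(req.url ?? "/", "http://localhost");
    const w = Number(url.searchParams.get("w") ?? 800);
    const h = Number(url.searchParams.get("h") ?? 600);
    // In a real service the input would come from the request body,
    // not a fixed path in /tmp.
    execFile("convert", buildConvertArgs("/tmp/in.png", "/tmp/out.png", w, h), (err) => {
      res.statusCode = err ? 500 : 200;
      res.end(err ? "convert failed" : "ok");
    });
  }).listen(port);
}
```

Nothing here is Cloudflare-specific, which is the appeal: the same image runs locally under Docker and in the container next to the worker.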
What containers can’t replace are long-running services with their own state. A Postgres instance still belongs on a database platform, and a Kafka broker on a VM. Containers are short-lived, shut down when inactive, and start cold. Those who forget this build architectures that produce surprises on the monthly bill.
Workers, Containers, EC2: What to Use When
When Containers Are Suitable
- Headless browsers, Pandoc, ffmpeg, Tesseract next to a worker
- Small inference endpoints that don’t fit into a Workers AI function
- CLI tools that require a full Linux environment
- Burst workloads with long idle phases, thanks to active CPU pricing
When Not to Use Containers
- Databases or other long-running services with local state
- Workloads that require fixed region guarantees (compliance)
- GPU inference for large models; Workers AI or a hyperscaler is better suited here
- Existing stacks that already run on Kubernetes and are consolidated
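Several of the suitable workloads above boil down to shipping a Linux CLI next to a worker. As an illustrative sketch, a minimal Pandoc image could look like this; the base image, package choice, and server script are assumptions, not an official Cloudflare template:

```dockerfile
# Illustrative Dockerfile for a Pandoc sidecar container.
# Base image and server.js are assumptions, not an official template.
FROM node:22-slim

# Install pandoc from the Debian repos; a texlive package would be
# needed on top for PDF output.
RUN apt-get update && apt-get install -y --no-install-recommends pandoc \
    && rm -rf /var/lib/apt/lists/*

WORKDIR /app
COPY server.js .

# The container listens on a fixed port that the worker addresses
# via its service binding.
EXPOSE 8080
CMD ["node", "server.js"]
```

Since Docker Hub pulls are supported, the same image can be tested locally with `docker run` before it is ever deployed.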
A 60-Day Plan for DACH Teams
Anyone who wants to seriously examine the stack doesn’t start with a migration, but with a specific workload that’s currently causing pain.
Frequently Asked Questions
Do I need a paid Cloudflare plan for containers?
Yes, containers run on the Workers Paid Plan, which starts at $5 per month. Active CPU pricing is added on top, billed by the second. The Paid Plan is sufficient for pure testing, but for productive setups with high load, Workers Enterprise is recommended due to better limits and support SLAs.
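To see why active CPU pricing matters for burst workloads, a back-of-the-envelope comparison helps. The rates below are deliberately hypothetical placeholders (expressed in micro-dollars per CPU-second to keep the arithmetic exact); check Cloudflare's current pricing page for real numbers:

```typescript
// Back-of-the-envelope comparison of wall-clock vs. active-CPU billing
// for a bursty workload. All rates are HYPOTHETICAL placeholders.

interface Workload {
  uptimeSeconds: number;    // how long the container is provisioned
  activeCpuSeconds: number; // how long the CPU actually burns cycles
}

// Wall-clock model: pay compute for every provisioned second.
export function wallClockCost(w: Workload, rateMicroUsdPerSecond: number): number {
  return w.uptimeSeconds * rateMicroUsdPerSecond;
}

// Active-CPU model: pay compute only for busy seconds.
export function activeCpuCost(w: Workload, rateMicroUsdPerSecond: number): number {
  return w.activeCpuSeconds * rateMicroUsdPerSecond;
}

// Example: a screenshot service provisioned for an hour, but busy for
// only three minutes of it.
const burst: Workload = { uptimeSeconds: 3600, activeCpuSeconds: 180 };
const rate = 20; // hypothetical rate: micro-dollars per CPU-second

console.log(wallClockCost(burst, rate)); // 72000 micro-dollars
console.log(activeCpuCost(burst, rate)); // 3600 micro-dollars, 20x less
```

The ratio between the two models is simply uptime divided by busy time, which is why idle-heavy burst workloads benefit most.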
Can I use my existing Docker images directly?
In most cases, yes. Cloudflare supports Docker Hub pulls and private registries, and image sizes of several gigabytes are possible. Images that rely on privileged mode or special kernel modules will not work, however, and neither will workloads with GPU requirements beyond what Cloudflare Containers currently offers.
How does the stack relate to GDPR and data residency?
Cloudflare offers region affinity options, allowing containers to be pinned to EU locations. For organizations requiring strict data residency (public sector, banks), review the Enterprise configuration and obtain written assurance. Standard setups pragmatically land in Frankfurt or Amsterdam, which suffices for most DACH workloads.
About the Author
Adrian Garcia-Kunz is a Web Developer at Evernine. He comes from the frontend stack but knows when a worker or lambda is no longer sufficient. He dislikes stack fashion trends and favors tools that still work six months later.
Source of title image: AI-generated with Google Imagen 4 Fast, SynthID-verified