NET

Cloudflare Announces New High-Performance Infrastructure for AI Models

NET May 03, 2026

Cloudflare launched new infrastructure to run large AI language models across its global network. The system separates input processing (prefill) from output generation (decode) using optimized hardware configurations.

The architecture features a custom inference engine named Infire to manage GPUs. A secondary system called Unweight compresses AI model weights by 15-22% without sacrificing accuracy.

These advancements reduce memory usage to increase model speed and efficiency on the platform.

More NET News

Related News

Cloudflare Launches AI Security Program, Targeting Legacy Network Migration

🟢 Cloudflare Inc is trading 4.3% up today on analyst target hikes and AI roadmap optimism

Cloudflare Gains $250 Price Target, Driven by Surging AI Traffic

🟢 Cloudflare Inc is trading 4% up today as dip-buyers return amid broader tech rebound

🔴 Cloudflare Inc is trading 5.6% down today as profit-taking continues after recent surge