Saumya Majumder, lead software engineer at BigScoots, joined the Jukebox Podcast to discuss a recent major Cloudflare outage and how BigScoots designs high-performance WordPress architectures using Cloudflare Enterprise. Saumya specializes in high-performance WordPress engineering, edge computing, worker-based automations, custom caching engines and migration tools. He oversees enterprise customers and develops scalable, developer-friendly hosting solutions.
Why outages happen
Saumya explains that large internet services like Cloudflare are immensely complex systems with many interdependent components. Small, rare edge-case failures can cascade into large outages despite tests and reviews. When a critical lower-layer component fails, higher-level services that depend on it can be affected. He emphasizes that all large providers—Cloudflare, AWS, Google Cloud, Azure—can and do experience outages. These incidents are costly in terms of reputation and service credits to enterprise customers, so providers work hard to remediate and learn from failures.
Cloudflare’s scale and response
Cloudflare is more than a CDN; it provides a broad developer platform and many security and networking services. Because of its scale, fixes and configuration changes must propagate across a very large, global control plane and edge network, which can take time and create a ripple effect as traffic and state stabilize. Saumya praises Cloudflare’s transparency—postmortems that explain root causes and remedial steps—and their ongoing work to remove single points of failure (for example, previous work to remove dependencies on external services like GCP for KV).
How BigScoots mitigated the recent outage
During the outage, Cloudflare’s public-facing services like Turnstile and many websites (including X and ChatGPT) were impacted. BigScoots had automated responses in place: their tooling used Cloudflare’s API to turn off the Cloudflare proxy for affected customers, routing traffic directly to BigScoots’ origin servers until Cloudflare recovered. Because the Cloudflare API remained available, this rapid failover reduced actual customer downtime for sites hosted at BigScoots that normally rely on Cloudflare as a proxy. Saumya notes that such mitigations work when Cloudflare is used as a proxy; sites fully hosted on Cloudflare Workers or using Cloudflare as their origin would still be affected differently.
CDN-level page caching and history
Saumya describes how traditional WordPress caching worked—server-side caches (advanced-cache.php) storing HTML on origin to avoid reprocessing. Static assets were later moved to CDNs, but Saumya and a collaborator pioneered CDN-level HTML page caching (the Super Page Cache for Cloudflare plugin, later acquired). The idea: serve full HTML from the CDN PoP closest to the user, avoiding round trips to a distant origin and drastically reducing latency.
Cloudflare Enterprise and tiered caching
Cloudflare Enterprise unlocks advanced caching behaviors. Key benefits include high cache-hit ratios, longer-lived cache entries, and tiered/regional caching. Saumya explains tiered caching: PoPs are arranged in tiers so a lower-tier PoP (near the user) will first ask upper tiers within Cloudflare’s private backbone if a resource is cached, before querying the origin. This internal Cloudflare intranet (private backbone) is faster and avoids origin requests, lowering load and improving global response times.
Private connectivity and CNI
BigScoots runs their own data centers and has set up direct physical connections (CNI) to Cloudflare, effectively plugging BigScoots’ network into Cloudflare’s intranet. Instead of Cloudflare fetching from origins over the public internet (which can be slower and subject to congestion), Cloudflare can pull content over the private fiber connection directly to BigScoots’ origin. This reduces latency further and improves reliability and performance for cache misses.
BigScoots Cache plugin and platform features
BigScoots developed a proprietary plugin—BigScoots Cache—that integrates deeply with Cloudflare Enterprise caching while exposing fine-grained controls to users and developers:
– Intelligent cache purging: when content is published or updated, the plugin purges the affected page and related pages (archives, taxonomy, authors) automatically.
– Hooks and REST API: advanced users can customize TTLs, purge logic, and integrate cache control into external workflows (e.g., clear cache from a Laravel backend when inventory changes).
– Managed assistance: BigScoots will implement custom snippets or purge logic for customers who can’t or don’t want to configure advanced behavior themselves.
– Portal controls: BigScoots exposes a broad set of toggles for Cloudflare Enterprise features—WAF/login protection, image optimization, Rocket Loader, blocking/challenging bots, geo-blocking, and fine-tuned security rules—so customers can configure or ask support to configure protections and performance settings.
Who benefits
While BigScoots offers advanced features useful to agencies and enterprise customers, Saumya stresses that many smaller site owners also benefit. Controls are presented simply in the portal for common tasks (e.g., block or challenge traffic from a specific country), and managed support can implement changes for less technical customers. BigScoots provides systematic onboarding for enterprise migrations, aiming for zero-downtime migrations and performance optimization as part of managed packages.
Operational mindset and engineering trade-offs
Saumya reiterates that engineering is about managing trade-offs: it’s impractical to engineer for every 0.00001% edge case at all times, so providers prioritize. When outages occur, providers analyze and remove root causes and dependencies to reduce recurrence frequency. He encourages realism: outages are part of running large distributed systems; the goal is to mitigate impact, be transparent about incidents, and learn to prevent similar problems.
Closing
The episode explains how global infrastructure failures can ripple across the web, how Cloudflare’s scale contributes to both the impact and the complexity of fixes, and how BigScoots leverages deep Cloudflare Enterprise integration, private networking, and custom caching to reduce latency and improve reliability for WordPress sites. Saumya emphasizes transparency, continuous improvement, and managed services to help customers at all levels benefit from advanced caching and security features. For more details and links related to the episode, visit wptavern.com and search for this podcast episode.
