Tiled Hacker news on React Router

Cloudflare outage on February 20, 2026

121 points - today at 7:05 PM

Source

kgeist
today at 9:27 PM
It's something we debated in our team: if there's an API that returns data based on filters, what's the better behavior if no filters are provided - return everything or return nothing?
The consensus was that returning everything is rarely what's desired, for two reasons: first, if the system grows, allowing API users to return everything at once can be a problem both for our server (lots of data in RAM when fetching from the DB => OOM, and additional stress on the DB) and for the user (the same problem on their side). Second, it's easy to forget to specify filters, especially in cases like "let's delete something based on some filters."
So the standard practice now is to return nothing if no filters are provided, and we pay attention to it during code reviews. If the user does really want all the data, you can add pagination to your API. With pagination, it's very unlikely for the user to accidentally fetch everything because they must explicitly work with pagination tokens, etc.
Another option, if you don't want pagination, is to have a separate method named accordingly, like ListAllObjects, without any filters.
CommonGuy
today at 7:28 PM
Insufficient mock data in the staging environment? Like no BYOIP prefixes at all? Since even one prefix should have shown that it would be deleted by that subtask...
From all the recent outages, it sounds like Cloudflare is barely tested at all. Maybe they have lots of unit tests etc, but they do not seem to test their whole system... I get that their whole setup is vast, but even testing that subtask manually would have surfaced the bug
otar
today at 8:26 PM
Reliability was/is CF's label.
It's alarming already. Too many outages in the past months. CF should fix it, or it becomes unacceptable and people will leave the platform.
I really hope they will figure things out.
alansaber
today at 8:29 PM
Not sure why everyone is complaining, new MCP features are more important than uptime
blibble
today at 7:45 PM
is this blog post LLM generated?
the explanation makes no sense:
> Because the client is passing pending_delete with no value, the result of Query().Get(“pending_delete”) here will be an empty string (“”), so the API server interprets this as a request for all BYOIP prefixes instead of just those prefixes that were supposed to be removed. The system interpreted this as all returned prefixes being queued for deletion.
client:
```
     resp, err := d.doRequest(ctx, http.MethodGet, `/v1/prefixes?pending_delete`, nil)
```
server:
```
    if v := req.URL.Query().Get("pending_delete"); v != "" {
        // ignore other behavior and fetch pending objects from the ip_prefixes_deleted table
        prefixes, err := c.RO().IPPrefixes().FetchPrefixesPendingDeletion(ctx)
        if err != nil {
            api.RenderError(ctx, w, ErrInternalError)
            return
        }

        api.Render(ctx, w, http.StatusOK, renderIPPrefixAPIResponse(prefixes, nil))
        return
    }
```
even if the client had passed a value it would have still done exactly the same thing, as the value of "v" (or anything from the request) is not used in that block
atty
today at 7:29 PM
I do not work in the space at all, but it seems like Cloudflare has been having more network disruptions lately than they used to. To anyone who deals with this sort of thing, is that just recency bias?
NinjaTrance
today at 7:45 PM
The irony is that the outage was caused by a change from the "Code Orange: Fail Small initiative".
They definitely failed big this time.
anurag
today at 8:12 PM
The one redeeming feature of this failure is staged rollouts. As someone advertising routes through CF, we were quite happy to be spared from the initial 25%.
himata4113
today at 7:54 PM
This blog post is inaccurate, the prefixes were being revoked over and over - to keep your prefixes advertised you had to have a script that would readd them or else it would be withdrawn again. The way they seemed to word it is really dishonest.
boarush
today at 7:28 PM
While neither am I nor the company I work for directly impacted by this outage, I wonder how long can Cloudflare take these hits and keep apologizing for it. Truly appreciate them being transparent about it, but businesses care more about SLAs and uptime than the incident report.
dilyevsky
today at 8:37 PM
> Because the client is passing pending_delete with no value, the result of Query().Get(“pending_delete”) here will be an empty string (“”), so the API server interprets this as a request for all BYOIP prefixes instead of just those prefixes that were supposed to be removed.
Lmao, iirc long time ago Google's internal system had the same exact bug (treating empty as "all" in the delete call) that took down all their edges. Surprisingly there was little impact as traffic just routed through the next set of proxies.
jaboostin
today at 8:12 PM
Hindsight is 20/20 but why not dry run this change in production and monitor the logs/metrics before enabling it? Seems prudent for any new “delete something in prod” change.
ssiddharth
today at 7:49 PM
The eternal tech outage aphorism: It's always DNS, except for when it's BGP.
vimda
today at 9:08 PM
One has to wonder when the board realises Dane was a bad replacement for JGC. These outages are getting ridiculous
user205738
today at 9:09 PM
They should have rewritten this code in Rust using these brilliant language models. /jk
tokyobreakfast
today at 8:15 PM
Is this trend of oversharing code snippets and TMI postmortems done purposely to distract their customers from raging over the outage and the next impending fuckup?
wa008
today at 8:42 PM
This transparent report can earn my trust
today at 7:47 PM
djfobbz
today at 9:09 PM
I'm honestly amazed that a company CF's size doesn't have a neat little cluster of Mac Minis running OpenClaw and quietly taking care of this for them.
VirusNewbie
today at 8:05 PM
If you track large SaaS and Cloud uptime, it seem to correlate pretty highly with compensation for big companies. Is cloudflare getting top talent?
henning
today at 8:04 PM
Sure vibe-coded slop that has not been properly peer reviewed or tested prior to deployment is leading to major outages, but the point is they are producing lots of code. More code is good, that means you are a good programmer. Reading code would just slow things down.
NooneAtAll3
today at 8:25 PM
again?

Cloudflare outage on February 20, 2026

kgeist

alemanek

MobileVet

CommonGuy

zmj

dabinat

asciii

suhputt

martinald

otar

argestes

slothsarecool

pocksuppet

slothsarecool

Sanzig

slothsarecool

arcatech

ranger_danger

alansaber

blibble

subscribed

bretthoerner

blibble

jbxntuehineoh

bstsb

himata4113

atty

Icathian

NinjaTrance

jsheard

gtowey

blibble

ranger_danger

dana321

bonesss

brutalc

dakiol

samrus

sp00chy

tovej

hypeatei

renegade-otter

logicchains

Ylpertnodi

dazc

lysace

jacquesm

dazc

jgrahamc

SoKamil

brcmthrowaway

tedd4u

slophater

sebmellen

goalieca

slophater

lysace

__turbobrew__

a24446ff87

slophater

slophater

Betelbuddy

candiddevmike

NinjaTrance

anurag

himata4113

boarush

llama052

boarush

jacquesm

boarush

samrus

dilyevsky

jaboostin

ssiddharth

subscribed

vimda

user205738

tokyobreakfast

turbobrew