a month ago
Today's outage hit our production app and we found out from customer emails. Not from Railway, not from our dashboard, not from any alert, not from a webhook. Nothing. 30 paying customers reached out before we had any idea something was wrong.
The dashboard was showing services as active when they were already terminated. That's not just an outage, that's misinformation during a crisis. How are we supposed to trust what we're seeing?
I get that abuse detection is important but rolling out enforcement that nukes legitimate production workloads, including databases, is unacceptable. The dry run caught nothing. The staged rollout still hit 3% of the fleet. That's not staged, that's just slow.
No email, no webhook, no push notification to affected accounts. We had container webhooks set up and got nothing when our services were forcefully terminated. What's the point of having webhooks if they don't fire when Railway itself kills your workloads?
What's Railway doing about proactive customer notifications? We shouldn't have to find out from our users that our services are down. At minimum there should be instant alerts pushed to affected accounts when something like this happens.
We've been loyal customers but today seriously shook our confidence in the platform.
0 Replies