2 months ago
Got the outbound traffic maintenance email at 17:22 UTC. It's been down for more than an hour already.
Service is still serving 502s / ETIMEDOUT. What can I do on my side, if anything?
19 Replies
@Brody could you confirm if this is a Railway issue or a Cloudflare issue?
2 months ago
Neither.
2 months ago
Have you redeployed your service?
we started having issues 2 hours ago but no luck understanding what's going on, no recent deployments. DBs are working fine
just these logs showing up
Attachments
2 months ago
Please do then.
2 months ago
AWS shut outbound off for the host you were on.
ok cool, followup Qs:
- aren't we on Railway metal? or just for building the containers?
- can't we detect and auto-redeploy once issues like this happen?
2 months ago
Railway metal isn't relevant anymore, services may be put on GCP or AWS depending on availability at any given time, same performance and same cost, so no worries there.
This is something we are already talking to our AWS rep about, it shouldn't happen again.
ok so nothing much to do on our end? should we just expect improvements happening from Railway and other providers?
2 months ago
Correct, we are always improving something!
we love Railway, but unfortunately these incidents are very costly with our customers
hope Railway is also thinking about some "auto-resolve" system, so that after these incidents things go back to normal rather than relying on manual intervention
2 months ago
really +1 on this, shouldn't have to manually do things to work around incidents
a day ago
We're aware of intermittent outbound connectivity drops affecting some services, where outbound connections fail while internal networking and DNS keep working, and we're actively investigating. If you hit it, the quickest workaround is to redeploy, which moves your service onto a fresh host and restores connectivity. If it happens again, reply here with the rough time it started and we'll jump on it while it's live. Those real-time reports are exactly what help us track down the cause.
Status changed to Awaiting User Response angelo-railway • 1 day ago
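For anyone who wants to catch this from inside the service rather than from customer reports, here is a minimal sketch of an outbound watchdog in Python. It is not something Railway provides. It assumes the symptom described above (outbound connections fail while internal networking keeps working) and probes a few external TCP endpoints. The names `OutboundWatchdog`, `outbound_ok`, `tcp_connect` and the `on_unhealthy` callback are made up for illustration, and so are the target hosts you would pass in.

```python
import socket
from typing import Callable, Iterable, Optional, Tuple

Target = Tuple[str, int]  # (host, port) of an external endpoint; pick your own


def tcp_connect(target: Target, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to the target succeeds within timeout."""
    try:
        with socket.create_connection(target, timeout=timeout):
            return True
    except OSError:  # covers timeouts, refused connections and DNS errors
        return False


def outbound_ok(
    targets: Iterable[Target],
    connect: Callable[[Target], bool] = tcp_connect,
) -> bool:
    """Treat outbound as healthy if at least one external target is reachable."""
    return any(connect(t) for t in targets)


class OutboundWatchdog:
    """Counts consecutive failed probes and fires a callback once per outage."""

    def __init__(
        self,
        targets: Iterable[Target],
        threshold: int = 3,
        connect: Callable[[Target], bool] = tcp_connect,
        on_unhealthy: Optional[Callable[[], None]] = None,
    ) -> None:
        self.targets = list(targets)
        self.threshold = threshold
        self.connect = connect
        self.on_unhealthy = on_unhealthy  # hypothetical hook: alert, redeploy, etc.
        self.failures = 0

    def check(self) -> bool:
        """Run one probe. Returns True if outbound connectivity looks fine."""
        if outbound_ok(self.targets, self.connect):
            self.failures = 0
            return True
        self.failures += 1
        # Fire only when the threshold is first reached, not on every later failure.
        if self.failures == self.threshold and self.on_unhealthy is not None:
            self.on_unhealthy()
        return False
```

You would call `check()` on a timer (say every 30 seconds) with two or three unrelated external hosts, so that one of them being down does not look like a host-level outage. What `on_unhealthy` does is up to you: paging someone with the start time is the safe option, and that timestamp is exactly what the reply above asks for. Triggering a redeploy from it is possible in principle, but how to do that is not covered in this thread.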