Production 502s after "Railway Maintenance Underway" email
Anonymous
PROOP

2 months ago

Got the outbound traffic maintenance email at 17:22 UTC. It`s been down for more than an hour already.

Service is still serving 502s / ETIMEDOUT. What can I do on my side, if anything?

WhatsApp_Image_2026-04-17_at_14.24.20.jpeg

Attachments

Awaiting User Response

19 Replies

rafaelmuttoni
PRO

2 months ago

@Brody could you confirm if this is a Railway issue or a Cloudflare issue?


2 months ago

Neither.


2 months ago

Have you redeployed your service?


rafaelmuttoni
PRO

2 months ago

we started having issues 2 hours ago but no luck understanding what's going on, no recent deployments. DBs are working fine


rafaelmuttoni
PRO

2 months ago

just these logs showing up

image.png

Attachments


2 months ago

Please do then.


rafaelmuttoni
PRO

2 months ago

ok doing now


rafaelmuttoni
PRO

2 months ago

we just restarted before but it didn't work, now I'm doing a redeploy


rafaelmuttoni
PRO

2 months ago

ok it's back, could you help us understand what happened?


rafaelmuttoni
PRO

2 months ago

also the redeploy was almost instant, usually take 5mins+


2 months ago

AWS shut outbound off for the host you where on.


rafaelmuttoni
PRO

2 months ago

ok cool, followup Qs:

  • aren't we on Railway metal? or just for building the containers?
  • can't we detect and auto-redeploy once issues like this happen?

rafaelmuttoni
PRO

2 months ago

also anything we should do on our end to prevent stuff like this?


2 months ago

Railway metal isn't relevant anymore, services may be put on GCP or AWS depending on availability at any given time, same performance and same cost, so no worries there.

This is something we are already talking to our AWS rep with, it shouldn't happen again.


rafaelmuttoni
PRO

2 months ago

ok so nothing much to do on our end? should we just expect improvements happening from Railway and other providers?


2 months ago

Correct, we are always improving something!


rafaelmuttoni
PRO

2 months ago

we love Railway, but unfortunately these incidents are very costly with our customers

hope Railway is also thinking on some "auto-resolve" systems that after these incidents it goes back to normal rather than relying on manual interventions


really +1 on this, shouldn't have to manually do things to work around incidents


We're aware of intermittent outbound connectivity drops affecting some services, where outbound connections fail while internal networking and DNS keep working, and we're actively investigating. If you hit it, the quickest workaround is to redeploy, which moves your service onto a fresh host and restores connectivity. If it happens again, reply here with the rough time it started and we'll jump on it while it's live. Those real-time reports are exactly what help us track down the cause.


Status changed to Awaiting User Response angelo-railway 1 day ago


Welcome!

Sign in to your Railway account to join the conversation.

Loading...