Postgres HA cluster outage - production is down [URGENT]
adirkol
PROOP

17 days ago

looking for visibility on a P0 support ticket that's been silent for ~4 hours. Total Postgres HA cluster outage on a Pro plan project (AI Hub, eu-west, project ID 79c3d9e6). Production app hub.videobakery.co fully down for ~13 hours.

Latest update I sent the ticket includes what I believe is the root cause — Patroni superuser/replication credentials on postgres-2 and postgres-3 don't match each other or the application's DATABASE_URL, almost certainly regenerated during Railway's May 19 cleanup or by the Railway Agent yesterday. The original credentials are still visible in the WebApp's DATABASE_URL on Railway.

Angelo (employee) replied earlier acknowledging the postgres-1 volume mismatch issue and escalated to platform engineering. Chandrika (employee) confirmed the escalation 4 hours ago. No response since on the broader cluster outage.

Happy to share ticket ID + full diagnostic via DM. Just looking for eyes on it.

Thanks

Closed

1 Replies

adirkol
PROOP

17 days ago

Clustered recovered, updated the ticker - but still having medium problems. Please check updated ticker


Status changed to Closed brody 17 days ago


Welcome!

Sign in to your Railway account to join the conversation.

Loading...