URGENT: Postgres DB Not Re Deploying
benshaener
PROOP

3 months ago

Postgres gets stuck in deploy for 10 + Minutes then does:

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

Mounting volume on: /var/lib/containers/railwayapp/bind-mounts/b32148b5-0212-4a79-8935-46732435247a/vol_mfi99efo8co8km4j

=========================

Container failed to start

=========================

An unknown error occurred.

If this error persists, please reach out to the Railway team at https://station.railway.com.

this is leading to significant downtime for our business.

Solved

40 Replies

Railway
BOT

3 months ago

Hey there! We've found the following might help you get unblocked faster:

If you find the answer from one of these, please let us know by solving the thread!


pedro-shopinfo
PRO

3 months ago

Also here


benshaener
PROOP

3 months ago

This is extremely urgent, every DB deploy is failing, our entire business utilizes this DB.


benshaener
PROOP

3 months ago

Our service has now been down for an hour, do I need to upgrade to an enterprise plan or something in order to get this fixed ASAP


Hey there Ben, not good, jumping on the thread to make sure you are in a good spot.


Status changed to Awaiting User Response Railway 3 months ago


Gathering context, one second.


benshaener
PROOP

3 months ago

thank you


Status changed to Awaiting Railway Response Railway 3 months ago


Railway
BOT

3 months ago

Hello!

We've escalated your issue to our engineering team.

We aim to provide an update within 1 business day.

Please reply to this thread if you have any questions!

Status changed to Awaiting User Response Railway 3 months ago


benshaener
PROOP

3 months ago

Is there any sort of immediate fix or anything, I can't be waiting a full day to get the service back online


Status changed to Awaiting Railway Response Railway 3 months ago


Engaging the platform team here while we look into the issue, will give you realtime updates. In the meantime, I am now going to try to recover access to the DB. It looks like an issue with mounting.


Status changed to Awaiting User Response Railway 3 months ago


benshaener

Is there any sort of immediate fix or anything, I can't be waiting a full day to get the service back online

We aren't going to leave you in the lurch for that long.


benshaener
PROOP

3 months ago

Ok thank you, that bot message was scary


Status changed to Awaiting Railway Response Railway 3 months ago


Yep, the bot message was anything when we attach a Linear ticket to it.

Working through the issue now, going to take a few minutes to debug.


Status changed to Awaiting User Response Railway 3 months ago


angelo-railway

Yep, the bot message was anything when we attach a Linear ticket to it.Working through the issue now, going to take a few minutes to debug.

jemguan
PRO

3 months ago

same for me


Status changed to Awaiting Railway Response Railway 3 months ago


roy-law
PRO

3 months ago

same here. everything stopped. service offline. couldn't connect to the database. It's been an hour and no signs of recovery


3 months ago

benshaener, your database is online and accessible now.

For anyone else, please provide direct links to your database or service in question.


Status changed to Awaiting User Response Railway 3 months ago


brody

benshaener, your database is online and accessible now.For anyone else, please provide direct links to your database or service in question.

roy-law
PRO

3 months ago


Status changed to Awaiting Railway Response Railway 3 months ago


brody

benshaener, your database is online and accessible now.For anyone else, please provide direct links to your database or service in question.

benshaener
PROOP

3 months ago

Ok, thank you. I am getting only 502 Errors when I hit my API currently, this might be an issue on my end so I'll look into it and let you know if there is any issues with that aswell, thanks



oranuare

This is mine, I'm having the same issue https://railway.com/project/12729143-7dd3-4121-9637-b9edc1637a5f/service/9149fae5-7ef5-4d0b-91e5-6cd853eae135/database?environmentId=92d75e8d-5215-46b7-88f4-cc93111d8e0f

3 months ago

We're still working on a fix which should resolve this. Are you able to try redeploying that postgres service to see if it has the same problem?


Status changed to Awaiting User Response Railway 3 months ago


noahd

We're still working on a fix which should resolve this. Are you able to try redeploying that postgres service to see if it has the same problem?

oranuare
PRO

3 months ago

I had a redeploy before for about 12min. Now I'm trying again, is going on for 4min now


Status changed to Awaiting Railway Response Railway 3 months ago


oranuare
PRO

3 months ago

Ok, so the deployment is up now, but I get this when I try to access the database:

Database Connection

We are unable to connect to the database via SSH.

psql: error: connection to server at "localhost" (::1), port 5432 failed: Connection refused Is the server running on that host and accepting TCP/IP connections? connection to server at "localhost" (127.0.0.1), port 5432 failed: Connection refused Is the server running on that host and accepting TCP/IP connections?


roy-law
PRO

3 months ago

this is what I see

Attachments


brody

benshaener, your database is online and accessible now.For anyone else, please provide direct links to your database or service in question.

benshaener
PROOP

3 months ago

My system is back up, but the performance is very significantly worse, like 1 minute wait times worse, its very bad. Don't expect it will stay this way, right? Any sort of estimate of when I can expect it to be back to normal performance?


benshaener

My system is back up, but the performance is very significantly worse, like 1 minute wait times worse, its very bad. Don't expect it will stay this way, right? Any sort of estimate of when I can expect it to be back to normal performance?

No estimate yet, working on getting everyone back and then we can address the performance.


Status changed to Awaiting User Response Railway 3 months ago


oranuare
PRO

3 months ago



I'm getting this in the logs right now

Attachments


Status changed to Awaiting Railway Response Railway 3 months ago


3 months ago

The coalition version mismatch shouldn't be impacting much as its just a warning. You can sort that out whenever you'd like.
As Angelo had mentioned we're currently working towards getting everything back up and performant again!


Status changed to Awaiting User Response Railway 3 months ago



Status changed to Awaiting Railway Response Railway 3 months ago



noahd

The coalition version mismatch shouldn't be impacting much as its just a warning. You can sort that out whenever you'd like. As Angelo had mentioned we're currently working towards getting everything back up and performant again!

jemguan
PRO

3 months ago

It has now been over 5 hours since our critical database failure began, and the issue remains unresolved.

We are still completely offline. We have seen no recovery of the service, and the lack of a concrete solution or timeline is alarming.

Every hour of downtime is causing significant damage to our business. We cannot wait any longer.


jemguan
PRO

3 months ago

This situation has escalated beyond a technical issue. Our customer support team is now completely overwhelmed.

Because of this 5+ hour downtime:

Our support channels are inundated with angry customers.

We are facing a massive backlog of complaints that we cannot resolve because the database is unresponsive.

Our brand reputation is suffering irreversible damage every minute this continues.


3 months ago

Completely understand and I am very sorry to hear about this. We're still working towards a fix involving startup times/performance and will let you know as soon as we get info.
This degraded performance issue seems to only be related to our US-East region so if it works for you temporarily switching to US-West should resolve things.


Status changed to Awaiting User Response Railway 3 months ago


jemguan

This situation has escalated beyond a technical issue. Our customer support team is now completely overwhelmed.Because of this 5+ hour downtime:Our support channels are inundated with angry customers.We are facing a massive backlog of complaints that we cannot resolve because the database is unresponsive.Our brand reputation is suffering irreversible damage every minute this continues.

Acknowledged, but the issues I see on the projects that you have attached to the account doesn't show any database or impact related to the database issue unless there is another thread I am missing here.

That said, happy to be the upstream blame for the customers who are reporting that issue.


Separate thread, noting that all the DB reports are back online. Next step for us is to check the performance of the underlying host.


oranuare
PRO

3 months ago

I tried moving to us-west, and now it got stuck. Can I cancel the redeployment safely? It seems to be back again on us-east

Attachments


Status changed to Awaiting Railway Response Railway 3 months ago


oranuare

I tried moving to us-west, and now it got stuck. Can I cancel the redeployment safely? It seems to be back again on us-east

3 months ago

Hey Oranuare for the sake of trying to help you out, are you able to make your own thread and link it here? Would love to help debug there.


Status changed to Awaiting User Response Railway 3 months ago


noahd

Hey Oranuare for the sake of trying to help you out, are you able to make your own thread and link it here? Would love to help debug there.

oranuare
PRO

3 months ago

Yes, I have one called "How long are database backups supposed to take?". We could continue there


Status changed to Awaiting Railway Response Railway 3 months ago


oranuare

Yes, I have one called "How long are database backups supposed to take?". We could continue there

3 months ago

Would be ideal if you can create a new one directly for this issue!


Status changed to Awaiting User Response Railway 3 months ago


3 months ago

Bit of an update for y'all,
Everything has leveled out on our end, and things are back online. We'll keep monitoring this in case something comes up, but for now, everything should be back to normal. Feel free to let us know if you see this again.


Railway
BOT

3 months ago

✅ The ticket Setup issue with integrations has been marked as completed.


Railway
BOT

3 months ago

✅ The ticket Setup Issue with Connections has been marked as completed.


Railway
BOT

3 months ago

This thread has been marked as solved automatically due to a lack of recent activity. Please re-open this thread or create a new one if you require further assistance. Thank you!

Status changed to Solved Railway 3 months ago


Loading...