Database migration to railway metal taking 170 hours
jaytonbye
PROOP

a year ago

Hello, we just began migrating our database to Railway Metal in Virginia, but it is taking an incredibly long time. The data is moving, but very slowly. We initially tried this with our test environment first, which worked smoothly; after it worked with our test environment, we dumped our production database and then tried to do the same thing. I am not sure what is going wrong. Any suggestions?

Solved

56 Replies

jaytonbye
PROOP

a year ago

1fc9ac5f-29f8-4b23-ab28-1eebdaeb8c06/volume/6d410a84-ccf6-4064-b5dc-b53bb746fc15/metrics?environmentId=87adcc32-cf45-44c4-bd23-7de68500165a


a year ago

kbit/s <:patrickwow:901282270806175775>


a year ago

!t


a year ago

This thread has been escalated to the Railway team.

Status changed to Awaiting Railway Response medim 11 months ago


jaytonbye
PROOP

a year ago

Do you recommend that we spin up a new instance a database, and do a data import? Or should we try to wait it out?


jaytonbye
PROOP

a year ago

This is a substantial outage for us, we've never been down for over an hour. I need to make a decision, please advise.


a year ago

Hello,

Your database is not offline from the looks of it?


jaytonbye
PROOP

a year ago

It says 1%, am I not understanding this?


jaytonbye
PROOP

a year ago

We're waiting for the migration to complete to bring our servers back online.


a year ago

1% of the migration, your database is still online.


a year ago

You do not need to stop anything that connects to it.


jaytonbye
PROOP

a year ago

IS our database online in California or in Virginia? It was in california to start, but we migrated to Virginia.


a year ago

Its currently online in US-West2


jaytonbye
PROOP

a year ago

I don't understand how that is possible, as the migration was to virginia, as it shows in the picture (from us west to east).


a year ago

Its hasn't finished the migration yet, so its still running in US-West


jaytonbye
PROOP

a year ago

Are you saying that we can continue to function normally? Our cron jobs immediately started having issues, and we experienced some errors; which I assumed were related.


a year ago

The database is fully online.


a year ago

It never went offline.


a year ago

At least for the last 21 hours (since thats when the latest deploy happened)


jaytonbye
PROOP

a year ago

No, we did this 1.5 hours ago.


a year ago

Yes, the database has not been offline for the last 21 hours


a year ago

1388249336730882191


jaytonbye
PROOP

a year ago

I believe what you are saying is that my database is still running on US West, but when the migration completes, it will automatically be booted on the East. Is this correct?

If so, in the meantime, I should keep our servers running in the west, but I will need to immediately switch them to the east when the database migration completes, as they cannot be that far apart. Does that sound correct?


a year ago

Correct to everything but -

they cannot be that far apart

They can be that far apart, it will work, but you will see additional latency.


jaytonbye
PROOP

a year ago

I realize it will work from a technical perspective, but everything will break due to the slowdown. 90% of the functionality of our platform is handled by cron jobs which will be fired off faster than they complete.


a year ago

Then you will see a temporary increase in latency.


a year ago

Obviously, the kbps speed is an issue, I'm just trying to help you get yourself unblocked in the meantime.


jaytonbye
PROOP

a year ago

Yes, thank you Brody. I appreciate your guidance.


a year ago

Ill report the issue, but yeah, please bring your services back online.


jaytonbye
PROOP

a year ago

We are handling that now. When the migration completes, will we be able to know? So long as we can get it done and it doesn't happen this Saturday or Sunday, we should be okay. However, if it happens in the middle of our live events, it puts our company in a bit of danger, as we have hundreds of people at these events depending on our service. Going down for a few minutes would be ok, but if we were to go down for a few hours, it would be disasterous for us. I will hope for the best.


a year ago

I will ask platform for next steps here, we may have you stop the migration all together and restart it.


jaytonbye
PROOP

a year ago

Thank you.


a year ago

@Jason Layton - We stopped the migration, please start a new migration again.


jaytonbye
PROOP

a year ago

Before I start the migration, can you give me an estimate of how long you think it will take? I'm trying to avoid an outage occuring tomorrow or Sunday. If you think it would complete tonight, I'm happy to do it, if not I would prefer to wait until Monday. Please advise.


a year ago

It will give you an accurate ETA when you start it, +250 hours was wrong.


jaytonbye
PROOP

a year ago

Ok, I will try to do it as soon as we are back on our feet. We are still experiencing outages in other areas, I'm putting out fires right now. Thank you.


a year ago

No problem!


jaytonbye
PROOP

a year ago

Any chance you can give me some guidance on the problem, my ai assistant thinks it's related to the migration.

We were using a link with a JWT embeded in the URL (I know this isn't great practice, but fixing it hasn't been a priority).

https://wrestlingtournaments.com/dualsWrestlerRegistrationMaster/eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9/eyJ0ZWFtSWQiOjgxNCwiZHVhbERpdmlzaW9uSWQiOjEwLCJpYXQiOjE3NTEwNjAwNDUsImV4cCI6MTc1OTcwMDA0NX0/f2uJJr-ZI1p5EPVyzRiTl6Fj097t8vuwISmFdJq7Gzc

The page is blank, and the console shows:

Failed to load module script: Expected a JavaScript-or-Wasm module script but the server responded with a MIME type of "text/html". Strict MIME type checking is enforced for module scripts per HTML spec.


a year ago

there is a lot of white, but how blank should the page be?

1388275485938683936

1388275486337142884


jaytonbye
PROOP

a year ago

I think I fixed it by forcing a redirect to www on cloudflare. I'm not sure where the issue came from, but the subdomain seems to fix the issue.


a year ago

Sounds good, but it is safe to say it wasnt due to the migration, your database's availability and operation did not change


jaytonbye
PROOP

a year ago

Well, we moved the front end servers as well. It was working until we took the site down to do maintenance; once it came back online, anyone who had tried to load the site couldn't load it at all, unless they did a hard refresh.


jaytonbye
PROOP

a year ago

Once they did a hard refresh, it would work; but not that link. That link required the subdomain.


a year ago

Glad you got it solved!


jaytonbye
PROOP

a year ago

Computers are hard. Thank you Brody. I'm a bit terrified to try the migration right now, we're in too precarious of a position; but I will do it first thing Monday morning. Thanks again for your help.


a year ago

Remember, the database will not go offline until the migration starts its last step.


jaytonbye
PROOP

a year ago

Sunday is the debut of our new product which will have about 700 people on location at the event (for hopefully only a few hours, assuming we don't go down at all [the product is definitely not as stable as I would like it to be]); that alone is enough stress. We're stable now (I think), so I'm going to spend the rest of my effort preparing for the event. Thank you.


a year ago

No problem, good luck with the event!


Status changed to Solved christian 11 months ago


a year ago

@Jason Layton How did it go?


a year ago

If you don't mind me asking


a year ago

Any issue with your migration?


a year ago

I just checked, done 3 days ago without issues, completed in 12 minutes


a year ago

!s


cicerorph
HOBBY

a year ago

Btw, the bot didn't mark it as solved, you should prob try it again


a year ago

!s


a year ago

I just checked the help station and it seems like that this thread has been already marked as solved.


a year ago

I'll go ahead and lock this thread.


Welcome!

Sign in to your Railway account to join the conversation.

Loading...