60 Replies

mckay
PROOP

15 days ago

Attempting to switch off of metal to see if that helps


mckay
PROOP

15 days ago

look at error logs


15 days ago

What should I be looking at?



mckay
PROOP

15 days ago

Please see Metrics, you'll see the 503 influx


mckay
PROOP

15 days ago

More info: Same machine same service in the most recent hop on cache-iad-kjyo7100048-IAD


mckay
PROOP

15 days ago

Have moved regions and redeployed


mckay
PROOP

15 days ago

Still nothing


mckay
PROOP

15 days ago

Is anyone looking into this?


15 days ago

503 is returned from your app though?


mckay
PROOP

15 days ago

It is proxying calls to a downstream


mckay
PROOP

15 days ago

and soemthing is happening between railway and fastly


mckay
PROOP

15 days ago

No commit / deployment changes in either application around time of error


mckay
PROOP

15 days ago

Same machine same service in the most recent hop on cache-iad-kjyo7100048-IAD: scJ4iSV4UkyHDI2juJNzrcUzYk/oO1PMCyYg4bz5Utg!CMH!cache-cmh1290116-CMH, scJ4iSV4UkyHDI2juJNzrcUzYk/oO1PMCyYg4bz5Utg!IAD!cache-iad-kjyo7100048-IAD


mckay
PROOP

15 days ago

cache-iad-kjyo7100048-IAD and cache-cmh1290116-CMH -- these are Fastly edge cache node identifiers. IAD = Ashburn, Virginia datacenter. CMH = Columbus, Ohio datacenter. Fastly uses airport codes for their PoPs (Points of Presence).
"Same machine same service in the most recent hop" -- this is a specific Fastly error message. It means a request arrived at a cache node that already handled it, indicating a routing loop.
Railway uses Fastly as its CDN/proxy layer for *.up.railway.app domains. So when your app hits APPURL, the request goes through Fastly first.
503 status + that specific error string is Fastly's way of saying "I can't reach the origin server" or "the request is looping." It's not your app or your backend code returning 503 -- it's the infrastructure layer in front of it.


mckay
PROOP

15 days ago

I can get on a call if needed as well


mathu97
PRO

15 days ago

I have a similar setup and am seeing 503s as well.


mckay
PROOP

15 days ago

I appreciate you.


mckay
PROOP

15 days ago

Going to try swapping to a custom domain to see if it fixes the issue on railways side


mckay
PROOP

15 days ago

Attempting a production fix by using custom domains as I really do believe this is an issue on Railway's side


mckay
PROOP

15 days ago

Fix did not work


mathu97
PRO

15 days ago


mckay
PROOP

14 days ago

Timestamp of errors starting

image.png

Attachments



domain?


wait


Okay


Tried something


lets see if errors go down


Can I get a redeploy rq?


mckay
PROOP

14 days ago

yes please


mckay
PROOP

14 days ago

on it


We had to re-roll Fastly because of… another… DDoS, that lasted 45 seconds


this may or may not be related


mckay
PROOP

14 days ago

Didn't work


mckay
PROOP

14 days ago

Did redeploy


Likely not network then


looking deeper


mckay
PROOP

14 days ago

thanks let me know what you need from me and i'll be here


Wait


Going to move everything off Fastly


mckay
PROOP

14 days ago

I have already tried custom domains incase it was a proxying issue at railway.app


mckay
PROOP

14 days ago

thanks dude


okay, give it a sec


mckay
PROOP

14 days ago

ok, will I need to redeploy


I don't think so


okay, everything off Fastly on your side


mckay
PROOP

14 days ago

I attempted redeploy


if still not, will jump on bridge


mckay
PROOP

14 days ago

Still seeing 503s


okay, lemme get a meet spun up



request sent


mckay
PROOP

14 days ago

accepted


ramosfbc
PRO

14 days ago

I'm having the same issue. App is completely down. Frontend is not reaching the backend


mckay
PROOP

14 days ago

Was good for a bit, happening again now


mckay
PROOP

14 days ago

image.png

Attachments


14 days ago

Is this a specific endpoint?


we got it solved, put em off fastly


sorry, needed to exfil that


Status changed to Solved brody 11 days ago


Loading...