a day ago
monitors went down showing "socket hang up" just now
87 Replies
a day ago
might've been resolved on its own?
a day ago
back green but still...
a day ago
connection err closed something something when I tried to fetch my service's healthcheck endpoint
a day ago
3 services were/are effected so far, no new deployments
a day ago
will look around
a day ago
I just had it happen on railway.com as well
a day ago
I'm in eu ams
affected services whose healthchecks failed externally
a day ago
Seeing latency spikes
a day ago
both screenshots are of services linked above
Attachments
a day ago
not seeing the socket hang up anymore but still seeing slight latency spikes
a day ago
socket hang up again
a day ago
may I ask which ip/isp you're getting these from? can dm too
a day ago
502 bad gateways, connection reset by peer
a day ago
69.46.46.14:443: i/o timeout
a day ago
constantly, like right now?
a day ago
upstream error
Attachments
a day ago
uhh not super constantly but quite consistent
a day ago
is there any response body?
a day ago
upstream error
a day ago
everything seems to be stable the last 30m, but might just be luck
a day ago
let me know if you see one in the next 5min or so
a day ago
instability is back...
a day ago
unexpected EOF
a day ago
read tcp 172.18.0.8:33922->66.33.22.216:443: read: connection reset by peer
a day ago
ok, I know why you see this now
a day ago
wait what
a day ago
Oh I see, the origin/railway proxies (on the 66.33.22.0/24 prefix) were just deployed
a day ago
How aggressively are you monitoring those? Do you keep the connection open?
a day ago
I don't think so, minutely checks
a day ago
I'm trying to access the failing URLs from my browser, I get hit with a conn refused (or alike, browser autorefreshes) and afterwards it seems to work fine
a day ago
could you send me the failing URL in here/over DMs? (does it resolve to an address within 66.33.22.0/24)?
a day ago
I'll send you the ones my monitors are annoying me about, do want to note that it's intermittent so far
21 hours ago
seeing similar intermittent down behavior
19 hours ago
happened again around 40 minutes ago
9 hours ago
seeing timeouts, question mark
8 hours ago
do you know if its the 66.33.22 IPs or 69.46.46 IPs you're seeing timeouts on?
8 hours ago
69.46.46.58
8 hours ago
resolved ""itself"" as of now but
Attachments
8 hours ago
ok, that would do it. bgp should reconverge
Attachments
8 hours ago
Let me know if you see anything else.
8 hours ago
Was this from a monitor out of interest? Is the monitor hitting the "ams1" POP? https:///.railway/cdn-trace
Yeah, it was a monitor hitting my own API. The entire app was running into a Cloudflare timeout error, but the Railway metrics were still showing everything as healthy and operating normally so idk what was that
8 hours ago
https://discord.com/channels/713503345364697088/1511663784127762483/1511663784127762483 seeing some latency spikes
8 hours ago
69.46.46.58
Attachments
8 hours ago
Could you run a traceroute/mtr to that IP from the monitor's network/server if possible? 🙏
8 hours ago
not consistent latency... mtr just shows ams eq6 right now, but ill keep retrying
8 hours ago
some cloudflare proxied requests are timing out completely
7 hours ago
I'd like to see the hops you take before eq6
7 hours ago
2.|-- sre02.gs.core.blackgate.nl 0.0% 10 4.6 4.7 4.3 6.0 0.5
3.|-- 100.65.1.14 0.0% 10 4.3 5.0 3.8 11.2 2.2
4.|-- 100.65.0.161 0.0% 10 8.6 5.6 3.9 9.8 2.0
5.|-- 81.20.68.161 0.0% 10 4.6 5.8 4.5 10.0 1.8
6.|-- ae-7.r23.amstnl07.nl.bb.gin.ntt.net 0.0% 10 84.0 85.5 83.4 90.8 2.5
7.|-- ae-18.a01.amstnl07.nl.bb.gin.ntt.net 0.0% 10 8.6 7.4 4.7 14.1 3.1
8.|-- 81.20.68.138 0.0% 10 3.4 4.3 3.3 10.5 2.2
9.|-- vl221.ams-eq6-dist-2.cdn77.com 0.0% 10 4.5 5.1 4.5 6.5 0.6
10.|-- 69.46.46.70 0.0% 10 4.9 5.0 4.3 7.6 0.97 hours ago
https://utilities-us-east.up.railway.app/raw
What is the x-railway-edge header you see here?
7 hours ago
railway/europe-west4-drams3a
7 hours ago
What about here? https://utilities-us-east-cf-proxied.railway.com/raw
7 hours ago
same edge val, cf-ray ending in -ams
7 hours ago
Ty
7 hours ago
timing out for me
7 hours ago
Attachments
7 hours ago
and... not anymore
7 hours ago
No timeouts on https://utilities-us-east.up.railway.app/raw right - just via Cloudflare?
7 hours ago
timeouts only observed via cloudflare so far
7 hours ago
latency spikes on non-cloudflare still a thing but my monitors havent tripped on them
7 hours ago
cf-ray became lhr, then timed out the following refresh
(and once again I am able to fetch it fine..)
7 hours ago
I'm assuming you disabled the ams pop
Attachments
7 hours ago
Nope it's still up, do you see that on the non-CF domain as well?
7 hours ago
I do not
Attachments
7 hours ago
I have a suspicion on what it could be. I'm going to disable something and we can see if that resolves it.
swag42dev
 railway/europe-west4-drams3a
7 hours ago
Is your domain also proxied by Cloudflare?
phin
Is your domain also proxied by Cloudflare?
7 hours ago
no
7 hours ago
I've disabled something in AMS - let me know if you see improvement over the next half hour or so
phin
Is your domain also proxied by Cloudflare?
7 hours ago
Domain management is on CF, but this particular domain is not proxied.
7 hours ago
still seeing these (or should I just wait a bit more)
7 hours ago
What latency is that tracking? ICMP or HTTP?
7 hours ago
HTTP
phin
May I know the domain or service this is happening to?
7 hours ago
project/b1f6fd55-ba7c-423a-8784-21ae43029bab/service/88da5b4b-99c2-4e7d-8946-b51c75496f9f
6 hours ago
still going cf ams -> hikari lhr
6 hours ago
non-proxied latency spikes still occurring
6 hours ago
Is there a specific domain this is occuring on or is it all of your domains?
6 hours ago
I'll dm
5 hours ago
This is starting to affect other deployments.
The problem is becoming widespread.
Attachments
5 hours ago
Hey.
Same here.
Using a VPN, we tested a few other edges.
In a nutshell, railway/europe-west4-drams3a is definitely troubles for us. us-west2 works fine as far as we can tell.
4 hours ago
Monitors tripped over *.up.railway.app timeout
4 hours ago
so it's no longer necessarily CF specific
2 hours ago
I'm also seeing latency spikes from Singapore
(Singapore Railway service -> EU-W Railway service over pubnet)