a month ago
All of my services are seeing extremely high response times (> 2mins) for simple HTTP requests. The upstream response is 1 ms. So clearly something's up with the Railway network. My app is hosted in Southeast Asia (Singapore) and behind a cloudflare proxy. The edge network is europe-west4-drams3a. Not sure how and if that has an impact.
Attached is a sample response of request with high response time (~ 2 mins ). This is happening across multiple services.
Attachments
22 Replies
a month ago
Hey there! We've found the following might help you get unblocked faster:
🧵 Very high and inconsistent round trip times to backend from end user
📚 Deploy Static Sites with Zero Configuration and Custom Domains
🧵 502 Bad Gateway error despite successful container startup
If you find the answer from one of these, please let us know by solving the thread!
a month ago
Hey,
This sounds like a routing issue. Can you give us some more diagnostic information and run a few commands for me?
The
x-railway-edgeheader from your http requesttracerouteto66.33.22.11tracerouteto66.33.22.1The deployment ID that is experiencing the issue
Status changed to Awaiting User Response Railway • about 1 month ago
a month ago
Sharing the request HAR file along with traceroute to 66.33.22.11 and 66.33.22.1
The host is in BLR. Not sure why the edge is railway/europe-west4-drams3atraceroute to 66.33.22.11 (66.33.22.11), 64 hops max, 40 byte packets
1 192.168.1.1 (192.168.1.1) 3.738 ms 4.312 ms 3.940 ms
2 10.240.9.208 (10.240.9.208) 5.966 ms 5.854 ms 11.043 ms
3 * * *
4 125.18.215.57 (125.18.215.57) 7.855 ms
125.18.215.61 (125.18.215.61) 9.723 ms
125.18.215.57 (125.18.215.57) 7.913 ms
5 116.119.42.61 (116.119.42.61) 42.656 ms
116.119.119.198 (116.119.119.198) 43.526 ms
116.119.42.27 (116.119.42.27) 43.272 ms
6 400940.sgw.equinix.com (27.111.231.33) 44.674 ms 51.554 ms 46.890 ms
7 * * *
8 * * *
9 * * *
10 * * *
11 * * *
12 * * *
13 * * *
14 * * *
15 * * *
16 * * *
17 * * *
18 * *
traceroute to 66.33.22.1 (66.33.22.1), 64 hops max, 40 byte packets
1 192.168.1.1 (192.168.1.1) 3.890 ms 3.724 ms 3.265 ms
2 10.240.9.208 (10.240.9.208) 5.461 ms 5.904 ms 6.579 ms
3 * * *
4 125.18.215.57 (125.18.215.57) 6.264 ms
125.18.215.61 (125.18.215.61) 6.566 ms
125.18.215.57 (125.18.215.57) 10.110 ms
5 116.119.161.51 (116.119.161.51) 57.500 ms
116.119.81.143 (116.119.81.143) 44.676 ms
116.119.161.0 (116.119.161.0) 56.681 ms
6 400940.sgw.equinix.com (27.111.231.33) 42.922 ms 44.104 ms 42.155 ms
7 * * *
8 * * *
9 * * *
10 * * *
11 * * *
12 * * *
13 * * *
14 * * *
15 * * *
16 * * *
17 * * *
18 * * *
19 * * *
20 * * *
21 * * *
22 * * *
23 * * *
24 * * *
25 * * *
26 * * *
27 * * *
28 * * *
29 * * *
30 * * *
31 *
Attachments
Status changed to Awaiting Railway Response Railway • about 1 month ago
a month ago
All deployments are facing the same issue
a month ago
Noted, I have pass this on to our network team. We do have something in flight that should address, we are targeting next week for the fix. The problem is that some ISPs are not keeping the updated peering tables that we have published to get the hops in the right order.
How much customer impact are you facing?
Status changed to Awaiting User Response Railway • about 1 month ago
angelo-railway
Noted, I have pass this on to our network team. We do have something in flight that should address, we are targeting next week for the fix. The problem is that some ISPs are not keeping the updated peering tables that we have published to get the hops in the right order. How much customer impact are you facing?
a month ago
At the moment, this is happening on one of our realtime endpoints (which is not high impact). I'm worried that this might start affecting other high impact API response times. Hope this can be resolved at the earliest.
Status changed to Awaiting Railway Response Railway • about 1 month ago
a month ago
Hello!
We've escalated your issue to our engineering team.
We aim to provide an update within 1 business day.
Please reply to this thread if you have any questions!
Status changed to Awaiting User Response Railway • about 1 month ago
a month ago
We're actively working on it, going to subscribe you to notifications as the ticket gets completed.
a month ago
🛠️ The ticket Global Connectivity Improvement has been marked as todo.
a month ago
Howdy!
Just heard back from our infra team and we have recently implemented a global backbone. It should improve latency across regions however if there is any application level misconfiguration it won't help with that.
How does your service latency look now?
a month ago
It isn't resolved. Attaching the http response logs for a specific endpoint. See the upstream response duration ( 1ms). These are from India and should be routed directly to Singapore (where our instances are) but are getting routed through Europe - look at the edge
Attachments
Status changed to Awaiting Railway Response Railway • 26 days ago
25 days ago
@noahd Can you please share update on this? This is still not resolved for us.
25 days ago
We had a similar issue recently with networking from Sydney being routed through us-west instead of Singapore.
We have an update in place now which should resolve routing to Singapore.
Are you able to give the services a redeploy and see if its routing as expected?
Status changed to Awaiting User Response Railway • 25 days ago
23 days ago
Not resolved. It is the same. Nothing changed. View the screenshots
Attachments
Status changed to Awaiting Railway Response Railway • 23 days ago
21 days ago
Can you provide an update on this
19 days ago
🛠️ The ticket Slow Response Times has been marked as triage.
Status changed to Awaiting User Response Railway • 19 days ago
19 days ago
🛠️ The ticket Slow Web Response Times has been marked as triage.
18 days ago
I'm still awaiting a response/update on this
Status changed to Awaiting Railway Response Railway • 18 days ago
18 days ago
Hey! Apologies for the lack of update here.
Can you share what that endpoint is doing? Is it keeping connections open?
Status changed to Awaiting User Response Railway • 18 days ago
18 days ago
From what we can see, all other requests to the endpoint (POST, OPTIONS, etc.) is fast and completes within miliseconds.
Only GET /api/realtime is precisely 2min5sec on every request, which indicates it could be something keeping the connection alive and closing it after 2min5sec such as a WebSocket.
13 days ago
✅ The ticket Slow Web Response Times has been marked as completed.
13 days ago
✅ The ticket Global Connectivity Enhancement has been marked as completed.
6 days ago
This thread has been marked as solved automatically due to a lack of recent activity. Please re-open this thread or create a new one if you require further assistance. Thank you!
Status changed to Solved Railway • 6 days ago