High HTTP response times
gauthamses
PROOP

a month ago

All of my services are seeing extremely high response times (> 2mins) for simple HTTP requests. The upstream response is 1 ms. So clearly something's up with the Railway network. My app is hosted in Southeast Asia (Singapore) and behind a cloudflare proxy. The edge network is europe-west4-drams3a. Not sure how and if that has an impact.

Attached is a sample response of request with high response time (~ 2 mins ). This is happening across multiple services.

Attachments

Solved

22 Replies

Railway
BOT

a month ago

Hey there! We've found the following might help you get unblocked faster:

If you find the answer from one of these, please let us know by solving the thread!


Hey,

This sounds like a routing issue. Can you give us some more diagnostic information and run a few commands for me?

  • The x-railway-edge header from your http request

  • traceroute to 66.33.22.11

  • traceroute to 66.33.22.1

  • The deployment ID that is experiencing the issue


Status changed to Awaiting User Response Railway about 1 month ago


I also have the requestId from your screenshot which should help.


gauthamses
PROOP

a month ago

Sharing the request HAR file along with traceroute to 66.33.22.11 and 66.33.22.1

The host is in BLR. Not sure why the edge is railway/europe-west4-drams3a

traceroute to 66.33.22.11 (66.33.22.11), 64 hops max, 40 byte packets

1 192.168.1.1 (192.168.1.1) 3.738 ms 4.312 ms 3.940 ms

2 10.240.9.208 (10.240.9.208) 5.966 ms 5.854 ms 11.043 ms

3 * * *

4 125.18.215.57 (125.18.215.57) 7.855 ms

125.18.215.61 (125.18.215.61) 9.723 ms

125.18.215.57 (125.18.215.57) 7.913 ms

5 116.119.42.61 (116.119.42.61) 42.656 ms

116.119.119.198 (116.119.119.198) 43.526 ms

116.119.42.27 (116.119.42.27) 43.272 ms

6 400940.sgw.equinix.com (27.111.231.33) 44.674 ms 51.554 ms 46.890 ms

7 * * *

8 * * *

9 * * *

10 * * *

11 * * *

12 * * *

13 * * *

14 * * *

15 * * *

16 * * *

17 * * *

18 * *

traceroute to 66.33.22.1 (66.33.22.1), 64 hops max, 40 byte packets

1 192.168.1.1 (192.168.1.1) 3.890 ms 3.724 ms 3.265 ms

2 10.240.9.208 (10.240.9.208) 5.461 ms 5.904 ms 6.579 ms

3 * * *

4 125.18.215.57 (125.18.215.57) 6.264 ms

125.18.215.61 (125.18.215.61) 6.566 ms

125.18.215.57 (125.18.215.57) 10.110 ms

5 116.119.161.51 (116.119.161.51) 57.500 ms

116.119.81.143 (116.119.81.143) 44.676 ms

116.119.161.0 (116.119.161.0) 56.681 ms

6 400940.sgw.equinix.com (27.111.231.33) 42.922 ms 44.104 ms 42.155 ms

7 * * *

8 * * *

9 * * *

10 * * *

11 * * *

12 * * *

13 * * *

14 * * *

15 * * *

16 * * *

17 * * *

18 * * *

19 * * *

20 * * *

21 * * *

22 * * *

23 * * *

24 * * *

25 * * *

26 * * *

27 * * *

28 * * *

29 * * *

30 * * *

31 *

Attachments


Status changed to Awaiting Railway Response Railway about 1 month ago


gauthamses
PROOP

a month ago

All deployments are facing the same issue


Noted, I have pass this on to our network team. We do have something in flight that should address, we are targeting next week for the fix. The problem is that some ISPs are not keeping the updated peering tables that we have published to get the hops in the right order.

How much customer impact are you facing?


Status changed to Awaiting User Response Railway about 1 month ago


angelo-railway

Noted, I have pass this on to our network team. We do have something in flight that should address, we are targeting next week for the fix. The problem is that some ISPs are not keeping the updated peering tables that we have published to get the hops in the right order. How much customer impact are you facing?

gauthamses
PROOP

a month ago

At the moment, this is happening on one of our realtime endpoints (which is not high impact). I'm worried that this might start affecting other high impact API response times. Hope this can be resolved at the earliest.


Status changed to Awaiting Railway Response Railway about 1 month ago


Railway
BOT

a month ago

Hello!

We've escalated your issue to our engineering team.

We aim to provide an update within 1 business day.

Please reply to this thread if you have any questions!

Status changed to Awaiting User Response Railway about 1 month ago


We're actively working on it, going to subscribe you to notifications as the ticket gets completed.


Railway
BOT

a month ago

🛠️ The ticket Global Connectivity Improvement has been marked as todo.


noahd
EMPLOYEE

a month ago

Howdy!
Just heard back from our infra team and we have recently implemented a global backbone. It should improve latency across regions however if there is any application level misconfiguration it won't help with that.

How does your service latency look now?


gauthamses
PROOP

a month ago

It isn't resolved. Attaching the http response logs for a specific endpoint. See the upstream response duration ( 1ms). These are from India and should be routed directly to Singapore (where our instances are) but are getting routed through Europe - look at the edge


Status changed to Awaiting Railway Response Railway 26 days ago


gauthamses
PROOP

25 days ago

@noahd Can you please share update on this? This is still not resolved for us.


noahd
EMPLOYEE

25 days ago

We had a similar issue recently with networking from Sydney being routed through us-west instead of Singapore.
We have an update in place now which should resolve routing to Singapore.

Are you able to give the services a redeploy and see if its routing as expected?


Status changed to Awaiting User Response Railway 25 days ago


gauthamses
PROOP

23 days ago

Not resolved. It is the same. Nothing changed. View the screenshots


Status changed to Awaiting Railway Response Railway 23 days ago


gauthamses
PROOP

21 days ago

Can you provide an update on this


Railway
BOT

19 days ago

🛠️ The ticket Slow Response Times has been marked as triage.


Status changed to Awaiting User Response Railway 19 days ago


Railway
BOT

19 days ago

🛠️ The ticket Slow Web Response Times has been marked as triage.


gauthamses
PROOP

18 days ago

I'm still awaiting a response/update on this


Status changed to Awaiting Railway Response Railway 18 days ago


ray-chen
EMPLOYEE

18 days ago

Hey! Apologies for the lack of update here.

Can you share what that endpoint is doing? Is it keeping connections open?


Status changed to Awaiting User Response Railway 18 days ago


ray-chen
EMPLOYEE

18 days ago

From what we can see, all other requests to the endpoint (POST, OPTIONS, etc.) is fast and completes within miliseconds.

Only GET /api/realtime is precisely 2min5sec on every request, which indicates it could be something keeping the connection alive and closing it after 2min5sec such as a WebSocket.


Railway
BOT

13 days ago

✅ The ticket Slow Web Response Times has been marked as completed.


Railway
BOT

13 days ago

✅ The ticket Global Connectivity Enhancement has been marked as completed.


Railway
BOT

6 days ago

This thread has been marked as solved automatically due to a lack of recent activity. Please re-open this thread or create a new one if you require further assistance. Thank you!

Status changed to Solved Railway 6 days ago


Loading...