High HTTP response times

gauthamses

PROOP

8 months ago

All of my services are seeing extremely high response times (> 2mins) for simple HTTP requests. The upstream response is 1 ms. So clearly something's up with the Railway network. My app is hosted in Southeast Asia (Singapore) and behind a cloudflare proxy. The edge network is europe-west4-drams3a. Not sure how and if that has an impact.

Attached is a sample response of request with high response time (~ 2 mins ). This is happening across multiple services.

Attachments

Screenshot%...

Solved

22 Replies

Railway

BOT

8 months ago

Hey there! We've found the following might help you get unblocked faster:

If you find the answer from one of these, please let us know by solving the thread!

angelo-railway

EMPLOYEE

8 months ago

Hey,

This sounds like a routing issue. Can you give us some more diagnostic information and run a few commands for me?

The x-railway-edge header from your http request
traceroute to 66.33.22.11
traceroute to 66.33.22.1
The deployment ID that is experiencing the issue

Status changed to Awaiting User Response Railway • 8 months ago

angelo-railway

EMPLOYEE

8 months ago

I also have the requestId from your screenshot which should help.

gauthamses

PROOP

8 months ago

Sharing the request HAR file along with traceroute to 66.33.22.11 and 66.33.22.1

The host is in BLR. Not sure why the edge is railway/europe-west4-drams3a

traceroute to 66.33.22.11 (66.33.22.11), 64 hops max, 40 byte packets

1 192.168.1.1 (192.168.1.1) 3.738 ms 4.312 ms 3.940 ms

2 10.240.9.208 (10.240.9.208) 5.966 ms 5.854 ms 11.043 ms

3 * * *

4 125.18.215.57 (125.18.215.57) 7.855 ms

125.18.215.61 (125.18.215.61) 9.723 ms

125.18.215.57 (125.18.215.57) 7.913 ms

5 116.119.42.61 (116.119.42.61) 42.656 ms

116.119.119.198 (116.119.119.198) 43.526 ms

116.119.42.27 (116.119.42.27) 43.272 ms

6 400940.sgw.equinix.com (27.111.231.33) 44.674 ms 51.554 ms 46.890 ms

7 * * *

8 * * *

9 * * *

10 * * *

11 * * *

12 * * *

13 * * *

14 * * *

15 * * *

16 * * *

17 * * *

18 * *

traceroute to 66.33.22.1 (66.33.22.1), 64 hops max, 40 byte packets

1 192.168.1.1 (192.168.1.1) 3.890 ms 3.724 ms 3.265 ms

2 10.240.9.208 (10.240.9.208) 5.461 ms 5.904 ms 6.579 ms

3 * * *

4 125.18.215.57 (125.18.215.57) 6.264 ms

125.18.215.61 (125.18.215.61) 6.566 ms

125.18.215.57 (125.18.215.57) 10.110 ms

5 116.119.161.51 (116.119.161.51) 57.500 ms

116.119.81.143 (116.119.81.143) 44.676 ms

116.119.161.0 (116.119.161.0) 56.681 ms

6 400940.sgw.equinix.com (27.111.231.33) 42.922 ms 44.104 ms 42.155 ms

7 * * *

8 * * *

9 * * *

10 * * *

11 * * *

12 * * *

13 * * *

14 * * *

15 * * *

16 * * *

17 * * *

18 * * *

19 * * *

20 * * *

21 * * *

22 * * *

23 * * *

24 * * *

25 * * *

26 * * *

27 * * *

28 * * *

29 * * *

30 * * *

31 *

Attachments

realtime_sl...

Status changed to Awaiting Railway Response Railway • 8 months ago

gauthamses

PROOP

8 months ago

All deployments are facing the same issue

angelo-railway

EMPLOYEE

8 months ago

Noted, I have pass this on to our network team. We do have something in flight that should address, we are targeting next week for the fix. The problem is that some ISPs are not keeping the updated peering tables that we have published to get the hops in the right order.

How much customer impact are you facing?

Status changed to Awaiting User Response Railway • 8 months ago

angelo-railway

gauthamses

PROOP

8 months ago

At the moment, this is happening on one of our realtime endpoints (which is not high impact). I'm worried that this might start affecting other high impact API response times. Hope this can be resolved at the earliest.

Status changed to Awaiting Railway Response Railway • 8 months ago

Railway

BOT

8 months ago

Hello!

We've escalated your issue to our engineering team.

We aim to provide an update within 1 business day.

Please reply to this thread if you have any questions!

Status changed to Awaiting User Response Railway • 8 months ago

angelo-railway

EMPLOYEE

8 months ago

We're actively working on it, going to subscribe you to notifications as the ticket gets completed.

Railway

BOT

8 months ago

🛠️ The ticket Global Connectivity Improvement has been marked as todo.

noahd

EMPLOYEE

8 months ago

Howdy!

Just heard back from our infra team and we have recently implemented a global backbone. It should improve latency across regions however if there is any application level misconfiguration it won't help with that.

How does your service latency look now?

gauthamses

PROOP

8 months ago

It isn't resolved. Attaching the http response logs for a specific endpoint. See the upstream response duration ( 1ms). These are from India and should be routed directly to Singapore (where our instances are) but are getting routed through Europe - look at the edge

Attachments

Screenshot%...

Status changed to Awaiting Railway Response Railway • 8 months ago

gauthamses

PROOP

8 months ago

@noahd Can you please share update on this? This is still not resolved for us.

noahd

EMPLOYEE

8 months ago

We had a similar issue recently with networking from Sydney being routed through us-west instead of Singapore.

We have an update in place now which should resolve routing to Singapore.

Are you able to give the services a redeploy and see if its routing as expected?

Status changed to Awaiting User Response Railway • 8 months ago

gauthamses

PROOP

8 months ago

Not resolved. It is the same. Nothing changed. View the screenshots

Attachments

Screenshot%...

Status changed to Awaiting Railway Response Railway • 8 months ago

gauthamses

PROOP

7 months ago

Can you provide an update on this

Railway

BOT

7 months ago

🛠️ The ticket Slow Response Times has been marked as triage.

Status changed to Awaiting User Response Railway • 7 months ago

Railway

BOT

7 months ago

🛠️ The ticket Slow Web Response Times has been marked as triage.

gauthamses

PROOP

7 months ago

I'm still awaiting a response/update on this

Status changed to Awaiting Railway Response Railway • 7 months ago

ray-chen

EMPLOYEE

7 months ago

Hey! Apologies for the lack of update here.

Can you share what that endpoint is doing? Is it keeping connections open?

Status changed to Awaiting User Response Railway • 7 months ago

ray-chen

EMPLOYEE

7 months ago

From what we can see, all other requests to the endpoint (POST, OPTIONS, etc.) is fast and completes within miliseconds.

Only GET /api/realtime is precisely 2min5sec on every request, which indicates it could be something keeping the connection alive and closing it after 2min5sec such as a WebSocket.

Railway

BOT

7 months ago

✅ The ticket Slow Web Response Times has been marked as completed.

Railway

BOT

7 months ago

✅ The ticket Global Connectivity Enhancement has been marked as completed.

Railway

BOT

7 months ago

This thread has been marked as solved automatically due to a lack of recent activity. Please re-open this thread or create a new one if you require further assistance. Thank you!

Status changed to Solved Railway • 7 months ago

Welcome!