OperationalError - connection to server at "<...>.proxy.rlwy.net" (), port failed: Connection timed out
Anonymous
PROOP

a year ago

At times, connections fail to be established to the database. This is on the cloud and everything else internet related works fine - only connecting to the database seems to be problematic.

I've increased the DB max connections from 100 to 200, and there seems to be no issue with memory or CPU usage. Why are my instances failing to connect randomly at times? They all run with the same exact network infrastructure (being called from AWS lambda, from the same lambda).

Solved

11 Replies

jake
EMPLOYEE

a year ago

Could definitely be the connections. When you run into these, can you try to run SELECT * FROM pg_stat_activity; on Postgres and see how many connections you have open?

Lambdas pretty wirey in the sense that it'll just create a new connection for every request so you may end up flooding the DB. We do connection pooling at the edge, but if you're opening a lot of connections here we might rate limit you on connection opens

I'll circle back with what our rate limits are on connections in the next hour


Status changed to Awaiting User Response Railway about 1 year ago


Anonymous
PROOP

a year ago

Thanks for the answer jake. Usually it's around thirty by the time I get to running that query, so I'm not sure that that's the reason, although rate limit makes sense since we'd be opening and closing a lot of connections in a very small timeframe (thousands of connections per minute possibly).

Need a way to bypass any rate limiting if that's the case.


Status changed to Awaiting Railway Response Railway about 1 year ago


jake
EMPLOYEE

a year ago

I'm being informed there's no rate limit for TCP connections

Can we get a timestamp for when this occured?


Status changed to Awaiting User Response Railway about 1 year ago


Status changed to Solved jake about 1 year ago


Status changed to Awaiting Railway Response Anonymous about 1 year ago


Status changed to Solved jake about 1 year ago


Status changed to Awaiting Railway Response Anonymous about 1 year ago


Anonymous
PROOP

a year ago

Here's a few timestamps.

Attachments


Anonymous
PROOP

a year ago

This is a very small sample out of 2000+ events.


Status changed to In Progress jake about 1 year ago


Updating you on the progress from our side- nothing conclusive but we are having eng. look into this. In the meantime, have you observed any more behavior like this as of late?


Anonymous
PROOP

a year ago

No, the most recent instance was 4 days ago. But the system that we use here has barely been used in that time period so it’s hard to tell.


I will mark this as resolved but do you have access to our Slack. Feel free to ping us immediately so we can address it.


Status changed to Awaiting User Response Railway about 1 year ago


Anonymous
PROOP

a year ago

No, I don't have access to the Slack. How do I get access here? This issue is ongoing by the way.


Status changed to Awaiting Railway Response Railway 12 months ago


We invited you to join a Railway Slack channel on November 12. I've resent the invite emails from Slack.


Status changed to Awaiting User Response Railway 12 months ago


We've identified a possible issue causing the timeouts and deployed a fix for it. Please let us know if it happens again.

In the meantime, I'm this thread in favor of the conversation in Slack.


Status changed to Solved christian 12 months ago


Loading...