a year ago
At times, connections fail to be established to the database. This is on the cloud and everything else internet related works fine - only connecting to the database seems to be problematic.
I've increased the DB max connections from 100 to 200, and there seems to be no issue with memory or CPU usage. Why are my instances failing to connect randomly at times? They all run with the same exact network infrastructure (being called from AWS lambda, from the same lambda).
11 Replies
a year ago
Could definitely be the connections. When you run into these, can you try to run SELECT * FROM pg_stat_activity; on Postgres and see how many connections you have open?
Lambdas pretty wirey in the sense that it'll just create a new connection for every request so you may end up flooding the DB. We do connection pooling at the edge, but if you're opening a lot of connections here we might rate limit you on connection opens
I'll circle back with what our rate limits are on connections in the next hour
Status changed to Awaiting User Response Railway • about 1 year ago
a year ago
Thanks for the answer jake. Usually it's around thirty by the time I get to running that query, so I'm not sure that that's the reason, although rate limit makes sense since we'd be opening and closing a lot of connections in a very small timeframe (thousands of connections per minute possibly).
Need a way to bypass any rate limiting if that's the case.
Status changed to Awaiting Railway Response Railway • about 1 year ago
a year ago
I'm being informed there's no rate limit for TCP connections
Can we get a timestamp for when this occured?
Status changed to Awaiting User Response Railway • about 1 year ago
Status changed to Solved jake • about 1 year ago
Status changed to Awaiting Railway Response Anonymous • about 1 year ago
Status changed to Solved jake • about 1 year ago
Status changed to Awaiting Railway Response Anonymous • about 1 year ago
a year ago
This is a very small sample out of 2000+ events.
Status changed to In Progress jake • about 1 year ago
a year ago
Updating you on the progress from our side- nothing conclusive but we are having eng. look into this. In the meantime, have you observed any more behavior like this as of late?
a year ago
No, the most recent instance was 4 days ago. But the system that we use here has barely been used in that time period so it’s hard to tell.
a year ago
I will mark this as resolved but do you have access to our Slack. Feel free to ping us immediately so we can address it.
Status changed to Awaiting User Response Railway • about 1 year ago
a year ago
No, I don't have access to the Slack. How do I get access here? This issue is ongoing by the way.
Status changed to Awaiting Railway Response Railway • 12 months ago
a year ago
We invited you to join a Railway Slack channel on November 12. I've resent the invite emails from Slack.
Status changed to Awaiting User Response Railway • 12 months ago
a year ago
We've identified a possible issue causing the timeouts and deployed a fix for it. Please let us know if it happens again.
In the meantime, I'm this thread in favor of the conversation in Slack.
Status changed to Solved christian • 12 months ago