2 months ago
Hi Railway Support team,
We've been experiencing recurring performance degradation on our platform since yesterday evening (March 17), and we're reaching out to check whether any infrastructure-level issues on your end could be contributing.
Here's what we're observing:
- Our API response times increase significantly over time, eventually leading to timeouts and 5xx errors for our users
- The issue has occurred at least 3 times: yesterday evening, this morning, and around noon today
- Each time, a manual redeployment immediately restores normal behavior
- The pattern suggests a progressive degradation rather than a sudden crash
We've already looked into potential causes on our side (memory leaks, connection pool exhaustion, background jobs). However, the recurring nature of the issue and the fact that a redeployment temporarily fixes it make us wonder whether there is an underlying issue at the infrastructure level, such as noisy neighbors, network instability, or storage I/O degradation on the nodes hosting our services.
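For context, here is a minimal sketch of one way the "progressive degradation" pattern could be checked on the application side, to help separate a slow memory leak from a platform-level cause. It assumes heap usage is sampled periodically inside the Node.js process. The names `HeapSample` and `heapGrowthPerSecond` are hypothetical helpers for illustration, not part of NestJS or Railway.

```typescript
// Hypothetical diagnostic helper (illustrative names, not a NestJS or Railway API).
// One periodic reading of the process heap, e.g. from process.memoryUsage().heapUsed.
interface HeapSample {
  timestampMs: number;   // when the sample was taken (epoch milliseconds)
  heapUsedBytes: number; // heap in use at that moment
}

// Least-squares slope of heap usage over time, in bytes per second.
// A slope that stays clearly positive between deployments points toward an
// application-side leak; a flat slope while latency still climbs points elsewhere.
function heapGrowthPerSecond(samples: HeapSample[]): number {
  const n = samples.length;
  if (n < 2) return 0; // not enough data to estimate a trend

  const t0 = samples[0].timestampMs;
  let sumX = 0;
  let sumY = 0;
  let sumXY = 0;
  let sumXX = 0;

  for (const s of samples) {
    const x = (s.timestampMs - t0) / 1000; // seconds since first sample
    const y = s.heapUsedBytes;
    sumX += x;
    sumY += y;
    sumXY += x * y;
    sumXX += x * x;
  }

  const denom = n * sumXX - sumX * sumX;
  if (denom === 0) return 0; // all samples share the same timestamp
  return (n * sumXY - sumX * sumY) / denom;
}
```

In a running service, samples could be collected with something like `setInterval(() => samples.push({ timestampMs: Date.now(), heapUsedBytes: process.memoryUsage().heapUsed }), 60_000)` and the slope logged alongside response times. The sampling interval and any alert threshold are assumptions to tune per service.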
Could you check whether any incidents or anomalies were recorded on your infrastructure during the following windows?
- March 17, evening
- March 18, morning
- March 18, around noon
We're running a NestJS API and PostgreSQL database on Railway.
Thanks in advance for your help.