Redis connection timeout causing n8n Primary to crash loop - data at risk
Anonymous
HOBBYOP

a month ago

Hi Railway Support/Community,

My n8n instance has been down since approximately 2:20 AM UTC on May 20, 2026, following what appears to have been an infrastructure incident on Runway end (Postgres logs show database system was interrupted; last known up at 2026-05-20 02:20:39 UTC).

Current situation:

Postgres: Online and recovered

Redis: Shows "Online" but displays "Your application is not running or in an unexpected state" in the Database tab

Primary (n8n): Crash-looping with [Redis client] connect ETIMEDOUT — unable to connect to redis.railway.internal

Worker: Online but useless without Primary

I have no access to the n8n interface and I am very concerned about losing my workflows

What I have tried:

Restarted all services in order: Postgres → Redis → Primary → Worker

Verified Variable References are correctly set: QUEUE_BULL_REDIS_HOST=${{Redis.REDISHOST}}, QUEUE_BULL_REDIS_PORT=${{Redis.REDISPORT}}, QUEUE_BULL_REDIS_PASSWORD=${{Redis.REDIS_PASSWORD}}

Confirmed QUEUE_BULL_REDIS_HOST resolves to redis.railway.internal

On advice from Railway's AI Agent, I also tried:

Ensuring only one region is set (instead of 2) in both Redis and Primary under Scale settings

Checking whether a REDIS_URL variable was needed

Changing Redis / Primary region to Europe

None of these resolved the issue.

My main concerns:

Restoring connectivity between Primary and Redis so n8n comes back online

Ensuring my n8n workflows are not lost — this is critical

This appears to be a network issue on Railway's internal network that did not properly recover after last night's incident.

Thank you

Solved

1 Replies

Railway
BOT

a month ago

Your services were affected by a platform-wide service disruption that occurred between 02:25 and 07:57 UTC on May 20. The incident is now resolved, but your Redis service's private networking did not fully recover, which is why your Primary service cannot reach it despite Redis showing as "online." To fix this, please redeploy the Redis service (go to Redis service settings, click the three-dot menu on the latest deployment, and select Redeploy), then once Redis is healthy, redeploy Primary as well. Regarding your workflows, n8n stores workflow definitions in Postgres (which has recovered successfully), so your data should be intact.


Status changed to Awaiting User Response Railway about 1 month ago


Status changed to Solved Anonymous about 1 month ago


Welcome!

Sign in to your Railway account to join the conversation.

Loading...