Helpppppppppp my service is unstable

8 days ago

Bug Description

I have a customer support bot on WhatsApp built with n8n (Cloud).

Between Friday afternoon and Monday afternoon, the bot stopped responding to my customers. Checking the logs from my WhatsApp provider, I noticed that the workflow webhook in the n8n started responding with a 404 error.

Image

At first, I thought that the workflow might have been disabled for some reason. However, today (Tuesday), another workflow that runs via cron (schedule) did not execute, even though it is active.
This indicates that the issue is likely with the n8n Cloud platform itself.

When I tried to check the execution logs of the workflows to better understand the problem, I noticed something strange: all the logs for my cron-based workflow had disappeared.

Image

In addition, for the webhook-triggered workflow, the logs are incomplete. For today (June 17), I can only view logs up until 14:39.

Image

To Reproduce

I couldn't reproduce the issue.

Expected behavior

It should work correctly; it’s n8n’s own Cloud environment.

Operating System

Cloud

n8n Version

1.92.2

Node.js Version

I don’t know.

Database

SQLite (default)

Execution mode

main (default)

4 Replies

8 days ago

Can you help us ?!


8 days ago

Our suspects https://railway.com/project/e7c65131-7ee1-45fd-8c26-15df2491709f/logs?environmentId=f9a252a9-3ea0-4024-be15-aaf91dd5192f&filter=%40level%3Aerror&start=1750171760000&end=1750172268000

Postgres error problems

You reached the start of the range → Jun 17, 2025 11:49 AM

Jun 17 11:53:28

Postgres

2025-06-17 14:53:25.094 UTC [23] LOG: checkpoint complete: wrote 2604 buffers (15.9%); 0 WAL file(s) added, 0 removed, 2 recycled; write=264.248 s, sync=25.638 s, total=296.288 s; sync files=69, longest=5.504 s, average=0.372 s; distance=25979 kB, estimate=25979 kB; lsn=2/FB7FB20, redo lsn=2/F214F40

Jun 17 11:53:28

Postgres

2025-06-17 14:53:28.097 UTC [23] LOG: checkpoint starting: time

Jun 17 11:55:18

Postgres

2025-06-17 14:55:11.651 UTC [23] LOG: checkpoint complete: wrote 937 buffers (5.7%); 0 WAL file(s) added, 0 removed, 0 recycled; write=92.980 s, sync=5.324 s, total=103.554 s; sync files=37, longest=1.196 s, average=0.144 s; distance=9769 kB, estimate=24358 kB; lsn=2/10093FC8, redo lsn=2/FB9F3E8


8 days ago

Apologies but this looks like an issue with the application level code. Due to volume, we can only answer platform level issues.

I've made this thread public so that the community might be able to help with you query.


Status changed to Awaiting User Response railway[bot] 8 days ago


8 days ago

Hello :-)

What do you mean? I don't understand, "application level code".

Regardless, looking at the logs, it's giving an error when accessing postgres, which I don't even have access to. It seems to me to be something from the N8N core.

Look, I'd like some support on this, since I'm not a CLOUD admin.

My N8N is unstable. Its infrastructure is managed by Railsway. Is there anyone at Railsway who can give me support? If not, I'll migrate my flow to another cloud and solve the problem.

I'll summarize the problem in other words here in case it wasn't clear:

1 - Over the weekend, my webhooks returned 404. Is there any way I can see what happened?

2 - The cloud execution logs simply disappeared. See the print above.

Can any Railsway employee help me here?

Thanks, everyone. Good morning.