Cron Missed Completely, No Logs
xevion
HOBBYOP

2 years ago

I worked on a cron application yesterday that fired at 6AM CST correctly on the first night, but did not do it again last night.

49 Replies

xevion
HOBBYOP

2 years ago

3c0425aa-9fe4-449a-a161-8a1efb3b53ee


xevion
HOBBYOP

2 years ago

The schedule is 0 11 * * *, which means everyday at 11:00 AM UTC.


xevion
HOBBYOP

2 years ago

Logs show the last invocation was on June 1st at 6:01 AM CST (correct, although 1 minute late?).


xevion
HOBBYOP

2 years ago

The expected last invocation would actually be June 2nd at 6:00AM CST, but there is no activity, logs, or even an invocation on the dashboard.


xevion
HOBBYOP

2 years ago

I only found out about this because my cron monitor raised an issue.


xevion
HOBBYOP

2 years ago

The history below doesn't show anything interesting, if you're curious.

1246964730141343700


brody
EMPLOYEE

2 years ago

from my understanding, there are too many jobs being ran at 11am utc that some get skipped, until the team addresses this i would recommend switching to an in-code scheduler


xevion
HOBBYOP

2 years ago

are you serious? i literally just undid the node cron scheduler because it was consuming 50 MB memory constantly and i thought it'd be fun to move away from that


xevion
HOBBYOP

2 years ago

not mad at you, just… that's pretty sucky


brody
EMPLOYEE

2 years ago

i feel you, you could also try another time? 10:30am utc?


xevion
HOBBYOP

2 years ago

Yeah, I was thinking that some weird off-color time would be less likely to incur issues.


xevion
HOBBYOP

2 years ago

Something like XX:48


xevion
HOBBYOP

2 years ago

The whole job is done in 5 seconds usually.


brody
EMPLOYEE

2 years ago

yep you got the right idea


xevion
HOBBYOP

2 years ago

Gonna try 10:48 UTC and see what happens.


brody
EMPLOYEE

2 years ago

sounds good!


brody
EMPLOYEE

2 years ago

i have also sent this thread in a thread i have with cooper for gathering cron issues like yours


xevion
HOBBYOP

2 years ago

Alrighty; just my thought: a little detail about this being skipped, or likely to be skipped, or some transparency on the issues with cron would be nice.


xevion
HOBBYOP

2 years ago

I don't mind that Railway's platform is in need of improvement; but letting users explore until they hit a landmine isn't ideal.


brody
EMPLOYEE

2 years ago

i assume they had never thought they would be over scheduled so they never designed error handing and the ui around it


xevion
HOBBYOP

2 years ago

Kinda an interesting problem to think about in retrospect.


brody
EMPLOYEE

2 years ago

ideally the only issues that you could get out of a cron job would be an issue with the build or deploy


xevion
HOBBYOP

2 years ago

Maybe a check-mark that says "I'm okay with this being rescheduled slightly" would be good.
Crons that are more important could be charged at a higher rate, but they'll be prioritized on runners or whatever.


xevion
HOBBYOP

2 years ago

If you have 600 jobs every day at 11AM UTC, spinning up tons of machines to work on them is not exactly smart. Especially when most of them are tiny jobs.


xevion
HOBBYOP

2 years ago

Working in bursts would be better. And working early + late, like queueing. Start at 10:58 or even earlier to start executing.


brody
EMPLOYEE

2 years ago

im sure they have more than 600 at 11am utc, and if i recall correctly, its only a single schedular on their backplane


xevion
HOBBYOP

2 years ago

aha i really have no idea about the scale railway works at tbh


brody
EMPLOYEE

2 years ago

i dont really either, im just going off the crumbs they give us


brody
EMPLOYEE

2 years ago

i mean they do tell us a fair bit, but more info can never hurt in our position of community help


xevion
HOBBYOP

2 years ago

@Brody

1249188566031138800


xevion
HOBBYOP

2 years ago

This is pretty sucky as crons go.


xevion
HOBBYOP

2 years ago

I'm not sure what's going on, actually; I cannot tell if Railway is to fault here.


xevion
HOBBYOP

2 years ago

Nevermind, seems like something with Sentry is going wrong?
Error while running backup: AxiosError: connect EHOSTUNREACH 34.120.195.249:443


brody
EMPLOYEE

2 years ago

host unreachable eh? you aren't the first person to see this error even after they resolved the incident


brody
EMPLOYEE

2 years ago

are you on the legacy or v2 runtime? check your service settings, if legacy, switch it to v2


xevion
HOBBYOP

2 years ago

Got it, switched it to V2. Didn't know that was a setting lol.


brody
EMPLOYEE

2 years ago

just for clarity, the v2 runtime has been confirmed to fix host unreachable, but it has no impact on cron being skipped since that's a completely different system


xevion
HOBBYOP

2 years ago

👍


xevion
HOBBYOP

a year ago

Working good!

1253090840746659800


xevion
HOBBYOP

a year ago

No failures since.


brody
EMPLOYEE

a year ago

they made changes to the cron scheduler too


xevion
HOBBYOP

a year ago

1255748537069736000


brody
EMPLOYEE

a year ago

the changes they made did not help 😦


brody
EMPLOYEE

a year ago

but thank you for trying


xevion
HOBBYOP

a year ago

i mean i guess at least it happened the same day?


xevion
HOBBYOP

a year ago

lol


xevion
HOBBYOP

a year ago

very odd


brody
EMPLOYEE

a year ago

lol


brody
EMPLOYEE

a year ago

back to in code


Loading...