Cron Missed Completely, No Logs

from my understanding, there are too many jobs being ran at 11am utc that some get skipped, until the team addresses this i would recommend switching to an in-code scheduler

xevion

HOBBYOP

2 years ago

are you serious? i literally just undid the node cron scheduler because it was consuming 50 MB memory constantly and i thought it'd be fun to move away from that

xevion

HOBBYOP

2 years ago

not mad at you, just... that's pretty sucky

brody

EMPLOYEE

2 years ago

i feel you, you could also try another time? 10:30am utc?

xevion

HOBBYOP

2 years ago

Yeah, I was thinking that some weird off-color time would be less likely to incur issues.

xevion

HOBBYOP

2 years ago

Something like XX:48

xevion

HOBBYOP

2 years ago

The whole job is done in 5 seconds usually.

brody

EMPLOYEE

2 years ago

yep you got the right idea

xevion

HOBBYOP

2 years ago

Gonna try 10:48 UTC and see what happens.

brody

EMPLOYEE

2 years ago

sounds good!

brody

EMPLOYEE

2 years ago

i have also sent this thread in a thread i have with cooper for gathering cron issues like yours

xevion

HOBBYOP

2 years ago

Alrighty; just my thought: a little detail about this being skipped, or likely to be skipped, or some transparency on the issues with cron would be nice.

xevion

HOBBYOP

2 years ago

I don't mind that Railway's platform is in need of improvement; but letting users explore until they hit a landmine isn't ideal.

brody

EMPLOYEE

2 years ago

i assume they had never thought they would be over scheduled so they never designed error handing and the ui around it

xevion

HOBBYOP

2 years ago

Kinda an interesting problem to think about in retrospect.

brody

EMPLOYEE

2 years ago

ideally the only issues that you could get out of a cron job would be an issue with the build or deploy

xevion

HOBBYOP

2 years ago

Maybe a check-mark that says "I'm okay with this being rescheduled slightly" would be good.

Crons that are more important could be charged at a higher rate, but they'll be prioritized on runners or whatever.

xevion

HOBBYOP

2 years ago

If you have 600 jobs every day at 11AM UTC, spinning up tons of machines to work on them is not exactly smart. Especially when most of them are tiny jobs.

xevion

HOBBYOP

2 years ago

Working in bursts would be better. And working early + late, like queueing. Start at 10:58 or even earlier to start executing.

brody

EMPLOYEE

2 years ago

im sure they have more than 600 at 11am utc, and if i recall correctly, its only a single schedular on their backplane

xevion

HOBBYOP

2 years ago

aha i really have no idea about the scale railway works at tbh

brody

EMPLOYEE

2 years ago

i dont really either, im just going off the crumbs they give us

brody

EMPLOYEE

2 years ago

i mean they do tell us a fair bit, but more info can never hurt in our position of community help

xevion

HOBBYOP

2 years ago

@Brody

1249188566031138866

xevion

HOBBYOP

2 years ago

This is pretty sucky as crons go.

xevion

HOBBYOP

2 years ago

I'm not sure what's going on, actually; I cannot tell if Railway is to fault here.

xevion

HOBBYOP

2 years ago

Nevermind, seems like something with Sentry is going wrong?

Error while running backup: AxiosError: connect EHOSTUNREACH 34.120.195.249:443

brody

EMPLOYEE

2 years ago

host unreachable eh? you aren't the first person to see this error even after they resolved the incident

brody

EMPLOYEE

2 years ago

are you on the legacy or v2 runtime? check your service settings, if legacy, switch it to v2

xevion

HOBBYOP

2 years ago

Got it, switched it to V2. Didn't know that was a setting lol.

brody

EMPLOYEE

2 years ago

just for clarity, the v2 runtime has been confirmed to fix host unreachable, but it has no impact on cron being skipped since that's a completely different system

xevion

HOBBYOP

2 years ago

👍

xevion

HOBBYOP

2 years ago

Working good!

1253090840746659881