3 months ago
Very strange thing just happened; two of our services (which are not inter-dependent at all) suddenly appeared down on monitoring (timeout), which hits health endpoint over private networking.
Attempting to connect to the services via public networking failed with 502 after a while.
Looking at the deploy logs of the affected services, they didn't log anything at all since around 12:10 CET — despite being quite chatty usually. No errors were visible, it's like they suddenly froze up.
After redeploying the services, they came back to life immediately.
Any idea what happened? Don't think this was caused by anything on our end
2 Replies
3 months ago
were the metrics fine at that time? memory/cpu all good?
Yeah memory / cpu were all good. I saw in retrospect that we received this email right around the time the services went down, but only for 1/2. very weird. :/
Attachments