prefect exit with code -11 (SIGSEGV) — silent, no OOMKilled or stderr trace

Question

Child Python subprocesses spawned by a long-lived Prefect process worker (prefect worker start --pool ...) exit silently with

code -11 (SIGSEGV), ~5–10 seconds after spawn, during Python module import. The parent worker container stays alive — only the

children die. Affects both prefect-worker and  prefect-ml-worker in our dev environment, on

python:3.11.8-slim-bookworm.

Evidence

- Loki shows 3 subprocesses starting within 30 ms (17:45:57.976, 17:45:58.005, 17:45:58.006), all reaching module-level imports,

then dying by 17:46:00.494.

- Zero matches across 28,709 worker log lines on 2026-05-18 for OOMKilled, Segmentation fault, Fatal Python error, terminated by

signal, etc. Crash record exists only in Prefect API logs (Flow run process exited with status code: -11).

- Railway list-deployments confirms container was NOT restarted at any crash time — same deployment ID through each crash window.
  - Image loads heavy native deps per child: grpcio 1.78.0, cryptography 46.0.5, protobuf 6.33.5, mlflow 3.10.1, opentelemetry

0.60b1. We suspect a C-extension init race under concurrent fork/exec.

Questions

1. Does Railway emit any platform-level event when a process inside a container is killed by the kernel (cgroup OOM, signal,

scheduler eviction) that wouldn't appear on stdout? We're not seeing OOMKilled anywhere.

2. What memory cap is configured on these two services? (list-services via your CLI returns "Connection reset by peer"

intermittently.)

3. Can the runtime capture a core dump from a child subprocess that segfaults?
  4. Have you seen this pattern from other customers on python:3.11.8-slim-bookworm + grpcio ≥ 1.78?