this happened once or twice in dev and I want to make sure it doesn't happen in prod. should send a heartbeat if this failure state happens, so that diagnosis can be attempted.