I work on a system made up of a slew of applications that all work together through event-based processing. However, a single application holds the state of the system and broadcasts the current state once a second. That works fine until it goes down, at which point the other applications receive no state at all until it comes back up. And when it does come back, it has lost the current state of the system, so that state is gone for good.
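To make the setup concrete, here is a rough sketch of the failure as I understand it. This is not our real code: `StateService`, `apply_event`, `broadcast`, and the `"pump"` key are all made-up names, and the once-a-second timer is replaced by explicit `broadcast()` calls so the example stays deterministic. The point is only that the state lives in one process's memory, so a restart starts from nothing.

```python
from typing import Callable, Dict, List

Snapshot = Dict[str, str]


class StateService:
    """Hypothetical stand-in for the single state application."""

    def __init__(self) -> None:
        # The state lives only in this process's memory (assumption).
        self.state: Snapshot = {}
        self.subscribers: List[Callable[[Snapshot], None]] = []

    def subscribe(self, callback: Callable[[Snapshot], None]) -> None:
        self.subscribers.append(callback)

    def apply_event(self, key: str, value: str) -> None:
        # Events from the other applications update the in-memory state.
        self.state[key] = value

    def broadcast(self) -> None:
        # In the real system this would run about once a second.
        snapshot = dict(self.state)
        for callback in self.subscribers:
            callback(snapshot)


def simulate_crash_and_restart() -> List[Snapshot]:
    """Return every snapshot a subscriber sees across a crash and restart."""
    received: List[Snapshot] = []

    service = StateService()
    service.subscribe(received.append)
    service.apply_event("pump", "running")
    service.broadcast()

    # Simulated crash: the old process is gone and a fresh one replaces it.
    service = StateService()
    service.subscribe(received.append)
    service.broadcast()

    return received
```

Calling `simulate_crash_and_restart()` gives one snapshot containing the `"pump"` entry followed by an empty snapshot, which is the behaviour I'm describing: after the restart, everything that was known before is gone.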
This seems problematic to me. Does anyone know of a way to remove this single point of failure?