Messy. If you are doing this kind of shit for real (and I mean real as in
the 'we have 300 seconds to get back on our feet or the furnace is
6000ft into the upper atmosphere' type real) then you are taking snapshots
of both the program and data onto remote machines. (Thats quite easy to do
with a log based fs). You are also making synchronization points and each
of these you mark the file log as 'believed coherent' so that a crash
won't leave junk on the files to fool a restart.
A logging file system is all part of a dump/restore recovery mechanism. A
blind recovery can be very bad - even for non critical systems. A crash
and restart of a large engineering model could slip a small but critical
error into the calculations by seeking to the wrong place, writing a value
and crashing because of an earlier error that causes the bad write . You might
do a few weeks further modelling before you see that error.
Alan