Crash Fault Tolerance
Crash fault tolerance refers to the ability of a distributed system to continue functioning correctly despite the sudden failure or stopping of one or more of its nodes. Unlike Byzantine fault tolerance, which accounts for malicious behavior, crash fault tolerance assumes that nodes are either functioning correctly or have stopped entirely.
This model is often sufficient for permissioned financial networks or private blockchains where the identity of participants is known and verified. Protocols like Raft or Paxos are classic examples of crash fault-tolerant algorithms used to manage distributed logs and coordinate updates.
In these systems, as long as a majority of nodes are online and communicating, the service remains available and consistent. While less secure against active attacks than BFT, crash-tolerant systems are often more performant and easier to implement for specific enterprise use cases.
They ensure that ledger updates are not lost due to hardware crashes or network outages. This approach balances the need for high availability with the practical requirements of controlled financial environments.