ROLLBACK RECOVERY MECHANISM

Boomi Nathan
1 Min Read
Disclosure: This website may contain affiliate links, which means I may earn a commission if you click on the link and make a purchase. I only recommend products or services that I personally use and believe will add value to my readers. Your support is appreciated!

Log-Based rollback Recovery Mechanism

Checkpoint-Based Rollback Recovery Mechanism

In checkpointing based rollback recovery is a well-established technique to deal with process failures and increase the system reliability and fault-tolerance in distributed systems [23]. In this approach, the state of each process in the system is periodically saved on stable storage, which is called a checkpoint of a process. To recover from a failure, the system restarts its execution from a previous error-free, consistent global state [3]. In a distributed system, since the processes in the system do not share memory, a global state of the system is defined as a set of local states, one from each process. The processes exchange information with each other through messages. A global state is said to be “consistent” if it contains no orphan message; i.e., a message whose receive event is recorded, but its send event is lost [3]. There are several applications of checkpointing including: rollback recovery, playback debugging, process migration, job swapping and load balancing

Checkpointing Related Notations

Checkpoint Algorithms Assumptions

Share This Article

J. BoomiNathan is a writer at SenseCentral who specializes in making tech easy to understand. He covers mobile apps, software, troubleshooting, and step-by-step tutorials designed for real people—not just experts. His articles blend clear explanations with practical tips so readers can solve problems faster and make smarter digital choices. He enjoys breaking down complicated tools into simple, usable steps.

Leave a review