Topic: Distributed Checkpoint: Efficient checkpointing in large-scale jobs