fix(l2): remove one-time checkpoint if already exists #5083

ilitteri · 2025-10-28T16:14:34Z

Motivation

In a previous PR, DB checkpoints were introduced to ensure old state availability in the current path-based fashion. Every time a batch is sealed, a checkpoint whose state is the state of the latest block of the sealed batch is created to be used in the next batch.

The checkpoint is needed in two different steps of the batch commitment: for batch preparation (this is essentially building the batch) and for witness generation. Both steps need a non-modified checkpoint, but they both need to modify the checkpoint to be able to re-execute the batch.

As batch preparation occurs before witness generation, we opted to create a one-time checkpoint out of the main checkpoint that can be modified during batch preparation if needed (sometimes the batch was already available in the DB, and there's no need to re-execute anything); then, witness generation modifies the original checkpoint as needed because it is no longer needed.

Once the one-time checkpoint fulfills its purpose, it is removed. Currently, if batch preparation fails, the one-time checkpoint is not removed, and after retrying batch preparation, there's another attempt at creating the one-time checkpoint, which ends in an error because the directory already exists. We need to either avoid creating the one-time checkpoint again or to remove the existing one.

Description

Remove the existing one-time checkpoint if it already exists.

Copilot

Pull Request Overview

This PR addresses a bug where batch preparation retries would fail due to an existing one-time checkpoint directory. The fix ensures that if a one-time checkpoint already exists (from a previous failed attempt), it is removed before attempting to create a new one.

Key Changes:

Added a check to detect if a one-time checkpoint directory already exists
Implemented removal of existing one-time checkpoint directories before creating new ones

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-10-28T16:14:57Z

crates/l2/sequencer/l1_committer.rs


+        if one_time_checkpoint_path.exists() {
+            remove_dir_all(&one_time_checkpoint_path).map_err(|e| {
+                CommitterError::FailedToCreateCheckpoint(format!(


The error type FailedToCreateCheckpoint is misleading when removing an existing checkpoint. Consider creating a more specific error variant like FailedToRemoveCheckpoint or using a more generic error message that reflects the cleanup operation.

Suggested change

CommitterError::FailedToCreateCheckpoint(format!(

CommitterError::FailedToRemoveCheckpoint(format!(

github-actions · 2025-10-28T16:17:25Z

Lines of code report

Total lines added: 0
Total lines removed: 9
Total lines changed: 9

Detailed view

+--------------------------------------------+-------+------+
| File                                       | Lines | Diff |
+--------------------------------------------+-------+------+
| ethrex/crates/l2/sequencer/l1_committer.rs | 973   | -9   |
+--------------------------------------------+-------+------+

crates/l2/sequencer/l1_committer.rs

avilagaston9 · 2025-10-29T13:39:05Z

The PR was changed to use a random one_time_checkpoint_path on each attempt, preventing the following error:

2025-10-28T17:55:42.302454Z ERROR ethrex_l2::sequencer::l1_committer: L1 Committer Error: Committer failed retrieve block from storage: Failed to open RocksDB: IO error: lock hold by current process, acquire time 1761674125 acquiring thread 50: /root/.local/share/ethrex/temp_checkpoint_batch_1/LOCK: No locks available

tomip01 · 2025-10-29T13:41:50Z

crates/l2/sequencer/l1_committer.rs

+                    .inspect_err(|_| {
+                        if one_time_checkpoint_path.exists() {
+                            // Remove one-time checkpoint directory
+                            let _ = remove_dir_all(&one_time_checkpoint_path);
+                        }
+                    })?;
+
+                if one_time_checkpoint_path.exists() {
+                    remove_dir_all(&one_time_checkpoint_path).map_err(|e| {
+                        CommitterError::FailedToCreateCheckpoint(format!(
+                            "Failed to remove one-time checkpoint directory {one_time_checkpoint_path:?}: {e}"
+                        ))
+                    })?;


If we remove the one_time_checkpoint_path whether or not it returns an error, we can do it in one place

Done bd44892!

**Motivation** In a previous PR, DB checkpoints were introduced to ensure old state availability in the current path-based fashion. Every time a batch is sealed, a checkpoint whose state is the state of the latest block of the sealed batch is created to be used in the next batch. The checkpoint is needed in two different steps of the batch commitment: for batch preparation (this is essentially building the batch) and for witness generation. Both steps need a non-modified checkpoint, but they both need to modify the checkpoint to be able to re-execute the batch. As batch preparation occurs before witness generation, we opted to create a one-time checkpoint out of the main checkpoint that can be modified during batch preparation if needed (sometimes the batch was already available in the DB, and there's no need to re-execute anything); then, witness generation modifies the original checkpoint as needed because it is no longer needed. Once the one-time checkpoint fulfills its purpose, it is removed. Currently, if batch preparation fails, the one-time checkpoint is not removed, and after retrying batch preparation, there's another attempt at creating the one-time checkpoint, which ends in an error because the directory already exists. We need to either avoid creating the one-time checkpoint again or to remove the existing one. **Description** Remove the existing one-time checkpoint if it already exists. --------- Co-authored-by: avilagaston9 <[email protected]> Co-authored-by: Gianbelinche <[email protected]>

Fix

978542b

ilitteri self-assigned this Oct 28, 2025

ilitteri requested a review from a team as a code owner October 28, 2025 16:14

Copilot AI review requested due to automatic review settings October 28, 2025 16:14

github-actions bot added the L2 Rollup client label Oct 28, 2025

github-project-automation bot added this to ethrex_l2 Oct 28, 2025

Copilot AI reviewed Oct 28, 2025

View reviewed changes

gianbelinche approved these changes Oct 28, 2025

View reviewed changes

avilagaston9 approved these changes Oct 28, 2025

View reviewed changes

crates/l2/sequencer/l1_committer.rs Outdated Show resolved Hide resolved

avilagaston9 added 2 commits October 29, 2025 10:29

generate one time checkpoints at random dirs

3b51a52

Fix typo

5dd41fd

tomip01 reviewed Oct 29, 2025

View reviewed changes

Refactor error handling

bd44892

tomip01 approved these changes Oct 29, 2025

View reviewed changes

avilagaston9 and others added 7 commits October 29, 2025 11:11

Only remove directory if it exists

51b52e5

Add one time checkpoint for witness

c6d6357

Delete checkpoints when batch is verified

cc15cf2

Merge branch 'main' into fix_checkpoint

0f5bce0

Merge branch 'main' into fix_checkpoint

9ce8500

Revert checkpoints removal

02ac09d

Remove wrong validations

1ca0647

ilitteri added this pull request to the merge queue Oct 29, 2025

Merged via the queue into main with commit ee97e8e Oct 29, 2025
27 checks passed

ilitteri deleted the fix_checkpoint branch October 29, 2025 23:49

github-project-automation bot moved this to Done in ethrex_l2 Oct 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(l2): remove one-time checkpoint if already exists #5083

fix(l2): remove one-time checkpoint if already exists #5083

Uh oh!

ilitteri commented Oct 28, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 28, 2025

Uh oh!

github-actions bot commented Oct 28, 2025 •

edited

Loading

Uh oh!

Uh oh!

avilagaston9 commented Oct 29, 2025

Uh oh!

tomip01 Oct 29, 2025

Uh oh!

avilagaston9 Oct 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

	CommitterError::FailedToCreateCheckpoint(format!(
	CommitterError::FailedToRemoveCheckpoint(format!(

fix(l2): remove one-time checkpoint if already exists #5083

fix(l2): remove one-time checkpoint if already exists #5083

Uh oh!

Conversation

ilitteri commented Oct 28, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Lines of code report

Uh oh!

Uh oh!

avilagaston9 commented Oct 29, 2025

Uh oh!

tomip01 Oct 29, 2025

Choose a reason for hiding this comment

Uh oh!

avilagaston9 Oct 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

github-actions bot commented Oct 28, 2025 •

edited

Loading