HBASE-29039 Seek past delete markers instead of skipping one at a time by junegunn · Pull Request #8001 · apache/hbase

junegunn · 2026-03-29T01:13:09Z

Context

HBASE-30036 (#7993) consolidates redundant delete markers on flush, preventing them from growing unbounded in HFiles. However, markers still accumulate in the memstore before flush, degrading read performance. HBASE-29039 addresses this from the read path side. Both are needed for full coverage. There is an open PR (#6557), but the review process has been stalled. This is an alternative approach with fewer code changes, hopefully making it easier to reach consensus.

Test result

Using the test code in HBASE-30036.

`DeleteFamily`

Substantial read performance improvement before flushes.
Without HBASE-30036, delete markers still accumulate in store files.

`DeleteColumnContiguous`

Substantial read performance improvement before flushes.
Without HBASE-30036, delete markers still accumulate in store files.

`DeleteColumnInterleaved`

No difference, as expected. Already triggers SEEK_NEXT_COL via the masked put.

Description

When a DeleteColumn or DeleteFamily marker is encountered during a normal user scan, the matcher currently returns SKIP, forcing the scanner to advance one cell at a time. This causes read latency to degrade linearly with the number of accumulated delete markers for the same row or column.

Since these are range deletes that mask all remaining versions of the column, seek past the entire column immediately via columns.getNextRowOrNextColumn(). This is safe because cells arrive in timestamp descending order, so any puts newer than the delete have already been processed.

For DeleteFamily, also fix getKeyForNextColumn in ScanQueryMatcher to bypass the empty-qualifier guard (HBASE-18471) when the cell is a DeleteFamily marker. Without this, the seek barely advances past the current cell instead of jumping to the first real qualified column.

The optimization is skipped when:

seePastDeleteMarkers is true (KEEP_DELETED_CELLS)
newVersionBehavior is enabled (sequence IDs determine visibility)
the delete marker is not tracked (visibility labels)

junegunn · 2026-03-29T03:25:48Z

TestVisibilityLabelsWithDeletes is failing, which likely explains the additional changes in #6557. I'll try to fix it, but if it ends up resembling the previous approach, I'll drop this.

When a DeleteColumn or DeleteFamily marker is encountered during a normal user scan, the matcher currently returns SKIP, forcing the scanner to advance one cell at a time. This causes read latency to degrade linearly with the number of accumulated delete markers for the same row or column. Since these are range deletes that mask all remaining versions of the column, seek past the entire column immediately via columns.getNextRowOrNextColumn(). This is safe because cells arrive in timestamp descending order, so any puts newer than the delete have already been processed. For DeleteFamily, also fix getKeyForNextColumn in ScanQueryMatcher to bypass the empty-qualifier guard (HBASE-18471) when the cell is a DeleteFamily marker. Without this, the seek barely advances past the current cell instead of jumping to the first real qualified column. The optimization is only applied with plain ScanDeleteTracker, and skipped when: - seePastDeleteMarkers is true (KEEP_DELETED_CELLS) - newVersionBehavior is enabled (sequence IDs determine visibility) - visibility labels are in use (delete/put label mismatch)

junegunn · 2026-03-29T03:42:15Z

TestVisibilityLabelsWithDeletes is failing

Fixed by:

-          !seePastDeleteMarkers && !(deletes instanceof NewVersionBehaviorTracker)
+          !seePastDeleteMarkers && deletes.getClass() == ScanDeleteTracker.class

junegunn marked this pull request as draft March 29, 2026 03:08

junegunn force-pushed the HBASE-29039-alt branch from 018a268 to 3a87682 Compare March 29, 2026 03:30

junegunn marked this pull request as ready for review March 29, 2026 03:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HBASE-29039 Seek past delete markers instead of skipping one at a time#8001

HBASE-29039 Seek past delete markers instead of skipping one at a time#8001
junegunn wants to merge 1 commit intoapache:masterfrom
junegunn:HBASE-29039-alt

junegunn commented Mar 29, 2026 •

edited

Loading

Uh oh!

junegunn commented Mar 29, 2026

Uh oh!

junegunn commented Mar 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

junegunn commented Mar 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Test result

DeleteFamily

DeleteColumnContiguous

DeleteColumnInterleaved

Description

Uh oh!

junegunn commented Mar 29, 2026

Uh oh!

junegunn commented Mar 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

junegunn commented Mar 29, 2026 •

edited

Loading

`DeleteFamily`

`DeleteColumnContiguous`

`DeleteColumnInterleaved`