Promql cpp functions by cherep58 · Pull Request #272 · deckhouse/prompp

cherep58 · 2026-03-24T15:45:19Z

Summary

Implement C++ pushdown optimizations for PromQL functions at the storage layer. When the query engine provides SelectHints.Func, the storage can now reduce data before it reaches the Go PromQL engine — returning only the samples needed for the specific function instead of the full series.

New decorator iterators

Aggregation: MinOverTimeIterator, MaxOverTimeIterator, LastOverTimeIterator, SumOverTimeIterator (via OverTimeFuncIterator template with pluggable handlers)
Counter/rate family: RateIterator (rate, increase), IRateIterator (irate, idelta)
Transform: DeltaIterator, ChangesIterator, ResetsIterator
Downsampling: DownsamplingDecodeIterator — interval-based sample reduction

Refactored decode iterator infrastructure

CRTP-based DecodeIteratorTrait with seek(), seek_to(), invalidate(), set() methods
SeekResult enum for composable seek logic (kUpdateSample, kNext, kStop, kUpdateSampleNextAndStop)
UniversalDecodeIterator extended with seek(), seek_to(), invalidate(), set() that dispatch through std::visit
Outer DecodeIterator variant wrapping all iterator types for the Go binding layer

Function dispatch

gperf-generated perfect hash (FunctionNamesHash) for O(1) function name lookup
create_decode_iterator() dispatches on SelectHints.Func to construct the appropriate iterator

Go bridge changes

SelectHints passed through cppbridge via ABI-compatible GenericSelectHints<Go::String, Go::SliceView>
downsamplingMs parameter added to Query() and ChunkRecoder paths
DataStorageSerializedDataIterator control block simplified: direct Timestamp()/Value() accessors replacing raw field access
Namespace change: entrypoint::head → entrypoint::series_data

Testing

Parametric C++ unit tests for each new iterator (boundary conditions, StaleNaN handling, time interval filtering, reset scenarios)
Go integration tests for all dispatched PromQL functions (downsampling, min/max/last/sum_over_time, rate, increase, changes, delta, irate, idelta, resets)

…ncIterator

# Conflicts: # pp/entrypoint/go_constants.h # pp/entrypoint/head/serialization.h # pp/go/cppbridge/entrypoint.h # pp/go/cppbridge/head.go # pp/go/cppbridge/head_test.go # pp/head/chunk_recoder.h # pp/series_data/decoder.h # pp/series_data/decoder/decorator/interval_decode_iterator.h # pp/series_data/decoder/traits.h # pp/series_data/decoder/universal_decode_iterator.h # pp/series_data/serialization/serialized_data.h

# Conflicts: # pp/go/storage/querier/querier.go

gshigin · 2026-05-05T12:26:22Z

  const auto out = static_cast<Result*>(res);

-  RangeQuerierWithArgumentsWrapperV2 querier(*in->data_storage, in->query, out->serialized_data);
+  RangeQuerierWithArgumentsWrapperV2 querier(*in->data_storage, in->query, *in->hints, out->serialized_data, in->downsampling_ms);


Do we always know that hints are valid? Go's SelectHints states that this parameter is optional. Even in our code (pp_api.go) : seriesSet := q.Select(ctx, false, nil, matchers...), where hint is nil

Hints is mandatory parameter for range query in cpp. I will research go code and fix nil hints, thanks)

# Conflicts: # pp/go/storage/querier/merge_series_set_test.go

vporoshok

Review: WindowFunctionIterator sub-interval logic vs ADR-001

Analyzed the sub-interval splitting algorithm in WindowFunctionIterator against ADR-001 (range function iteration algorithm). The code works correctly for the most common case (s < w < 2s), but has critical issues in two other cases.

Summary of findings

#	Severity	Finding
1	Bug/Critical	`advance_interval()` breaks for `w > 2s` — produces invalid interval (min > max), potential infinite loop
2	Bug/Critical	Incorrect boundaries for `w < s` — aggregation captures samples from gaps between windows
3	Bug/Moderate	Possible double-count for `sum_over_time` at interval boundaries (depends on `seek_to` inclusive/exclusive semantics)
4	Design	`count_over_time` missing from dispatch (falls back to universal iterator — correct but no pushdown)
5	Design	`sum_over_time` result timestamp is last raw sample's ts, not sub-interval end
6	Observation	min/max sub-interval collapsing optimization from ADR not implemented (correct but sub-optimal)

vporoshok · 2026-05-07T21:10:56Z

+  [[nodiscard]] PROMPP_ALWAYS_INLINE TimeInterval advance_interval() const noexcept {
+    auto interval = iterator_.interval();
+    if (interval.difference() == parameters_->step) {
+      interval.min = interval.max;
+
+      const auto diff = parameters_->range - parameters_->step;
+      interval.max += (diff == 0) ? parameters_->step : diff;
+    } else {
+      interval.min = std::exchange(interval.max, next_interval_boundary(interval.min));


Bug (critical): advance_interval() produces invalid intervals when w > 2s

Trace for w = 9, s = 4, d = 1 (e.g. max_over_time(m[9m]) with 4m step):

1st interval: [c, c+4] len 4 = s ✓ 2nd: diff == step → [c+4, c+9] len 5 = w−s 3rd: diff ≠ step → min = c+9, max = next_boundary(c+4) = c+8 → [c+9, c+8] min > max! INVALID

When w − s > s, the second interval overshoots the next grid boundary. next_interval_boundary(old.min) returns a value less than the current max, producing an inverted interval. The advance() loop will either spin forever or produce UB on the unsigned difference.

Root cause: advance_interval alternates between adding step and w − s, but assumes w − s ≤ s. This breaks for w > 2s.

Per ADR-001, when w > s and d ≠ 0, the boundary grid should alternate at d and s − d intervals (not s and w − s). For w > 2s the grid still works: boundaries at a + n·s and a + n·s − d with sub-interval lengths always d and s − d.

vporoshok · 2026-05-07T21:10:56Z

+  [[nodiscard]] PROMPP_ALWAYS_INLINE static Timestamp next_interval_boundary(Timestamp start, Timestamp step_ms, Timestamp range_ms) noexcept {
+    if (range_ms <= step_ms) [[likely]] {
+      return start + range_ms;
+    }
+
+    return start + range_ms - (range_ms - step_ms);


Bug (critical): next_interval_boundary() produces wrong grid for w < s

When w ≤ s, this returns start + w. Trace for w = 2, s = 5:

Go windows: (c, c+2], (c+5, c+7], (c+10, c+12], ... (gaps of 3) Code produces: [c, c+2] ✓ (first window) [c+2, c+2] len 0 → skip [c+2, c+4] ← WRONG: not aligned to any Go window! [c+4, c+4] len 0 → skip [c+4, c+6] ← partially in gap (c+4,c+5] + partially in window (c+5,c+6]

The function creates a grid with spacing w regardless of s. But per ADR-001 case 3 (w < s), boundaries should be at a + n·s and a + n·s − w, giving alternating sub-intervals of length s − w (gap, skip) and w (actual window). The correct grid spacing is driven by s, not w.

Suggestion: next_interval_boundary should always produce boundaries from the grid {a + n·s, a + n·s − d} where d = w % s.

vporoshok · 2026-05-07T21:10:56Z

+    }
+
+    kahan_sum_inc(value, sum_.value, c_);
+    sum_.timestamp = timestamp;


Moderate: possible double-count at interval boundaries

Consecutive sub-intervals share a boundary: new interval's min = previous interval's max. OverTimeFuncIterator::find_element() calls seek_to(interval_.min) which positions at the first sample ≥ min.

If a sample has a timestamp exactly equal to the shared boundary, it would be included in both adjacent sub-intervals. For max/min this is idempotent, but for sum_over_time it causes a double-count.

Please verify: does seek_to position at > min (exclusive) or ≥ min (inclusive)?

vporoshok · 2026-05-07T21:10:56Z

+    case kChanges:
+      return DecodeIterator(std::in_place_type<DecodeIterator::ChangesIterator>, select_hints.function_parameters);
+
+    default:


Design: count_over_time not dispatched

ADR-001 lists count_over_time in the per-function output contract. It's not in the WindowFunction enum and falls through to the default branch here (universal iterator). This is functionally correct but misses the pushdown optimization.

Considering it only needs a count per sub-interval (similar to sum_over_time), it would be a small addition.

cherep58 added 30 commits March 23, 2026 16:11

update gcc to 14.2.0

f2dc0d7

update clang-tidy to 21.1.8

76f1d93

added clang-tidy bugprone-* diagnostics

5109b0b

changed logic of IntervalDecodeIterator

685fe57

added downsamplingMs parameter into Go-bindings

4bae01e

added GO-test for downsampling

b41272d

optimized IntervalDecodeIterator

e1876b3

renamed IntervalDecodeIterator to DownsamplingDecodeIterator

2ab9506

optimized DownsamplingDecodeIterator

1a52eb2

review fixes

8e75529

review fixes

3802273

added downsampling feature in ChunkRecoder

518e1f8

fixed compilation error

947e996

added SelectHints to Go-binding

d168fa5

refactored DownsamplingDecodeIterator

eb872f1

created DecodeIterator for Go-bindings

6d46395

created MinOverTimeIterator

17c94f8

created MaxOverTimeIterator

7768c98

created LastOverTimeIterator

17020f2

created SumOverTimeIterator

7a1d865

refactored over_time_func iterators

93bb6df

removed test metrics

3ec6f5c

reformatted

70185e9

removed invalid test case

62335ca

created DecodeIteratorTrait::seek_to method and used it in OverTimeFu…

8a9d5e5

…ncIterator

created RateIterator

122cb6f

created ChangesIterator

1962d02

created DeltaIterator

e072ec6

added increase promql function handling

b43e26f

created IRateIterator

afe8d8c

cherep58 marked this pull request as ready for review April 3, 2026 15:48

cherep58 added 8 commits April 3, 2026 18:59

fixed golint error

07af82c

fixed gcc false-positive warning

aa756c5

Merge branch 'pp' into promql_cpp

5eaa9bb

Merge branch 'pp' of https://github.com/deckhouse/prompp into promql_cpp

9ddb76a

added continue after reset iterator unit test for WindowFunctionIterator

44d0566

Merge branch 'pp' of https://github.com/deckhouse/prompp into promql_cpp

85c4275

fixed clang-tidy warning

7d3ee73

changed logic of SumOfElements

cf2ce92

cherep58 marked this pull request as draft April 24, 2026 08:27

cherep58 mentioned this pull request Apr 28, 2026

Chunk recoder optimization #297

Merged

cherep58 added 4 commits April 29, 2026 14:24

refactoring after merge

a69c5f2

created benchmark ChunkRecoderWithDownsampling

ad613ac

removed ChunkRecoder test from performance tests

803a812

cherep58 marked this pull request as ready for review April 29, 2026 14:15

cherep58 added 2 commits April 29, 2026 17:26

fixed clang-tidy warnings

1d197de

Merge branch 'pp' of https://github.com/deckhouse/prompp into promql_cpp

dd7ea66

# Conflicts: # pp/go/storage/querier/querier.go