Goldilocks by TomWambsgans · Pull Request #210 · leanEthereum/leanMultisig

TomWambsgans · 2026-05-04T14:02:29Z

No description provided.

Co-authored-by: Copilot <copilot@github.com>

Bring main's MTU-XMSS structure (tweak table, public_param, T-Sponge with replacement) into the goldilocks branch with all poseidon-related sizes halved: field-element widths main (KoalaBear) goldilocks ------------------ ----------------- ---------- TWEAK_LEN 2 1 XMSS_DIGEST_LEN 4 2 RANDOMNESS_LEN_FE 6 3 MESSAGE_LEN_FE 8 4 PUBLIC_PARAM_LEN_FE 4 2 POSEIDON1_WIDTH 16 8 DIGEST_LEN_FE 8 4 Tweak table slots are 2 FE (1 actual tweak FE + 1 zero pad). The packed tweak fits in a single 64-bit Goldilocks element via `(tweak_type << 42) | (sub_position << 32) | index`. Port main's poseidon precompile features (`half_output`, `hardcoded_offset_left`) from Poseidon16 to Poseidon8, with new committed columns for the flags and `effective_index_left_first/second`. The half-output trace tail values are filled in a post-pass from `memory_padded` (lookup-only — the AIR doesn't constrain them). Encoding decomposition uses the goldilocks-proven 21 chunks of W=3 bits per FE with a factored 1-bit canonical check `(diff)·(diff − 2^63) == 0`, applied to the first 2 of 4 output FE for exactly V = 42 chunks (no V_GRINDING). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…h, like the cubic extension of goldilocks)

…onky3#1606) Routes div_2exp_u64(1) through halve() instead of mul_2exp_u64(191), which walks the 96-entry POWERS_OF_TWO table and does a multiply. Microbench: ~0.95 ns/op -> ~0.43 ns/op (~2.2x throughput). The same upstream PR also makes halve() branchless; that change measured ~10% slower here on Zen 4 (LLVM already emits cmov for the simple `if x & 1 == 0` form), so it is not included. Co-authored-by: Robin Salen <salenrobin@gmail.com>

Symmetric counterpart to the div_2exp_u64 fast path. mul_2exp_u64(1) no longer indexes into POWERS_OF_TWO and does a full Goldilocks mul — it returns *self + *self. Microbench: ~0.55 ns/op -> ~0.43 ns/op (~25% faster).

Brings main into the goldilocks branch. The bulk of the work was porting main's PR #223 (duplex-sponge Fiat-Shamir) to the Goldilocks field, since goldilocks never adopted it. Conflict resolutions of note: - AIR trait: kept main's `n_shift_columns` / shift-columns-first layout; dropped the `low_degree` feature (goldilocks removed it — the Goldilocks poseidon8 AIR uses direct x^7 constraints, not `low_degree_block`). - extension_op/air.rs: cubic (DIM=3) layout reordered shift-columns-first. - Duplex Challenger ported to Goldilocks (WIDTH=8, RATE=4, CAPACITY=4); added a `Permutation` trait to the `symetric` crate. - New `poseidon8_permute` precompile: AIR (flag_permute column, outputs_left/right, mutex constraints), trace gen, ISA, simplifier. - Duplex `fiat_shamir.py` rewritten for DIGEST_LEN=4. - poseidon8 MAX_LOG_N_ROWS lowered 21 -> 20: the permute variant widened the table by 5 columns, which would otherwise exceed the WHIR commitment surface cap. cargo fmt + clippy clean; full `cargo test --workspace` passes; `recursion --n 2` aggregation runs end-to-end. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

TomWambsgans and others added 30 commits April 15, 2026 18:18

reduce degree AIR poseidon

3714bb6

wip

68e4e4c

wip

b9f7c21

test_plonky3_compatibility

bb7be6f

wip

82c624e

wip

d1c525f

degree 7 air (instead of 3) for poseidon

beaf0d6

w

c7448bc

wip

4b78c6e

wip

ae2401d

w

7771454

w

ff61a47

wip

ec26c52

w

4d91224

w

1daffe2

Merge branch 'main' into goldilocks

abc56ce

Co-authored-by: Copilot <copilot@github.com>

w

a635928

Co-authored-by: Copilot <copilot@github.com>

fix

ec4acbd

Merge remote-tracking branch 'origin/goldilocks' into goldilocks

26e06b9

low level optis

1663a9e

w

84f208b

w

80b3a98

2x faster poseidon

89a2dc5

much faster poseidn on avx512

6efc061

Merge commit 'a6f398eb3841acc74e424b788c0c50fd64df26f5' into goldilocks

7baaf62

w

c308fb6

better encoding

0470d7a

clippy

4c1209a

f

086ab06

TomWambsgans added 11 commits May 4, 2026 15:23

whir: remove folding pow grinding (not needed when field is big enoug…

37f052f

…h, like the cubic extension of goldilocks)

chunks of 2w in xmss

ddfd849

128 bit security

e5617e0

EFFECTIVE_TWO_ADICITY = 24

c0befb9

Merge branch 'main' into goldilocks

d2027e6

change whir params

41a1529

fix

6acacb7

Merge remote-tracking branch 'origin/main' into goldilocks

7a84afa

faster poseidon on avx

f1846dd

w

c155bc2

w

4fe8ea7

TomWambsgans force-pushed the goldilocks branch from ca75bbe to 4fe8ea7 Compare May 11, 2026 17:09

TomWambsgans and others added 4 commits May 11, 2026 22:05

w

8918c6b

Merge remote-tracking branch 'origin/main' into goldilocks

a890068

fast path for mul_2exp_u64(0) and mul_2exp_u64(1) in Goldilocks

dcef38d

Symmetric counterpart to the div_2exp_u64 fast path. mul_2exp_u64(1) no longer indexes into POWERS_OF_TWO and does a full Goldilocks mul — it returns *self + *self. Microbench: ~0.55 ns/op -> ~0.43 ns/op (~25% faster).

TomWambsgans force-pushed the goldilocks branch from 3aea9eb to dcef38d Compare May 14, 2026 06:08

TomWambsgans force-pushed the main branch from b01b199 to 0295672 Compare May 19, 2026 17:38

TomWambsgans force-pushed the goldilocks branch from 48a27e6 to 357c947 Compare May 19, 2026 17:41

log_size_guess = 19

e3432ae

TomWambsgans force-pushed the goldilocks branch from 357c947 to e3432ae Compare May 19, 2026 17:42

TomWambsgans force-pushed the main branch from 0295672 to 13408cc Compare May 19, 2026 17:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Goldilocks#210

Goldilocks#210
TomWambsgans wants to merge 47 commits into
mainfrom
goldilocks

TomWambsgans commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

TomWambsgans commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant