feat: ADVZ PayloadProver support requests that span multiple polynomial #438

ggutoski · 2023-11-29T21:24:15Z

Description

closes: #424

Naive implementation:

SmallRangeProof: concatenate individual KZG proofs from multiple polynomials, do not aggregate
LargeRangeProof: rebuild and verify multiple KZG commitments. There's not much more we could do to improve here. But verification inside a smart contract is now more complex because it might need to compute multiple KZG commitments instead of one.
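
To illustrate what it means for a request to span multiple polynomials, here is a minimal sketch. It assumes the payload has already been converted to field elements and that every polynomial holds a fixed number of them. The helper name `split_range_by_poly` and the parameter `elems_per_poly` are hypothetical and do not come from this PR. Each returned piece is the part of the request that falls inside one polynomial; the naive SmallRangeProof described above concatenates the KZG proofs for these pieces instead of aggregating them.

```rust
use std::ops::Range;

/// Hypothetical helper (not taken from the PR): split a range of
/// field-element indices into one sub-range per polynomial it touches.
/// `elems_per_poly` is an assumed fixed number of elements per polynomial.
/// Returns pairs of (polynomial index, range relative to that polynomial).
fn split_range_by_poly(
    range_elem: Range<usize>,
    elems_per_poly: usize,
) -> Vec<(usize, Range<usize>)> {
    assert!(elems_per_poly > 0, "a polynomial must hold at least one element");
    let mut pieces = Vec::new();
    let mut start = range_elem.start;
    while start < range_elem.end {
        let poly_index = start / elems_per_poly;
        // First element index that belongs to the next polynomial.
        let poly_end = (poly_index + 1) * elems_per_poly;
        let end = poly_end.min(range_elem.end);
        // Shift to indices relative to the start of this polynomial.
        let offset = poly_index * elems_per_poly;
        pieces.push((poly_index, (start - offset)..(end - offset)));
        start = end;
    }
    pieces
}
```

With 4 elements per polynomial, the request `3..10` touches three polynomials: the last element of polynomial 0, all of polynomial 1, and the first two elements of polynomial 2.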

Before we can merge this PR, please make sure that all the following items have been
checked off. If any of the checklist items are not applicable, please leave them but
write a little note why.

Targeted PR against correct branch (main)
Linked to GitHub issue with discussion and accepted design OR have an explanation in the PR that describes this work.
Wrote unit tests
Updated relevant documentation in the code
Added a relevant changelog entry to the Pending section in CHANGELOG.md
Re-reviewed Files changed in the GitHub PR explorer


chancharles92 · 2023-12-04T21:19:14Z

LargeRangeProof: rebuild and verify multiple KZG commitments. There's not much more we could do to improve here. But verification inside a smart contract is now more complex because it might need to compute multiple KZG commitments instead of one.

Can the smart contract just generate a random scalar and randomly combine the field vectors? Finally, it only needs to compute a single KZG commitment and compare it with the random linear combinations of the original commitments.

ggutoski · 2023-12-04T21:30:24Z

Can the smart contract just generate a random scalar and randomly combine the field vectors? Finally, it only needs to compute a single KZG commitment and compare it with the random linear combinations of the original commitments.

Interesting idea. Basically it replaces subsequent MSMs with a faster hash of the field vectors.

In any case I don't want to do it in this PR. 😉
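
The random-combination idea discussed above relies only on the commitment being linear in the committed vector. The sketch below is a toy model of that identity, not KZG: the "commitment" is an inner product with fixed public values modulo a prime, which has the same linearity but none of the security. All names (`toy_commit`, `combine_vectors`, `combine_commitments`, `batched_check`) and the modulus are assumptions made for illustration.

```rust
/// Toy prime modulus (2^61 - 1). Inputs are assumed to be already reduced.
const P: u128 = (1 << 61) - 1;

fn add(a: u128, b: u128) -> u128 {
    (a + b) % P
}

fn mul(a: u128, b: u128) -> u128 {
    (a * b) % P
}

/// Toy linear "commitment": sum of bases[i] * v[i] mod P.
/// This stands in for the MSM of a real KZG commitment; it is NOT secure.
fn toy_commit(bases: &[u128], v: &[u128]) -> u128 {
    bases
        .iter()
        .zip(v)
        .fold(0u128, |acc, (g, x)| add(acc, mul(*g, *x)))
}

/// Combine vectors as v_0 + r*v_1 + r^2*v_2 + ... (element-wise).
fn combine_vectors(vectors: &[Vec<u128>], r: u128) -> Vec<u128> {
    let len = vectors.iter().map(|v| v.len()).max().unwrap_or(0);
    let mut out = vec![0u128; len];
    let mut power = 1u128;
    for v in vectors {
        for (o, x) in out.iter_mut().zip(v) {
            *o = add(*o, mul(power, *x));
        }
        power = mul(power, r);
    }
    out
}

/// Combine commitments with the same powers of r.
fn combine_commitments(commitments: &[u128], r: u128) -> u128 {
    let mut acc = 0u128;
    let mut power = 1u128;
    for c in commitments {
        acc = add(acc, mul(power, *c));
        power = mul(power, r);
    }
    acc
}

/// One commitment computation replaces one per vector.
fn batched_check(bases: &[u128], vectors: &[Vec<u128>], commitments: &[u128], r: u128) -> bool {
    toy_commit(bases, &combine_vectors(vectors, r)) == combine_commitments(commitments, r)
}
```

In a real verifier the scalar `r` would have to be unpredictable to the prover, for example derived by hashing the field vectors and commitments; the sketch takes it as a plain argument.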


chancharles92 · 2023-12-04T21:41:36Z

primitives/src/vid/advz/payload_prover.rs

-        );
+        let range_poly_byte = self.range_poly_to_byte_clamped(&range_poly, payload.len());
+        let offset_elem = self.offset_poly_to_elem(range_poly.start, range_elem.start);
+        let final_points_range_end =


For the case where the range goes across n >= 3 polys, are there any advantages of using SmallRangeProof?
My argument: if the range covers at least n-2 >= 1 full polys, then the proof size and verification time of SmallRangeProof won't be much better than those of LargeRangeProof, even if we use a KZG multiproof in the future. This is because the proof size is at least the number of evaluations (even if the KZG proof is O(1)-sized), and the verification is more than O(n*|poly_deg|) G-ops. (Recall that the KZG multi-verification still needs O(|eval_points|) group operations.)

...the proof size is at least the number of evaluations (even if the KZG proof is O(1)-sized)

Why is that? A SmallRangeProof with constant-size KZG proof is constant-sized. The remaining fields are all constant-size. (The length of prefix_bytes and suffix_bytes is always less than 32.)

...the verification is more than O(n*|poly_deg|) G-ops.

Yes, but I wonder whether the n-factor could be eliminated by aggregating multiproofs across multiple polynomials as described here. But that's a cutting-edge optimization that will need to wait until later.

In any case, I doubt there is any advantage to the use of SmallRangeProof for large ranges--that's what LargeRangeProof is for. If your range spans multiple polynomials then LargeRangeProof is probably a better choice.

Is there a terminology misunderstanding? "Small" here refers to the size of the range of payload bytes that will be proven, not the size of the proof itself. (i.e., [small range] proof vs. small [range proof] 😛 ) We tried to choose names that generalize beyond the narrow tx-namespace application. The intention is that Small should be used for tx proofs (typically only a small number of bytes) in a setting where a pairing is feasible (such as an off-chain light client). By contrast, Large should be used for namespace proofs (typically a large number of bytes) in a setting where a pairing is impractical (such as an on-chain light client). Perhaps we could change these names.

Is there a terminology misunderstanding?

Oh the understanding of terminology is correct. My thought was: if range in SmallRangeProof::payload_proof() is not small (e.g., if it goes across >= 3 polynomials), the API should directly return an error and ask the user to use LargeRangeProof::payload_proof() instead (because there is no advantage of using SmallRangeProof::payload_proof() in this case). While in the current code, it is still dealing with this case and returns a valid but inefficient proof.

Why is that? A SmallRangeProof with constant-size KZG proof is constant-sized. The remaining fields are all constant-size. (The length of prefix_bytes and suffix_bytes is always less than 32.)

Because you need to send the tx content to the verifier anyway (my understanding is that the so-called proof is data + auxiliary_proof). E.g., if the tx data is exactly a polynomial, in the LargeRangeProof case, you don't need to send any extra info (as the tx itself is already a proof); while here you need to additionally send an O(1)-sized KZG multiproof.

Thanks for clarifying.

My thought was: if range in SmallRangeProof::payload_proof() is not small (e.g., if it goes across >= 3 polynomials), the API should directly return an error and ask the user to use LargeRangeProof::payload_proof() instead (because there is no advantage of using SmallRangeProof::payload_proof() in this case). While in the current code, it is still dealing with this case and returns a valid but inefficient proof.

An alternative is to add another impl of PayloadProver that intelligently selects the proof type based on the user's input. The return type would be an enum with Small and Large variants. The existing impls for Small and Large remain as-is so that the user retains the ability to enforce a preference for Small or Large. (Example: perhaps the user always wants Large because it's pairing-free. I can't think of a reason to enforce Small at this time but you never know.)
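
A minimal sketch of that alternative, under stated assumptions: the enum `AutoRangeProof`, the functions `polys_spanned` and `prove_auto`, and the threshold parameter `large_from_polys` are all hypothetical names, and the two provers are passed in as closures so the sketch does not depend on the real `PayloadProver` trait. The selection rule (switch to Large once the range spans a given number of polynomials) is only one possible policy, taken from the ">= 3 polynomials" suggestion earlier in this thread.

```rust
use std::ops::Range;

/// Hypothetical return type: either kind of proof, chosen automatically.
#[derive(Debug, PartialEq)]
enum AutoRangeProof<S, L> {
    Small(S),
    Large(L),
}

/// Number of polynomials touched by a range of field-element indices,
/// assuming a fixed `elems_per_poly` elements per polynomial.
fn polys_spanned(range_elem: &Range<usize>, elems_per_poly: usize) -> usize {
    assert!(elems_per_poly > 0, "a polynomial must hold at least one element");
    if range_elem.start >= range_elem.end {
        return 0;
    }
    (range_elem.end - 1) / elems_per_poly - range_elem.start / elems_per_poly + 1
}

/// Pick Large when the range spans at least `large_from_polys` polynomials,
/// otherwise Small. The provers are supplied by the caller.
fn prove_auto<S, L>(
    range_elem: Range<usize>,
    elems_per_poly: usize,
    large_from_polys: usize,
    prove_small: impl FnOnce(Range<usize>) -> S,
    prove_large: impl FnOnce(Range<usize>) -> L,
) -> AutoRangeProof<S, L> {
    if polys_spanned(&range_elem, elems_per_poly) >= large_from_polys {
        AutoRangeProof::Large(prove_large(range_elem))
    } else {
        AutoRangeProof::Small(prove_small(range_elem))
    }
}
```

Because the existing Small and Large impls stay untouched, a caller who always wants the pairing-free variant can keep calling the Large prover directly and ignore this wrapper.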

... in the LargeRangeProof case, you don't need to send any extra info (as the tx itself is already a proof)

A LargeRangeProof needs enough auxiliary data to complete a polynomial. This auxiliary data could be quite large if for example the user's range just barely crosses into another polynomial. But yes, if the data range coincides with a polynomial then there's near-zero overhead.
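
The overhead can be made concrete with a small calculation. This is a sketch in units of field elements, assuming every polynomial holds exactly `elems_per_poly` elements; the function name `aux_elems_for_large_proof` is hypothetical and the real proof also carries a few leftover bytes that are ignored here.

```rust
use std::ops::Range;

/// Hypothetical helper: how many extra field elements are needed before and
/// after the requested range so that every touched polynomial is complete.
/// Returns (prefix_elems, suffix_elems).
fn aux_elems_for_large_proof(range_elem: &Range<usize>, elems_per_poly: usize) -> (usize, usize) {
    assert!(elems_per_poly > 0, "a polynomial must hold at least one element");
    if range_elem.start >= range_elem.end {
        return (0, 0);
    }
    // Elements of the first polynomial that precede the range.
    let prefix = range_elem.start % elems_per_poly;
    // Elements of the last polynomial that follow the range.
    let suffix = (elems_per_poly - range_elem.end % elems_per_poly) % elems_per_poly;
    (prefix, suffix)
}
```

With 4 elements per polynomial, the two-element range `3..5` barely crosses into a second polynomial and needs 3 + 3 auxiliary elements, whereas `4..8` coincides with one polynomial and needs none.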

Proposal

I propose we merge as-is and punt this question to a new issue. This discussion is about performance but our current goal is feature-completeness. Any opinions?

New issue #441

chancharles92 · 2023-12-04T22:17:00Z

Interesting idea. Basically it replaces subsequent MSMs with a faster hash of the field vectors.

In any case I don't want to do it in this PR. 😉

In the interactive smart contract setting, we can even generate the randomness via oracles without doing Fiat-Shamir. But yes, there's no need to implement that for now.

chancharles92

LGTM, will approve after addressing the comment in #438 (comment)

ggutoski added 7 commits November 28, 2023 13:21

refactor test, add failing test case

6b71fff

wip untested payload_proof for SmallRangeProof allow range to span mu…

539807d

…ltiple polynomials

support arbitrary ranges for ShortRangeProof with tests (yay)

5ac88b9

support arbitrary ranges for LongRangeProof with tests (yay)

fa8ba2d

offset_elem slightly less ugly

0c12ba1

final_points_range_end slightly less ugly

c000411

remove TODO

fd603b4

ggutoski requested a review from chancharles92 November 29, 2023 21:24

ggutoski added 2 commits November 29, 2023 18:15

Merge branch 'main' into gg/424

2b8103c

update changelog

62f9681

chancharles92 requested changes Dec 4, 2023


add test for a wrong proof as per #438 (comment)

55d49ce

ggutoski requested a review from chancharles92 December 5, 2023 19:50

chancharles92 reviewed Dec 5, 2023


chancharles92 approved these changes Dec 5, 2023


ggutoski mentioned this pull request Dec 5, 2023

ADVZ PayloadProver intelligently use Small or Large range proof #441

Open

ggutoski merged commit 080ff32 into main Dec 5, 2023
5 checks passed

ggutoski deleted the gg/424 branch December 5, 2023 21:58
