BaseFold: Extract Merkle tree inner into a new struct. #560

yczhangsjtu · 2024-11-06T01:40:51Z

Extract small PR from #294

Currently the inner (without the leafs) of the Merkle tree is represented by a Vec<Vec<Digest>>. Sometimes (for optimization reasons, e.g., when the leaf values should stay in the original variable and we don't want to clone it into the Merkle tree) this inner part needs to be accessed and stored separately from the leaves. So we encapsulate it in a struct MerkleTreeDigests for more readability, implementing common methods on them, and allow enforcing particular constraints over the Vec<Vec<Digest>>.

This refactor provides new APIs for MerkleTreeDigests and make the corresponding MerkleTree APIs a thin wrapper around the APIs for MerkleTreeDigests. There are some new APIs and the behavior of existing APIs stay the same.

matthiasgoergens

Please see comments.

Btw, you don't need to give the types everywhere, Rust can figure much of that out on its own. Here's an example of two places, but there are more:

diff --git a/mpcs/src/basefold/commit_phase.rs b/mpcs/src/basefold/commit_phase.rs
index 992745bb..5df3aa19 100644
--- a/mpcs/src/basefold/commit_phase.rs
+++ b/mpcs/src/basefold/commit_phase.rs
@@ -98,8 +98,7 @@ where
         );
 
         if i > 0 {
-            let running_tree =
-                MerkleTree::<E>::new(running_tree_inner, FieldType::Ext(running_oracle));
+            let running_tree = MerkleTree::new(running_tree_inner, FieldType::Ext(running_oracle));
             trees.push(running_tree);
         }
 
@@ -114,7 +113,7 @@ where
             // Then the oracle will be used to fold to the next oracle in the next
             // round. After that, this oracle is free to be moved to build the
             // complete Merkle tree.
-            running_tree_inner = MerkleTreeDigests::<E>::from_leaves_ext(&new_running_oracle);
+            running_tree_inner = MerkleTreeDigests::from_leaves_ext(&new_running_oracle);
             let running_root = running_tree_inner.root();
             write_digest_to_transcript(&running_root, transcript);
             roots.push(running_root.clone());
diff --git a/mpcs/src/util/merkle_tree.rs b/mpcs/src/util/merkle_tree.rs
index 53568af8..81af42bd 100644
--- a/mpcs/src/util/merkle_tree.rs
+++ b/mpcs/src/util/merkle_tree.rs
@@ -83,9 +83,7 @@ where
                 .iter()
                 .take(self.height() - 1)
                 .enumerate()
-                .map(|(index, layer)| {
-                    Digest::<E::BaseField>(layer[(leaf_group_index >> index) ^ 1].clone().0)
-                })
+                .map(|(index, layer)| Digest(layer[(leaf_group_index >> index) ^ 1].clone().0))
                 .collect(),
         )
     }

(Sometimes adding extra redundant type information helps readability, but here it just seems to add verbosity.)

mpcs/src/util/merkle_tree.rs

matthiasgoergens · 2024-11-06T08:48:16Z

mpcs/src/util/merkle_tree.rs

-    pub fn root_from_inner(inner: &[Vec<Digest<E::BaseField>>]) -> Digest<E::BaseField> {
-        inner.last().unwrap()[0].clone()
+    pub fn root_ref(&self) -> &Digest<E::BaseField> {
+        &self.inner.last().unwrap()[0]


It looks like we require that inner not be empty. I suggest we encode that in the type, instead of going with Vec which is happy being empty.

Yes, it would be better to let the compiler help us enforce such constraints. But there are more to constrain than this, e.g., the size of the vectors should be 1, 2, 4, 8, and so on. I don't know how to encode all these in the type. Currently, it's just relying on inner being private, and is never changed once the Merkle tree is built, and all the unwrap() are concealed.

mpcs/src/util/merkle_tree.rs

matthiasgoergens · 2024-11-06T08:49:56Z

mpcs/src/util/merkle_tree.rs

 where
    E::BaseField: Serialize + DeserializeOwned,
 {
-    pub fn compute_inner(leaves: &FieldType<E>) -> Vec<Vec<Digest<E::BaseField>>> {
-        merkelize::<E>(&[leaves])
+    pub fn from_leaves(leaves: &FieldType<E>) -> Self {


Consider implementing From instances instead? (Not completely sure.)

I'm not sure, either. Computing the Merkle tree is an expensive procedure, so we probably want it to be always invoked explicitly.

Rust never invokes from implicitly.

No, not in the language. I just have the impression (which may be wrong) that .into() gives people the hint that this is a cheap type conversion, invoking some wrapper functions, without any expensive stuff.

mpcs/src/util/merkle_tree.rs

matthiasgoergens · 2024-11-06T08:55:05Z

Could you please add in the PR description why we are doing this?

What do we want to accomplish that gets easier or simpler with this refactoring? Oh, and is this supposed to be a pure refactoring, or does it change any behaviour?

…tor-extract-3

yczhangsjtu added 4 commits November 6, 2024 09:37

Extract merkle tree inner into a new struct.

332d302

Fix clippy.

826d0e2

Remove group size (to add in later PR)

643a66a

Add some doc to avoid confusion.

f7e2b40

yczhangsjtu changed the title ~~Extract Merkle tree inner into a new struct.~~ BaseFold: Extract Merkle tree inner into a new struct. Nov 6, 2024

matthiasgoergens reviewed Nov 6, 2024

View reviewed changes

yczhangsjtu added 7 commits November 11, 2024 15:10

Merge remote-tracking branch 'origin/master' into feat/basefold-refac…

98d0710

…tor-extract-3

Update according to comments.

68e32af

Add doc.

8a5dbc7

Merge remote-tracking branch 'origin/master' into feat/basefold-refac…

05cf0b6

…tor-extract-3

Merge remote-tracking branch 'origin/master' into feat/basefold-refac…

3643110

…tor-extract-3

Add comments.

a8dec01

Merge remote-tracking branch 'origin/master' into feat/basefold-refac…

b57c188

…tor-extract-3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BaseFold: Extract Merkle tree inner into a new struct. #560

BaseFold: Extract Merkle tree inner into a new struct. #560

yczhangsjtu commented Nov 6, 2024 •

edited

Loading

matthiasgoergens left a comment

matthiasgoergens Nov 6, 2024

yczhangsjtu Nov 11, 2024 •

edited

Loading

matthiasgoergens Nov 6, 2024

yczhangsjtu Nov 12, 2024

matthiasgoergens Nov 12, 2024

yczhangsjtu Nov 28, 2024

matthiasgoergens commented Nov 6, 2024 •

edited

Loading

BaseFold: Extract Merkle tree inner into a new struct. #560

Are you sure you want to change the base?

BaseFold: Extract Merkle tree inner into a new struct. #560

Conversation

yczhangsjtu commented Nov 6, 2024 • edited Loading

matthiasgoergens left a comment

Choose a reason for hiding this comment

matthiasgoergens Nov 6, 2024

Choose a reason for hiding this comment

yczhangsjtu Nov 11, 2024 • edited Loading

Choose a reason for hiding this comment

matthiasgoergens Nov 6, 2024

Choose a reason for hiding this comment

yczhangsjtu Nov 12, 2024

Choose a reason for hiding this comment

matthiasgoergens Nov 12, 2024

Choose a reason for hiding this comment

yczhangsjtu Nov 28, 2024

Choose a reason for hiding this comment

matthiasgoergens commented Nov 6, 2024 • edited Loading

yczhangsjtu commented Nov 6, 2024 •

edited

Loading

yczhangsjtu Nov 11, 2024 •

edited

Loading

matthiasgoergens commented Nov 6, 2024 •

edited

Loading