You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jul 7, 2020. It is now read-only.
The HyperLogLogPlus implementation only supports parameters p and sp for which 4 <= p <= sp <= 25. However, there is only a check for 4 <= p <= sp <= 32. Here is some test code with sp=26 that demonstrates the problem:
If we try to estimate the error rate boundaries for p=20 we should get 3 * 1.04 / sqrt (2^20) which is 0.003046875 or 0.3046875%, but error rate for p=20,sp=32 is much higher.
Given the described behaviour, should the implementation be patched to allow only the following input values 4 <= p <= sp <= 25 as @oertl suggested?
If so, I can submit a patch to fix it.
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
The HyperLogLogPlus implementation only supports parameters p and sp for which 4 <= p <= sp <= 25. However, there is only a check for 4 <= p <= sp <= 32. Here is some test code with sp=26 that demonstrates the problem:
The expected cardinality is 2, because the index (first 25 bits) as well as the sparse index (first 26 bits) are different.
The text was updated successfully, but these errors were encountered: