Optimisation for UInt256.fromBytesBE #9547

Draft

thomas-quadratic wants to merge 10 commits into hyperledger:main from thomas-quadratic:feat/fromBytesBE-optim

+664 −247

Contributor

thomas-quadratic commented Dec 7, 2025

PR description

@ahamlat improved the performance of UInt256.fromBytesBE using loops.
Motivated by that finding and by the migration to big-endian limbs, this PR proposes further improvements.
Notably, the implementation can now use Arrays.mismatch to skip leading zero padding efficiently, which the previous version did not do.
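
As an illustration of the idea only (not the code in this PR), the following minimal sketch shows how Arrays.mismatch against an all-zero array can locate the first non-zero byte, so that only the significant bytes are packed into big-endian int limbs. The class name, the helper names and the choice of 8 limbs of 32 bits are assumptions made for the example.

```java
import java.util.Arrays;

// Hypothetical sketch: names and layout are assumptions, not the PR's implementation.
final class FromBytesSketch {
  static final int N_LIMBS = 8; // 8 x 32-bit limbs = 256 bits
  private static final byte[] ZERO_BYTES = new byte[32];

  /** Index of the first non-zero byte, or bytes.length if every byte is zero. */
  static int firstNonZero(final byte[] bytes) {
    // Equal-length ranges: mismatch returns -1 when the input is all zeros.
    int i = Arrays.mismatch(bytes, 0, bytes.length, ZERO_BYTES, 0, bytes.length);
    return i == -1 ? bytes.length : i;
  }

  /** Packs at most 32 big-endian bytes into big-endian limbs (limbs[7] is least significant). */
  static int[] toLimbsBE(final byte[] bytes) {
    if (bytes.length > 32) {
      throw new IllegalArgumentException("at most 32 bytes expected");
    }
    int[] limbs = new int[N_LIMBS];
    int start = firstNonZero(bytes); // leading zero padding is skipped entirely
    int i = N_LIMBS - 1; // index in the limb array
    int b = bytes.length - 1; // index in the byte array
    while (b >= start) {
      int limb = 0;
      for (int shift = 0; shift < 32 && b >= start; shift += 8, b--) {
        limb |= (bytes[b] & 0xFF) << shift;
      }
      limbs[i--] = limb;
    }
    return limbs;
  }
}
```

With this sketch, the input {0, 0, 1, 2} only touches the last limb, and an all-zero input never enters the packing loop.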

Fixed Issue(s)

Thanks for sending a pull request! Have you done the following?

Checked out our contribution guidelines?
Considered documentation and added the doc-change-required label to this PR if updates are required.
Considered the changelog and included an update if required.
For database changes (e.g. KeyValueSegmentIdentifier) considered compatibility and performed forwards and backwards compatibility tests

Locally, you can run these tests to catch failures early:

spotless: ./gradlew spotlessApply
unit tests: ./gradlew build
acceptance tests: ./gradlew acceptanceTest
integration tests: ./gradlew integrationTest
reference tests: ./gradlew ethereum:referenceTests:referenceTests
hive tests: Engine or other RPCs modified?

github-project-automation bot added this to RC 25.12.0

github-project-automation bot moved this to Open PRs in RC 25.12.0

thomas-quadratic force-pushed the feat/fromBytesBE-optim branch from ced26d3 to c760630 Compare

December 9, 2025 14:39

thomas-quadratic added 10 commits

December 11, 2025 14:02


          ENH: represent UInt256 as big-endian limbs

767bca3

Before, limbs were stored in little-endian.
But to use Arrays.mismatch to our advantage, it is better to have it big-endian.
This commit makes UInt256.java big-endian in limbs.
We still need to migrate all tests and benchmark.

Signed-off-by: Thomas Zamojski <[email protected]>


          FIX: array indices bugs for fromBytesBE and mulMod.

20b4e35

Also added tests that were failing and now pass.

Signed-off-by: Thomas Zamojski <[email protected]>


          FIX: offset indices in mulMod and fromBytesBE

1d12fc0

Signed-off-by: Thomas Zamojski <[email protected]>


          FIX: spotless

6bb781f

Signed-off-by: Thomas Zamojski <[email protected]>


          ENH: represent UInt256 as big-endian limbs

abbf656

Before, limbs were stored in little-endian.
But to use Arrays.mismatch to our advantage, it is better to have it big-endian.
This commit makes UInt256.java big-endian in limbs.
We still need to migrate all tests and benchmark.

Signed-off-by: Thomas Zamojski <[email protected]>


          FIX: array indices bugs for fromBytesBE and mulMod.

9cdc0e2

Also added tests that were failing and now pass.

Signed-off-by: Thomas Zamojski <[email protected]>


          FIX: spotless

4f6b4ae

Signed-off-by: Thomas Zamojski <[email protected]>


          ENH: optimization for fromBytesBE

bd55edc

Signed-off-by: Thomas Zamojski <[email protected]>


          ADD: benchmarks for fromBytesBE

b14ec51

Signed-off-by: Thomas Zamojski <[email protected]>


          FIX: spotless

e324e55

Signed-off-by: Thomas Zamojski <[email protected]>

lu-pinto force-pushed the feat/fromBytesBE-optim branch from c760630 to e324e55 Compare

December 11, 2025 14:02

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                private final int[] limbs;

                private final int length;

                private final int offset;

Member

lu-pinto Dec 11, 2025

Please add a comment explaining what this offset means. The comment above still mentions length.

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                    int limb = 0;

                  int i = N_LIMBS - 1; // Index in int array

                  int b = bytes.length - 1; // Index in bytes array

                  int limb;

Member

lu-pinto Dec 11, 2025

nit: move limb definition inside loop

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                  return new UInt256(limbs, len);

                  int offset = N_LIMBS - len;

                  System.arraycopy(arr, 0, limbs, offset, len);

                  return new UInt256(limbs, offset);

Member

lu-pinto Dec 11, 2025

The offset is not guaranteed to be the real offset. For instance, with int[] arr = new int[]{0,0,1,2} you will get offset = 0 when it should be offset = 2. This matters because you are constructing a new UInt256, so it should uphold the same guarantees as fromBytesBE.
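
One way to restore that guarantee, sketched here under assumptions rather than taken from the PR, is to derive the offset from the limb contents instead of from the input length. The class and method names below are hypothetical, N_LIMBS is assumed to be 8, and the sketch assumes an all-zero value gets offset N_LIMBS. With the 8-limb padding used here, the length-based offset for {0,0,1,2} is 4 while the first non-zero limb sits at index 6, which is the same gap of two that the comment points out.

```java
import java.util.Arrays;

// Hypothetical sketch: computes the offset from the limb contents.
final class OffsetSketch {
  static final int N_LIMBS = 8;
  private static final int[] ZERO_LIMBS = new int[N_LIMBS];

  /** Right-aligns arr (big-endian, at most N_LIMBS long) inside a full limb array. */
  static int[] padToLimbs(final int[] arr) {
    int[] limbs = new int[N_LIMBS];
    System.arraycopy(arr, 0, limbs, N_LIMBS - arr.length, arr.length);
    return limbs;
  }

  /** Index of the first non-zero limb; N_LIMBS when the value is zero (an assumption). */
  static int trueOffset(final int[] limbs) {
    // limbs.length is assumed to be N_LIMBS, so -1 means "all limbs are zero".
    int i = Arrays.mismatch(limbs, ZERO_LIMBS);
    return i == -1 ? N_LIMBS : i;
  }
}
```

The extra scan costs one pass over at most eight ints, which may or may not be acceptable on a hot path; that trade-off would need measuring.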

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                 * @return Big-endian ordered bytes for this UInt256 value.

                 */

                public byte[] toBytesBE() {

                public byte[] toBytesBEOld() {

Member

lu-pinto Dec 11, 2025

why do we have this on? Just for benchmarks? Remove after PR is sound.

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                 */

                public byte[] toBytesBE() {

                  byte[] result = new byte[BYTESIZE];

                  for (int i = this.limbs.length - N_LIMBS, j = 0; i < this.limbs.length; i++, j += 4) {

Member

lu-pinto Dec 11, 2025

could use offset to shorten the for loop

Member

lu-pinto Dec 11, 2025

never mind... I get it now, we cannot trust it after computations.

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                public boolean isZero() {

                  return (limbs[0] | limbs[1] | limbs[2] | limbs[3] | limbs[4] | limbs[5] | limbs[6] | limbs[7])

                      == 0;

                  return Arrays.mismatch(limbs, ZERO.limbs) == -1;

Member

lu-pinto Dec 11, 2025

probably the older impl was more performant though.
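
For reference, the two variants under discussion can be written side by side as below. This is only a sketch with hypothetical names, operating on a bare int[8] rather than on UInt256; it shows that the two are functionally interchangeable and makes no claim about which is faster, which would need a benchmark.

```java
import java.util.Arrays;

// Hypothetical sketch: both checks on a bare 8-limb array.
final class IsZeroSketch {
  private static final int[] ZERO_LIMBS = new int[8];

  /** Older style: OR all limbs together and test the result once. */
  static boolean isZeroOr(final int[] l) {
    return (l[0] | l[1] | l[2] | l[3] | l[4] | l[5] | l[6] | l[7]) == 0;
  }

  /** Newer style: no mismatch against an all-zero array means the value is zero. */
  static boolean isZeroMismatch(final int[] l) {
    return Arrays.mismatch(l, ZERO_LIMBS) == -1;
  }
}
```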

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                    if (comp != 0) return comp;

                  }

                  return 0;

                  int i = Arrays.mismatch(a.limbs, b.limbs);

Member

lu-pinto Dec 11, 2025

faster path is to check on offset?

if (a.offset != b.offset) return Integer.compare(b.offset, a.offset);

Then run mismatch on remainder.

Member

lu-pinto Dec 11, 2025

Actually there's Arrays.compareUnsigned already if you want to use it

Member

lu-pinto Dec 11, 2025

ok again I get why you did it, cause you cannot trust offset after computations.
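
A minimal sketch of the Arrays.compareUnsigned option mentioned in this thread, assuming both operands are full-length big-endian limb arrays of equal size (the class and method names are hypothetical). For equal-length big-endian arrays, lexicographic order on unsigned limbs coincides with numeric order, so no offset is needed at all.

```java
import java.util.Arrays;

// Hypothetical sketch: unsigned comparison of equal-length big-endian limb arrays.
final class CompareLimbsSketch {
  /** Returns -1, 0 or 1 according to the unsigned numeric order of a and b. */
  static int compare(final int[] a, final int[] b) {
    // compareUnsigned treats each int as unsigned, so 0xFFFFFFFF ranks above 1.
    return Integer.signum(Arrays.compareUnsigned(a, b));
  }
}
```

Note that the signed Arrays.compare would order a limb of 0xFFFFFFFF below a limb of 1, which is why the unsigned variant is the one that applies here.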

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                          | (this.limbs[6] ^ other.limbs[6])

                          | (this.limbs[7] ^ other.limbs[7]);

                  return xor == 0;

                  return Arrays.mismatch(this.limbs, other.limbs) == -1;

Member

lu-pinto Dec 11, 2025

There's also Arrays.equals() that is vectorized and simplifies the code.

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                  absInto(x, this.limbs, N_LIMBS);

                  absInto(y, modulus.limbs, N_LIMBS);

                  System.arraycopy(this.limbs, 0, x, 0, N_LIMBS);

                  System.arraycopy(modulus.limbs, 0, y, 0, N_LIMBS);

Member

lu-pinto Dec 11, 2025

why not offset instead of src=0?

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                  int divLen = effectiveLength(dividend);

                  // Shortcut: if dividend < modulus or dividend == modulus

                  int cmp = compareLimbs(dividend, modulus);

Member

lu-pinto Dec 11, 2025

Why not use modLen and divLen in compareLimbs to short-circuit on size?

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                  this.offset = offset;

                }

                UInt256(final int[] limbs) {

Member

lu-pinto Dec 11, 2025

either remove this constructor or make sure there's a guarantee on offset

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                // --------------------------------------------------------------------------

                UInt256(final int[] limbs, final int length) {

                UInt256(final int[] limbs, final int offset) {

Member

lu-pinto Dec 11, 2025

make this constructor private, not just package-private

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                public UInt256 mod(final UInt256 modulus) {

                  if (this.isZero() || modulus.isZero()) return ZERO;

                  return new UInt256(knuthRemainder(this.limbs, modulus.limbs), modulus.length);

                  return new UInt256(knuthRemainder(this.limbs, modulus.limbs));

Member

lu-pinto Dec 11, 2025

I see why you are using the UInt256(int[]) constructor: basically when you don't care about the offset because the resulting UInt256 is not going to be reused...
I would still lean towards removing that constructor, as the intent is not easy to read without checking the places where it is used.

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                  int[] sum = addImpl(this.limbs, other.limbs);

                  int[] rem = knuthRemainder(sum, modulus.limbs);

                  return new UInt256(rem, modulus.length);

                  return new UInt256(rem, rem.length - modulus.limbs.length + modulus.offset);

Member

lu-pinto Dec 11, 2025

nit: not a big deal since UInt256 will be discarded but I cannot understand the formula to compute offset

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                // }

                // Comparing two int subarrays as big-endian multi-precision integers.

                private static int compareLimbs(final int[] a, final int[] b) {

Member

lu-pinto Dec 11, 2025

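I believe this can be as simple as:

private static int compareLimbs(final int[] a, final int aLen, final int[] b, final int bLen) {
  // With effective lengths, more significant limbs means a larger value.
  if (aLen != bLen) {
    return Integer.compare(aLen, bLen);
  }
  int aOffset = a.length - aLen;
  int bOffset = b.length - bLen;
  // The ranged Arrays.mismatch returns an index relative to the start of the ranges.
  int index = Arrays.mismatch(a, aOffset, a.length, b, bOffset, b.length);
  if (index == -1) {
    return 0;
  }
  return Integer.compareUnsigned(a[aOffset + index], b[bOffset + index]);
}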
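
One detail worth checking in any comparison built on the ranged Arrays.mismatch overload: the returned index is relative to the start of the compared ranges, not an absolute array index, so the range offset has to be added back before indexing the arrays. The small sketch below, with hypothetical names, demonstrates this.

```java
import java.util.Arrays;

// Hypothetical sketch: shows that ranged mismatch yields a range-relative index.
final class MismatchIndexSketch {
  /** Relative index of the first differing limb in [offset, length), or -1 if none. */
  static int relativeMismatch(final int[] a, final int[] b, final int offset) {
    return Arrays.mismatch(a, offset, a.length, b, offset, b.length);
  }

  /** Absolute index of the first differing limb, or -1 if the ranges are equal. */
  static int absoluteMismatch(final int[] a, final int[] b, final int offset) {
    int rel = relativeMismatch(a, b, offset);
    return rel == -1 ? -1 : offset + rel;
  }
}
```

For a = {0,0,0,0,0,0,5,7}, b = {0,0,0,0,0,0,5,9} and offset 6, the relative index is 1 while the limbs that actually differ are at absolute index 7.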

lu-pinto reviewed

evm/src/main/java/org/hyperledger/besu/evm/UInt256.java

    
                //   // Unchecked : xLen <= x.length, xLen <= N_LIMBS

                //   int i = Arrays.mismatch(x, offset, length, ZERO.limbs, 0, N_LIMBS);

                //   return N_BITS_PER_LIMB * i + Integer.numberOfLeadingZeros(x[offset + i]);

                // }

Member

lu-pinto Dec 11, 2025

commented out code, remove?

lu-pinto requested changes

Member

lu-pinto left a comment

It's functionally OK, but I would remove some dead and commented-out code.
If you don't want to address the performance callouts right now, I can do that in my PR later, no worries at all, so feel free to go ahead and merge once it's cleaned up.
