Update EIP-7923: give more background on pages, thrashing and general setup of the EIP

charles-cooper · web-flow · commit ae8149684095 · 2025-04-13T19:11:38.000Z
Merged by EIP-Bot.
diff --git a/EIPS/eip-7923.md b/EIPS/eip-7923.md
@@ -22,11 +22,19 @@ The EVM currently uses a quadratic pricing model for its memory. This was origin
 2. The quadratic model makes it difficult to reason about how much memory a transaction can allocate. It requires solving an optimization problem which involves computing how many message calls are available to recurse into based on the call stack limit (and, post [EIP-150](./eip-150.md), the 63/64ths rule), and then maximizing the memory used per message call.
 3. The quadratic model makes it impossible for high-level smart contracts languages to get the benefits of virtual memory. Most modern programming languages maintain what is known as the "heap" and the "call stack". The heap is used to allocate objects which live past the lifetime of their current function frame, whereas the call stack is used to allocate objects which live in the current function frames. Importantly, the call stack starts at the top of memory and grows down, while the heap starts at the bottom of memory and grows up, thus the language implementation does not need to worry about the two regions of memory interfering with each other. This is a feature which is enabled by virtual, paged memory, which has been present in operating systems since the early 90's. However, smart contract languages like Vyper and Solidity are not able to implement this, leading to inefficiencies in their memory models.
 
-This EIP proposes a linear costing model which more closely reflects the hardware of today. It uses a virtual addressing scheme so that memory pages are not allocated until they are actually accessed. Notably, the data structures used for costing memory do not need to be part of the memory implementation itself, which suggests an elegant implementation using `mmap`.
+This EIP proposes a linear costing model which more closely reflects the hardware of today, which is hierarchical ("hot" memory is much faster to access than "cold" memory), and virtually addressed (memory does not need to be allocated contiguously, but is rather served "on-demand" by the operating system).
+
+First, some preliminaries. A page is 4096 bytes on most architectures. Given a memory address, its page is simple to compute by masking out the rightmost 12 bits.
+
+There are two factors which contribute to "cold" memory (i.e. not-recently-used) being slower: CPU cache and TLB (Translation Lookaside Buffer) cache. The CPU cache is a least-recently-used memory cache, which is significantly faster than fetching all the way from RAM. The TLB is usually some hash table which maps virtual pages (used by the user) to physical pages in RAM. "Thrashing", or accessing a lot of different memory addresses, does two things: it pushes memory out of the hot cache and into cold memory, and it pushes pages out of the TLB cache.
+
+This EIP uses a virtual addressing scheme so that memory pages are not allocated until they are actually accessed. Further, it adds a surcharge for accessing memory outside of an EVM-defined "hot" area.
+
+Notably, the data structures used for costing memory do not need to be part of the memory implementation itself, which suggests an elegant implementation using the POSIX `mmap` syscall (or, its counterpart on Windows, `VirtualAlloc`).
 
 The implementation can be approached in two ways. The first way is to implement the virtual addressing "manually". This is intended for systems without `mmap` or a virtual addressing capability. The implementation needs to maintain a map from `map[page_id -> char[4096]]`, where `page_id` is an integer, computed as `memory_address >> 12`. Additionally, for costing purposes, a set of 512 `page_id`s (`set[page_id]`) is maintained. This is only used for pricing the operation, it doesn't actually contain the data.
 
-The other implementation is easier, for systems with `mmap` or a similar facility. To hold the actual data of the memory, the implementation `mmap`s a `2**32` byte region of memory. Then, memory operations can be implemented simply as reads or writes against this buffer. (With an anonymous `mmap`, the operating system will allocate pages "on demand", as they are touched). The `pages` map is still necessary, but it doesn't hold any data, it is just to track which pages have been allocated, for pricing purposes. In this implementation, there are three data structures: `memory char[2**32]`, `allocated_pages set[page_id]`, `hot_pages set[page_id]`. The `memory` data structure is only used for memory reads and writes. The `allocated_pages` and `hot_pages` are only used for gas costing.
+The other implementation is easier, for systems with `mmap` or a similar facility. To hold the actual data of the memory, the implementation `mmap`s a `2**32` byte region of memory. Then, memory operations can be implemented simply as reads or writes against this buffer. (With an anonymous `mmap`, the operating system will not allocate the entire buffer up-front, rather, it will allocate pages "on demand", as they are touched). The `pages` map is still necessary, but it doesn't hold any data, it is just to track which pages have been allocated, for pricing purposes. In this implementation, there are three data structures: `memory char[2**32]`, `allocated_pages set[page_id]`, `hot_pages set[page_id]`. The `memory` data structure is only used for memory reads and writes. The `allocated_pages` and `hot_pages` are only used for gas costing.
 
 ## Specification
 
@@ -55,7 +63,7 @@ A transaction-global memory limit is imposed. If the number of pages allocated i
 
 ## Rationale
 
-Benchmarks were performed on a 2019-era CPU, with the ability to keccak256 around 256MB/s, giving it a gas-to-ns ratio of 20 ns per 1 gas. The following benchmarks were performed:
+Benchmarks were performed on a 2019-era CPU, with the ability to `keccak256` around 256MB/s, giving it a gas-to-ns ratio of 20 ns per 1 gas (given that `keccak256` costs 6 gas per 32 bytes). The following benchmarks were performed:
 
 - Time to allocate a fresh page: 1-2us
 - Time to randomly read a byte from a 2MB range: 1.8ns
@@ -64,13 +72,16 @@ Benchmarks were performed on a 2019-era CPU, with the ability to keccak256 aroun
 - Time to update a hashmap with 512 items: 8ns
 - Time to update a hashmap with 8192 items: 9ns
 - Time to update a hashmap with 5mm items: 108ns
+- Time to execute the `mmap` syscall: 230ns
 
 These suggest the following prices:
 
 - 100 gas to allocate a page, and
 - 6 gas for a page thrash
 
-Since the delta between hitting a page and thrashing a page (including bookkeeping overhead) is ~120ns, we could ignore the resource cost and simply increase the base cost per memory operation from 3 gas to 6 gas. However, since memory operations which exploit cost-locality are so cheap, it leaves "room on the table" for future improvements to the gas schedule, including reducing the base cost of a memory operation to 1 gas. Furthermore, as the reference implementation below shows, it takes very little bookkeeping overhead (one additional data structure, and four lines of code) to check for the thrash.
+Note that the cost to execute `mmap` (~11 gas) is already well-paid for by the base cost of the CALL series of instructions (100 gas).
+
+Since the delta between hitting a page and thrashing a page (including bookkeeping overhead) is ~120ns, we could ignore the resource cost and simply increase the base cost per memory operation from 3 gas to 6 gas. However, since memory operations which exploit cost-locality are so cheap, it leaves "room on the table" for future improvements to the gas schedule, including reducing the base cost of a memory operation to 1 gas. Furthermore, as the reference implementation below shows, it takes very little bookkeeping overhead (one additional data structure, and four lines of code) to check for the thrash. Therefore, we model memory with a one-level hierarchy. While this is simpler than most real CPUs, which may have several levels of memory hierarchy, it is granular enough for our purposes.
 
 There is a desire among client implementations to be able to enforce global limits separately from the gas limit due to DoS reasons. For example, RPC providers may be designed to allow many concurrent `eth_call` computations with a much higher gas limit than on mainnet. Not implicitly tying the memory limit to the gas limit results in one less vector for misconfiguration. That is not to say that in the future, a clean formula cannot be created which allows the memory limit to scale with future hardware improvements (e.g., proportional to the sqrt of the gas limit), but to limit the scope of things that need to be reasoned about for this EIP, the hard limit is introduced.
 
@@ -89,14 +100,14 @@ Addressed in Security Considerations section. No backwards compatibility is brok
 
 ## Reference Implementation
 
-A ~50-line reference implementation is provided below. It is implemented as a patch against the `py-evm` codebase at commit ethereum/py-evm@fec63b8c4b9dad9fcb1022c48c863bdd584820c6.
+A ~60-line reference implementation is provided below. It is implemented as a patch against the `py-evm` codebase at commit ethereum/py-evm@fec63b8c4b9dad9fcb1022c48c863bdd584820c6. (This is a reference implementation, it does not, for example, contain fork choice rules).
 
 ```diff
 diff --git a/eth/vm/computation.py b/eth/vm/computation.py
-index bf34fbee..477f969e 100644
+index bf34fbee..db85aee7 100644
 --- a/eth/vm/computation.py
 +++ b/eth/vm/computation.py
-@@ -454,34 +454,37 @@ class BaseComputation(ComputationAPI, Configurable):
+@@ -454,34 +454,40 @@ class BaseComputation(ComputationAPI, Configurable):
          validate_uint256(start_position, title="Memory start position")
          validate_uint256(size, title="Memory size")
 
@@ -106,12 +117,12 @@ index bf34fbee..477f969e 100644
 -        before_cost = memory_gas_cost(before_size)
 -        after_cost = memory_gas_cost(after_size)
 -
-         if self.logger.show_debug2:
-             self.logger.debug2(
-                 f"MEMORY: size ({before_size} -> {after_size}) | "
-                 f"cost ({before_cost} -> {after_cost})"
-             )
-
+-        if self.logger.show_debug2:
+-            self.logger.debug2(
+-                f"MEMORY: size ({before_size} -> {after_size}) | "
+-                f"cost ({before_cost} -> {after_cost})"
+-            )
+-
 -        if size:
 -            if before_cost < after_cost:
 -                gas_fee = after_cost - before_cost
@@ -126,35 +137,86 @@ index bf34fbee..477f969e 100644
 -                        )
 -                    ),
 -                )
+-
+-            self._memory.extend(start_position, size)
 +        if size == 0:
 +            return
 +
 +        ALLOCATE_PAGE_COST = 100
 +        THRASH_PAGE_COST = 6
++        LOWER_BITS = 12  # bits ignored for page calculations
++        PAGE_SIZE = 4096
++        assert 2**LOWER_BITS == PAGE_SIZE   # sanity check
++        MAXIMUM_MEMORY_SIZE = 64 * 1024 * 1024
++        TRANSACTION_MAX_PAGES = MAXIMUM_MEMORY_SIZE // PAGE_SIZE
 +
 +        end = start_position + size
 +
-+        start_page = start_position >> 12
-+        end_page = end >> 12
-+
-+        gas = 0
++        start_page = start_position >> LOWER_BITS
++        end_page = end >> LOWER_BITS
 +
 +        for page in range(start_page, end_page + 1):
 +            if page not in self._memory.pages:
-+                gas += ALLOCATE_PAGE_COST
++                if self.transaction_context.num_pages >= TRANSACTION_MAX_PAGES:
++                    raise VMError("Out Of Memory")
++                self.transaction_context.num_pages += 1
 +
-+            if page not in self._memory.lru_pages:
-+                gas += THRASH_PAGE_COST
++                reason = f"Allocating page {hex(page << LOWER_BITS)}"
++                self._gas_meter.consume_gas(ALLOCATE_PAGE_COST, reason)
++                self._memory.pages[page] = True
 +
-+        for page in range(start_page, end_page + 1):
-+            self._memory.lru_pages[page] = True
-
--            self._memory.extend(start_position, size)
-+        reason = f"Expanding memory {before_size} -> {after_size}"
-+        self._gas_meter.consume_gas(gas, reason)
++            if page not in self._memory.lru_pages:
++                reason = f"Page {hex(page << LOWER_BITS)} not in LRU pages"
++                self._gas_meter.consume_gas(THRASH_PAGE_COST, reason)
++                # insert into the lru_pages data structure.
++                # it's important to do it here rather than after
++                # the loop, since this could evict a page we haven't
++                # visited yet, increasing the cost.
++                self._memory.lru_pages[page] = True
 
      def memory_write(self, start_position: int, size: int, value: bytes) -> None:
          return self._memory.write(start_position, size, value)
+diff --git a/eth/vm/forks/frontier/computation.py b/eth/vm/forks/frontier/computation.py
+index 51666ae0..443f82b5 100644
+--- a/eth/vm/forks/frontier/computation.py
++++ b/eth/vm/forks/frontier/computation.py
+@@ -29,6 +29,7 @@ from eth.exceptions import (
+     InsufficientFunds,
+     OutOfGas,
+     StackDepthLimit,
++    VMError,
+ )
+ from eth.vm.computation import (
+     BaseComputation,
+@@ -87,12 +88,21 @@ class FrontierComputation(BaseComputation):
+
+         state.touch_account(message.storage_address)
+
+-        computation = cls.apply_computation(
+-            state,
+-            message,
+-            transaction_context,
+-            parent_computation=parent_computation,
+-        )
++        # implement transaction-global memory limit
++        num_pages_anchor = transaction_context.num_pages
++        try:
++            computation = cls.apply_computation(
++                state,
++                message,
++                transaction_context,
++                parent_computation=parent_computation,
++            )
++        finally:
++            # "deallocate" all the pages allocated in the child computation
++
++            # sanity check an invariant:
++            allocated_pages = len(computation._memory.pages)
++            assert transaction_context.num_pages == num_pages_anchor + allocated pages
++            transaction_context.num_pages = num_pages_anchor
+
+         if computation.is_error:
+             state.revert(snapshot)
 diff --git a/eth/vm/logic/memory.py b/eth/vm/logic/memory.py
 index 806dbd8b..247b3c74 100644
 --- a/eth/vm/logic/memory.py
@@ -169,7 +231,7 @@ index 806dbd8b..247b3c74 100644
 
  def mcopy(computation: ComputationAPI) -> None:
 diff --git a/eth/vm/memory.py b/eth/vm/memory.py
-index 2ccfd090..5950a4d4 100644
+index 2ccfd090..9002b559 100644
 --- a/eth/vm/memory.py
 +++ b/eth/vm/memory.py
 @@ -1,8 +1,11 @@
@@ -180,7 +242,7 @@ index 2ccfd090..5950a4d4 100644
  from eth._utils.numeric import (
      ceil32,
  )
-+from eth.exceptions import PyEVMError
++from eth.exceptions import VMError
  from eth.abc import (
      MemoryAPI,
  )
@@ -231,7 +293,7 @@ index 2ccfd090..5950a4d4 100644
 -    def __len__(self) -> int:
 -        return len(self._bytes)
 +        if start_position + size >= 2**32:
-+            raise PyEVMError("Non 32-bit address")
++            raise VMError("Non 32-bit address")
 
 -    def write(self, start_position: int, size: int, value: bytes) -> None:
 -        if size:
@@ -268,6 +330,19 @@ index 2ccfd090..5950a4d4 100644
 -        buf = memoryview(self._bytes)
 +        buf = memoryview(self.memview)
          buf[destination : destination + length] = buf[source : source + length]
+diff --git a/eth/vm/transaction_context.py b/eth/vm/transaction_context.py
+index 79b570e9..5943f897 100644
+--- a/eth/vm/transaction_context.py
++++ b/eth/vm/transaction_context.py
+@@ -36,6 +36,9 @@ class BaseTransactionContext(TransactionContextAPI):
+         # post-cancun
+         self._blob_versioned_hashes = blob_versioned_hashes or []
+
++        # eip-7923
++        self.num_pages = 0
++
+     def get_next_log_counter(self) -> int:
+         return next(self._log_counter)
 ```
 
 ## Security Considerations