# Connection Pool

## Overview

The load-balancing algorithm is designed to optimize the allocation and
management of database connections in a way that maximizes Quality of Service
(QoS). This involves minimizing the overall time spent on connecting and
reconnecting (connection efficiency) while ensuring that latencies remain
similar across different streams of connections (fairness).

## Architecture

This library is split into four major components:

 1. The low-level blocks/block, connections, and metrics code. This code
    creates, destroys and transfers connections without understanding of
    policies, quotas or any sort of algorithm. We ensure that the blocks and
    metrics are reliable, and use this as a building block for our pool.
 2. The algorithm. This performs planning operations for acquisition, release
    and rebalancing of the pool. The algorithm does not perform operations, but
    rather informs the caller what it should do.
 3. The pool itself. This drives the blocks and the connector interface, and
    polls the algorithm to plan next steps during acquisition, release and
    during the timer-based planning callback.
 4. The Python integration code. This is behind an optional feature, and exposes
    a PyO3-based interface that allows a connection factory to be implemented in
    Python.
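
A rough sketch of the split between the algorithm (component 2) and the pool
(component 3), pictured as a planner that the pool polls for actions. This is
illustrative only: the `Planner` trait, the `PlanAction` enum and their methods
are assumptions, not this library's actual API.

```rust
/// What the pool should do next, as decided by the planning algorithm.
/// The algorithm only *plans*; the pool executes.
enum PlanAction {
    /// Create a brand-new connection for this block.
    Create { block: usize },
    /// Move an idle connection from `victim` to `block`.
    Steal { block: usize, victim: usize },
    /// Close an idle connection held by this block.
    Close { block: usize },
    /// Nothing to do right now.
    Wait,
}

/// Polled by the pool during acquisition, release, and the timer-based
/// planning callback; it never creates or destroys connections itself.
trait Planner {
    fn plan_acquire(&mut self, block: usize) -> PlanAction;
    fn plan_release(&mut self, block: usize) -> PlanAction;
    fn plan_rebalance(&mut self) -> Vec<PlanAction>;
}
```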

## Details

Demand for connections is measured in terms of “database time,” which is
calculated as the product of the number of connections and the average hold time
of these connections. This metric provides a basis for determining how resources
should be distributed among different database blocks to meet their needs
effectively.
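
As a concrete example of the metric, here is a minimal sketch of the “database
time” calculation, assuming demand is tracked as a connection count plus an
average hold time (the function name is an illustration):

```rust
use std::time::Duration;

/// "Database time" demanded by a block: the number of connections in
/// demand multiplied by the average time each connection is held.
fn demand(connections_in_demand: u32, avg_hold_time: Duration) -> Duration {
    avg_hold_time * connections_in_demand
}

fn main() {
    // A block asking for 8 connections held ~50ms each demands ~400ms of
    // sequential database time.
    assert_eq!(demand(8, Duration::from_millis(50)), Duration::from_millis(400));
}
```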

To maximize QoS, the algorithm aims to minimize the time spent on managing
connections and keep the latencies low and uniform across various connection
streams. This involves allocation strategies that balance the immediate needs of
different database blocks with the overall system capacity and future demand
predictions.

When a connection is acquired, the system may be in a state where the pool is
not currently constrained by demand. In such cases, connections can be allocated
greedily without complex balancing, as there are sufficient resources to meet
all demands. This allows for quick connection handling without additional
overhead.
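
A minimal sketch of this fast path, assuming the pool tracks an in-use count
against a fixed total capacity (both names are assumptions):

```rust
/// Illustrative pool state: connections in use versus total capacity.
struct Pool {
    in_use: usize,
    max_total: usize,
}

impl Pool {
    /// While total usage is below capacity, the pool is unconstrained and a
    /// connection can be granted greedily, with no balancing work at all.
    fn try_greedy_acquire(&mut self) -> bool {
        if self.in_use < self.max_total {
            self.in_use += 1;
            true
        } else {
            false // constrained: fall through to the stealing algorithm
        }
    }
}
```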

When the pool is constrained, the “stealing” algorithm aims to transfer
connections from less utilized or idle database blocks (victims) to those
experiencing high demand (hunger) to ensure efficient resource use and maintain
QoS. A victim block is chosen based on its idle state, characterized by holding
connections but having low or no immediate demand for them.
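
Victim selection might look like the following sketch, assuming per-block
counters for idle connections and current demand (the struct and field names
are assumptions):

```rust
/// Illustrative per-block counters used to pick a victim.
struct BlockStats {
    idle_connections: usize,
    demand: usize, // e.g. waiters, or estimated database time
}

/// A victim must actually hold idle connections; among candidates, the one
/// with the least immediate demand gives up a connection.
fn choose_victim(blocks: &[BlockStats]) -> Option<usize> {
    blocks
        .iter()
        .enumerate()
        .filter(|(_, b)| b.idle_connections > 0)
        .min_by_key(|(_, b)| b.demand)
        .map(|(i, _)| i)
}
```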

Upon releasing a connection, the algorithm evaluates which backend (database
block) needs the connection the most (the hungriest). This decision is based on
current demand, wait times, and historical usage patterns. By reallocating
connections to the blocks that need them most, the algorithm ensures that
resources are utilized efficiently and effectively.
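
A sketch of the “hungriest” choice, under the assumption that blocks rank first
by waiting acquires and then by estimated demand (names are illustrative):

```rust
/// Illustrative per-block hunger inputs.
struct Block {
    waiters: usize,     // acquires currently blocked on this block
    est_demand_ms: u64, // estimated sequential database time needed
}

/// Rank blocks by (waiters, estimated demand); a block with neither has no
/// claim on the released connection.
fn hungriest(blocks: &[Block]) -> Option<usize> {
    blocks
        .iter()
        .enumerate()
        .filter(|(_, b)| b.waiters > 0 || b.est_demand_ms > 0)
        .max_by_key(|(_, b)| (b.waiters, b.est_demand_ms))
        .map(|(i, _)| i)
}
```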

Unused connection capacity is eventually reclaimed to prevent wastage. The
algorithm includes mechanisms to identify and collect these idle connections,
redistributing them to blocks with higher demand or returning them to the pool
for future use. This helps maintain an optimal number of active connections,
reducing unnecessary resource consumption.
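
Reclamation can be sketched as a periodic sweep over idle connections, assuming
an idle-timeout cutoff (the cutoff and names are assumptions, not documented
configuration):

```rust
use std::time::{Duration, Instant};

/// An idle connection and when it last saw use.
struct IdleConn {
    idle_since: Instant,
}

/// Drop connections idle longer than `max_idle`, returning how many were
/// reclaimed so their capacity can be redistributed.
fn reclaim(idle: &mut Vec<IdleConn>, max_idle: Duration) -> usize {
    let before = idle.len();
    idle.retain(|c| c.idle_since.elapsed() < max_idle);
    before - idle.len()
}
```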

To avoid excessive thrashing, the algorithm ensures that connections are held
for a minimum period: the longer of the time it takes to reconnect to the
database and a configured minimum threshold. This reduces the frequency of
reallocation, preventing performance degradation due to constant connection
churn and ensuring that blocks can maintain stable and predictable access to
resources.
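
The anti-thrash rule reduces to a simple eligibility check, assuming the pool
tracks how long a connection has been held, the measured reconnect time, and a
configured floor (names are illustrative):

```rust
use std::time::Duration;

/// A connection only becomes eligible for transfer once it has been held
/// for the longer of the measured reconnect time and a configured floor,
/// so blocks are not churned constantly.
fn eligible_for_transfer(
    held_for: Duration,
    reconnect_time: Duration,
    configured_min: Duration,
) -> bool {
    held_for >= reconnect_time.max(configured_min)
}
```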

## Detailed Algorithm

The algorithm is designed to 1) maximize time spent running queries in a
database and 2) minimize latency of queries waiting for their turn to run. These
goals may be in conflict at times. We do this by minimizing the time spent
switching between databases, which is considered "dead time", as the database
is not actively performing operations.

The demand for a connection is based on estimated total sequential processing
time. We use the average time that a connection is held, multiplied by the
number of connections in demand, as a rough estimate of how much total
sequential time a certain block will demand in the future.

At a regular interval, we compute two items for each block: a quota, and a
"hunger" metric. The hunger metric may indicate that a block is "hungry"
(wanting more connections), "satisfied" (having the expected number of
connections) or "overfull" (holding more connections than it should). The
"hungry" score is determined by the estimated total sequential time needed for
a block. The "overfull" score is determined by the number of extra connections
held by this block, in combination with how old the longest-held connection is.
The quota is determined by the connection rate.
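
One way to picture the per-interval planning state is the sketch below; the
scoring inputs follow the description above, but the types and names are
illustrative:

```rust
use std::time::Duration;

/// Illustrative hunger states computed for each block at every interval.
enum Hunger {
    /// Wants more connections; scored by estimated sequential time needed.
    Hungry { needed_time: Duration },
    /// Holds the expected number of connections.
    Satisfied,
    /// Holds extra connections; scored by how many are extra and how old
    /// the longest-held connection is.
    Overfull { extra: usize, oldest_held: Duration },
}

/// Per-block planning output: a quota derived from the connection rate,
/// plus the hunger state.
struct BlockPlan {
    quota: usize,
    hunger: Hunger,
}
```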

We then use the hunger metric and quota in an attempt to rebalance the pool
proactively, ensuring that the connection capacity of each block reflects its
most recent demand profile. Blocks are sorted into a list of hungry blocks and
a list of overfull blocks, and we attempt to transfer connections from the most
overfull to the hungriest until we run out of either list. We may not be able
to perform the rebalance fully because of block activity that cannot be
interrupted.
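
The rebalance pass can be sketched as walking both sorted lists in tandem; the
`transfer` callback stands in for the pool actually moving a connection and may
fail when a block's activity cannot be interrupted (all names are assumptions):

```rust
/// Pair the hungriest block with the most overfull one, transferring a
/// connection for each pair until either list runs out.
fn rebalance(
    hungry: &mut Vec<usize>,   // block ids, hungriest first
    overfull: &mut Vec<usize>, // block ids, most overfull first
    mut transfer: impl FnMut(usize, usize) -> bool, // (from, to) -> succeeded
) {
    while !hungry.is_empty() && !overfull.is_empty() {
        let to = hungry.remove(0);
        let from = overfull.remove(0);
        // The transfer may fail if the victim's activity cannot be
        // interrupted; the pass simply moves on to the next pair.
        let _ = transfer(from, to);
    }
}
```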

If a connection is requested for a block that is hungry, it is allowed to steal
a connection from the block that is most overfull and has idle connections.
Because the "overfull" score is calculated in part from the age of the
longest-held connection, this minimizes context switching.

When a connection is released, we choose what happens based on its state. If
acquires are waiting on this block, we return the connection to the block to be
re-used immediately. If no acquires are waiting but the block is hungry, we
also return it. If the block is satisfied or overfull and we have hungry blocks
waiting, we transfer it to a hungry block that has waiters.
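
The release decision tree maps directly onto a small function; the enum and
parameter names are illustrative:

```rust
/// Illustrative outcome of releasing a connection.
enum ReleaseAction {
    /// Hand it to a waiter on this block, or keep it here for re-use.
    ReturnToBlock,
    /// Move it to a hungry block that has waiters.
    TransferTo(usize),
    /// No one needs it right now; leave it idle for later reclamation.
    Idle,
}

fn on_release(
    has_waiters: bool,
    is_hungry: bool,
    hungry_block_with_waiters: Option<usize>,
) -> ReleaseAction {
    if has_waiters || is_hungry {
        ReleaseAction::ReturnToBlock
    } else if let Some(block) = hungry_block_with_waiters {
        // Block is satisfied or overfull; someone else needs it more.
        ReleaseAction::TransferTo(block)
    } else {
        ReleaseAction::Idle
    }
}
```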

## Error Handling

The pool will attempt to provide a connection where possible, but connection
operations may not always be reliable. The error for a connection failure will
be routed through the acquire operation if the pool detects there are no other
potential sources of a connection for that acquire. Sources for a connection
may be a currently-connecting connection, a reconnecting connection, a
connection that is actively held by someone else, or a connection that is
sitting idle.

The pool does not currently retry; retry logic should be included in the
connect operation.
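
The routing rule can be sketched as a check over the potential sources listed
above; the struct and field names are assumptions for illustration:

```rust
/// Potential sources that might still satisfy a pending acquire.
struct BlockSources {
    connecting: usize,   // connections currently being established
    reconnecting: usize, // connections mid-reconnect
    held: usize,         // connections actively held by someone else
    idle: usize,         // connections sitting idle
}

/// A connect failure is surfaced through the acquire operation only when
/// no other potential source remains for that acquire.
fn route_error_to_acquirer(s: &BlockSources) -> bool {
    s.connecting + s.reconnecting + s.held + s.idle == 0
}
```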