Skip to content

Commit c5d7804

Browse files
authored
Merge pull request #14 from esl/design_notes
Add design notes to the documentation
2 parents 74cdf8e + 891696c commit c5d7804

File tree

3 files changed

+31
-3
lines changed

3 files changed

+31
-3
lines changed

README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ Here instead, the cache uses a –bunch of– ets table with `read_concurrency`,
2929
All operations hove been carefully written following the latest OTP [efficiency guide](https://erlang.org/doc/efficiency_guide/users_guide.html), including maps operations that improve sharing, avoiding unnecessary copying from ETS tables, inline list functions, use `atomics` and `persistent_term` as in [this guide](https://blog.erlang.org/persistent_term/).
3030

3131
### Instrumentation
32-
These days, all modern services must be instrumented. This cache library helps follow the RED method –Rate, Errors, Duration–: that is, lookup operations raise telemetry events with name `[segmented_cache, request]` with information whether there was a hit or not (`hit := boolean()`), and the time the lookup took, in microseconds (`time := integer()`). With this, we can aggregate the total Rate, and extract the proportion of cache misses, or Errors, while knowing the Duration of the lookups.
32+
These days, all modern services must be instrumented. This cache library helps follow the RED method –Rate, Errors, Duration–: that is, lookup operations raise telemetry events with name `[segmented_cache, Name, request]` with information whether there was a hit or not (`hit := boolean()`), and the time the lookup took, in microseconds (`time := integer()`). With this, we can aggregate the total Rate, and extract the proportion of cache misses, or Errors, while knowing the Duration of the lookups. See the documentation for details.
3333

3434
## Configuration
3535

design_notes.md

+24
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
Here are the key points of the design:
2+
3+
## Cache state
4+
- The cache records are stored in multiple ETS tables called _segments_, stored in a tuple to improve access times.
5+
- There's one _current_ table pointed at by an index, stored in an [`atomic`](https://erlang.org/doc/man/atomics.html).
6+
- The `persistent_term` module is used to store the cache state, which contains the tuple of segments and the atomic reference. It's initialised just once, and it never changes after that.
7+
- The atomic value is changing, but most importantly: changing in a lock-free manner.
8+
- Writes are always done at the _current_ table.
9+
- Reads iterate through all of them in order, starting from the _current_ one.
10+
11+
## TTL implementation
12+
- Tables are rotated periodically and data from the last table is dropped.
13+
- `ttl` is not 100% accurate, as a record can be inserted during rotation and therefore live one segment less that expected: we can treat the `ttl` as a warranty that a record will live less than `ttl` but at least `ttl - ttl/N` where `N` is the number of segments (ETS tables).
14+
- There's a `gen_server` process responsible for the table rotation (see [Cache state notes](#cache-state-notes) for more details).
15+
16+
## LRU implementation
17+
- On `segmented_cache:is_member/2` and `segmented_cache:get_entry/2` calls, the record is reinserted in the _current_ ETS table.
18+
19+
## Distribution
20+
- In a distributed environment, cache is populated independently on every node.
21+
- However, we must ensure that on deletion the cache record is removed on all the nodes (and from all the ETS tables, see [LRU implementation notes](#lru-implementation-notes))
22+
- There's a `gen_server` process responsible for ETS tables rotation (see [TTL implementation notes](#ttl-implementation-notes))
23+
- The same process is reused to implement asynchronous cache record deletion on other nodes in the cluster.
24+
- In order to simplify discovery of these `gen_server` processes on other nodes, they all are added into a dedicated `pg` group.

rebar.config

+6-2
Original file line numberDiff line numberDiff line change
@@ -37,8 +37,12 @@
3737
{hex, [
3838
{doc, #{provider => ex_doc}}
3939
]}.
40+
4041
{ex_doc, [
4142
{source_url, <<"https://github.com/esl/segmented_cache">>},
42-
{extras, [<<"README.md">>, <<"LICENSE">>]},
43-
{main, <<"readme">>}
43+
{main, <<"readme">>},
44+
{extras, [{'README.md', #{title => <<"Overview">>}},
45+
{'design_notes.md', #{title => <<"Design Notes">>}},
46+
{'LICENSE', #{title => <<"License">>}}
47+
]}
4448
]}.

0 commit comments

Comments
 (0)