startie

Robust, cross-language, "random" tiebreaking for STAR elections

WORK IN PROGRESS - UNSTABLE

The goal here is to provide a way to break ties in STAR elections that:

Is reproducible across multiple language implementations.
Is reasonably fast and memory-frugal (depends on the number of candidates, but not on the number of voters).
Can't be predicted or manipulated, not even by an election administrator (but see "Limitations" below).
Has no external dependencies (e.g., no looking up winning lottery numbers, or stock market statistics, or web-based "randomness beacons", ...).

The idea is to produce a "random" permutation of the candidates. When a tie occurs, pick the candidate whose name appears earliest in that permutation.

Such a facility is widely assumed to be available in election research literature. but practical implementations are conspicuous by absence, or so feeble that they're easy to trick.

Instead we sort a list of candidate names, using as sort keys crypto hashes of (candidate_name, candidate_score) pairs. Nothing can be known about "score dicts" before the election closes, and when it does close it's too late to play dirty tricks. Of course the devil is in the details.

Suffice it to say that all known problems appear to have been worked out, and results should be wholly reproducible, with remarkably straightforward code, under any language with conforming implementations of Unicode UTF-8 encoding and SHA-512 crypto hashing. We're effectively using SHA-512 as a very high-quality PRNG.

API

The primary files are permute.py and permute.js. Each supplies one function, permute(score, magic=). which takes a dict (Python. or Object in Node) mapping names (Unicode strings) to scores (ints), and returns a "randomly permuted" list of names (the score dict's keys). magic is an optional bytes object intended to fold in external "genuine entropy", such as from Python's secrets.token_bytes(8} or Node's crypto.randomBytes(8).

Python

def permute(score: dict[str, int],
            magic: bytes=b'') -> list[str]:

$ py
>>> from permute import permute
>>> permute({'A': 1, 'B': 1, 'C': 1, 'D': 1})
['B', 'D', 'C', 'A']
>>> permute({'A': 1, 'B': 1, 'C': 1, 'D': 1}, bytes([42]))
['D', 'A', 'B', 'C']

Node.js

const EMPTY_BUFFER = Buffer.alloc(0);

function permute(score, magic=EMPTY_BUFFER) {

score is an Object with string properties and int values.
magic is a Node Buffer of little ints, a subclass of Uint8Array.

$ node
Welcome to Node.js v24.11.1.
Type ".help" for more information.
> {permute} = require("./permute")
{ permute: [Function: permute] }
> permute({'A': 1, 'B': 1, 'C': 1, 'D': 1})
[ 'B', 'D', 'C', 'A' ]
> permute({'A': 1, 'B': 1, 'C': 1, 'D': 1}, Buffer.from([42]))
[ 'D', 'A', 'B', 'C' ]

All implementations

Using magic is highly encouraged. Without it, there are known insecurities, as explained in "Limitations" below. While they appear to be at worst minor in elections of non-trivial size, better safe than sorry. An 8-byte "really random" magic expands the search space for all known attacks by a factor of $$2^{64}$$

Note that permute() is intended to be called exactly once per election, after the election is closed, to prepare for possible ties in the scoring phase. The code is written for clarity & simplicity rather than speed, but it's so fast you won't notice anyway. Time and RAM use scale with the number of candidates, which is never "large". The number of ballots is irrelevant

Other

compare_driver.py constructs random score dicts and ensures that the Python and Node implementations produce the same permutations. Edit it to change the number of test cases run, the number of candidates, and the maximum candidate score. That's less work for you too than trying to remember command line conventions 😉.

chitests.py uses chi-squared tests to measure how well permute() passes out all possible permutations about equally often. This gets very expensive even for as few as 10 candidates - and substantially larger than that would run out of RAM too! This work grows with the factorial of the number of candidates.

run_node.py supplies function node_permute(), with the same signature as the Python permute(), but invokes the Node version to return the result computed by the latter. Mostly for testing.

check_readme.py verifies the doctests in this file work. Very easy to miss otherwise.

Limitations

It's not actually true that nothing can be known about "score dicts" before the election closes. The names are known from the start, and the election admin contols what they are. In a Unicode world, there are many ways to change code points in ways that leave a given name "looking much the same", or even identical, despite that the UTF-8 encodings differ. Crypto hashes do a marvelous job of emulating true randomness, and changing a single bit in one's input changes about half the bits in the output, but they're still 100% deterministic. The outcomes of all "random" ties are determined solely by the final state of the score dict.

So a determined admin could, in theory, use "poke and hope" spelling changes and try all possible score dicts on each, and pick spellings that favor (or disfavor) some candidate(s) the most across all possible ties. This quickly becomes intractable as elections become larger: if there are B ballots and C candidates, there are $$(5B + 1)^C$$ possible score dicts. For example, even for a tiny 1-winner, 2-candidate STAR election with 2 voters, there are already $$11^2 = 121$$ possible score dicts.

Of course some score dicts are more likely than others, and there are many ways to try to optimize such brute forre hackery, but as B and C get ever larger so does the difficulty of finding even slightly advantageous spellings. As best anyone knows, there is currently no computationally tractable way "to out-think" what SHA-512 does, so brute force is needed.

No 100% deterministic method can be made wholly immune to this. I would love to incorporate some actual entropy (e.g., fold in 8 bytes from Python's secrets.token_bytes(8)), and then not even the admin could influence the outcome in any effective way, short of "stuffing the ballot box" with imaginary voters under whose names they cast their own ballots. I hope to make such a change, but it depends on whether clients are willing to change their UIs to report the magical bytes picked along with the final anonymized ballots, so that their claimed outcomes can be independently reproduced. Later: and I made that change. The API now supports an optional magic argument to incorporate genuine entropy. Its use is highly encouraged, but if it's ignored the results are the same as before.

A more fundamental limitation: given a fixed set of candidate names and a fixed salt, $C$ candidates, and $B$ ballots, there are $(5B+1)^C$ possible score dicts and $C!$ possible permutations. While the number of score dicts grows quickly with $C$, the number of permutations grows more quickly still. When the number of score dicts $\lt C!$, it's impossible to generate all possible permutations. A score dict determines the permutation, and there just aren't enough possibilities.

However, this doesn't appear to be a realistic concern. Even with just 2 ballots ($B=2$), the number of possible score dicts exceeds the number of permutations until there are 28 candidates:

>>> from math import factorial
>>> 11**27 < factorial(27)
False
>>> 11**28 < factorial(28)
True

Q&A

Q: What remains to be done? Do you want help?

A: I'm happy with the design and the Python code, and testing has gone very well. But I'm not a native JavaScript speaker, and would really appreciate it if someone who is reviewed the JS code for "common sense" and idiomatic expression. Most of it was pasted from code suggested by a chatbot! It works, but is still a foreign language to me.

Q: When building the "canonical salt", the Python code returns a hashlib object, but the JS code a buffer of raw bytes. Is that an error?

A: Sure hope not 😉. This is due to that JS's crypto-hash API doesn't appear to offer a .copy() method. We're feeding the same initial raw bytes into the hash for every candidates' sort key, and using .copy() for that is the purpose of .copy(). It's not really for efficiency (although it is faster), but for conceptual clarity. The JS code has no choice but to feed those raw prefix bytes into the sort key hashes repeatedly. The outcomes in the end are identical.

Q: Unicode always causes problems. Which ones have you missed?

A: Time will tell, but none that I know of. There are several standard ways of encoding Unicode code points, but correctly implemented conversions between any pair are lossless and reversible. We use UTF-8 because every language can convert to that, and a sequence of bytes is exactly what this algorithm needs. There's generally a minor problem when computing a crypto hash from multiple fields: the input bytes of one field are catenated with the input bytes of the next, and the idea that they're different fields gets lost. But, in UTF-8, a zero byte never appears unless it's the one-character ANSI 0 byte, and candidate names will never contain one of those. So we generally alternate UTF-8 input fields with integer fields, and the latter are guaranteed (by construction) to have a 0 byte at each end. So one field can't be accidentally mistaken as part of a different field.

Other potential problems could come from "normalization", fancy schemes that actually change code points. We certainly do none of that, and I doubt any voting service would either. They're just using Unicode to display candidate names faithfully, not analyzing them or doing computation on them.

Q: What happens if two sort keys are the same?

A: Won't happen. Note that two names' UTF-8 encodings can't be the same in this context. It's an election, and if ballots displayed two names the same way, voters couldn't distinguish between them. So all names display uniquely. That implies that their UTF-8 encodings must also be pairwise distinct ("different displays" implies "different sequence of Unicode code points" implies "different encodings").

So it remains that their crypto hashes may nevertheless be the same. That's phenomenally unlikely. Collison resistance is a primary design goal of crypto hashes, and to date there is no publicly known case of two distinct inputs of any kind whose hashes collide. That's not for lack of trying.

If it happens anyway, the tiebreaking may or may not match across implementations. The code doesn't care, and will proceed to deliver whatever the implementation language's sort does about equal keys. In Python, the sort is stable, so at least the results will be reproducible across runs of the Python implementation. I'm not sure about Node.js details. but believe that's true under that too. If both environments have stable sorts, their results will match (it's sorted by UTF-8 first, which forces the same order regardless of ioput order, and later sorted by crypto hash).

But, "won't happen", so don't worry about it 😄.

Acks

Thanks to Larry Hastings, for talks about this kind of approach when he was writing his lovely starvote library. The idea here is similar, but less ambitious, simpler, and much less tied to Python quirks. Larry uses a crypto hash to seed Python's Mersenne Twister; we don't use a conventional PRNG at all. Larry goes on to build a crypto hash of all the ballots, but we only look at the score dict. That's partly for efficiency, but also looking forward to a future when STAR is the only voting method in use 😄. Then "precinct summability" becomes important - there may not be a "collection of ballots" in one place to look at, just a sum of score dicts aggregated from different precincts.
Thanks to the folks at the STAR voting service. for discussions, interest, encouragement, and contributing code to an earlier attempt at writing a cross-language "good enough" pseudo-random list shuffler. I've given up on that, as finding a "provably fair and secure" way to seed it proved to be a bottomless pit. If the STAR folks adopt this new approach, I'll pester Larry to add it as an optional way for starvote to break ties too.
And special thanks to ChatGPT-5! It knows a whole lot more about JavaScript than I know, and most of its suggestions worked on the first try. It also showed shockingly deep insights into the nuances of the problem domain, such as why the central limit theorem played a key role in why an earlier attempt produced demonstrably biased permutations.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
check_readme.py		check_readme.py
chitests.py		chitests.py
compare_driver.py		compare_driver.py
permute.js		permute.js
permute.py		permute.py
permute_server.js		permute_server.js
permute_stdinout.js		permute_stdinout.js
run_node.py		run_node.py
versions.txt		versions.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

startie

API

Python

Node.js

All implementations

Other

Limitations

Q&A

Acks

About

Uh oh!

Releases

Packages

Languages

License

tim-one/startie

Folders and files

Latest commit

History

Repository files navigation

startie

API

Python

Node.js

All implementations

Other

Limitations

Q&A

Acks

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages