A Docker-based HLS/DASH test server for video players. Generates LL-HLS and LL-DASH streams (plus 2s and 6s segment variants) from short VOD content on a shared clock, and lets you inject deterministic, streaming-aware failures — HTTP errors, hung responses, corrupted segments, transport drops, bandwidth limits — on a per-session basis so player bugs become reproducible.
Built for player QA, SDK development, and side-by-side comparison across HLS.js, Shaka, Video.js, native, iOS/tvOS, Android, and Roku. Or for anyone interested in how ABR interacts with content, networking, and players. Not a production streaming origin.
Everything is driven by a REST API (the dashboard is a thin client over it), so every fault, every shaping change, every session control available in the UI is also available to test scripts and CI.
Player bugs are usually environmental — a network blip, a truncated segment, a slow manifest, a discontinuity the player didn't handle. Reproducing them is the hard part of player QA. Existing options each fall short:
- Public test streams (Apple samples, DASH-IF vectors, Bitmovin demos) aren't deterministic, don't loop cleanly, and can't fail on command.
- Production origins (Wowza, AWS Elemental, Unified Streaming) are built to serve real viewers, not to misbehave on purpose.
- DIY stacks (ffmpeg + nginx-vod or nginx-rtmp) take days to wire up and don't give you LL-HLS and LL-DASH from the same clock, let alone a fault-injection UI.
- Generic chaos tools (toxiproxy, `tc` alone, Chaos Mesh) are protocol-agnostic — they don't understand segments, playlists, or manifests, so the failures they create aren't meaningful at the streaming layer.
- Man-in-the-middle proxies (Charles, mitmproxy) work per-request and per-operator; they don't script repeatable schedules across runs or isolate concurrent testers.
What makes InfiniteStream different:
- Deterministic looping live. The sliding window moves on a stable clock and wraps on loop boundaries. The same run produces the same timing.
- All variants from one worker. LL-HLS, LL-DASH, 2s, and 6s segment variants are generated from a single per-content worker on a shared clock, so cross-protocol comparisons are apples-to-apples.
- Streaming-aware fault injection. Inject HTTP errors, hangs, and payload corruption at the segment, playlist, or manifest layer — not just generic TCP faults. The All override applies one rule to every HTTP request kind in a single click.
- Master-playlist manipulation. Rewrite the HLS master on the fly — strip CODEC / AVERAGE-BANDWIDTH, overstate BANDWIDTH, allowlist specific rungs, change the live hold-back offset. Explore how ladder shape and optional HLS attributes affect startup, ABR, and live-edge latency without re-encoding.
- Transport faults too. Port-level DROP/REJECT via nftables, composable with HTTP-layer faults.
- Interactive rate shaping with deterministic patterns. Drag throughput, latency, loss, and pattern-mode sliders and watch the player react in real time. Or build a step-based throughput pattern with a fixed step duration so the same curve replays bit-for-bit on every run — what you saw rebuffer at 09:14 yesterday rebuffers again today.
- Per-session isolation. Each browser session binds to a dedicated proxy port via `player_id`, so concurrent testers don't collide.
- Streaming-aware HAR. The dashboard's network-log waterfall annotates each request with what actually happened on the wire. A row that looks like a 200 but ended badly is tagged with `!✂` (proxy-injected body cut), `!⏱` (server transfer-timeout), or `!↩` (client gave up), so the chart tells the truth even when the status code can't.
- One-tap incident capture. A "911" button on every player (iOS / iPadOS / tvOS / Android TV) snapshots the current session's HAR — a 10-minute window of every request, fault, and play, written to the Incidents browser for replay later. No "what did you click before it broke?" debugging.
- Session grouping for differential testing. Link two or more independent sessions so fault injection and network shaping apply to all of them simultaneously, while everything else (player engine, codec, live offset, platform, ladder constraints) stays independent per session. One bandwidth collapse, two `live_offset` values, instant apples-to-apples comparison of which setting rebuffers. Or iOS vs Android reacting to the exact same throughput curve. See Session grouping.
- Side-by-side comparison UI. Mosaic view for watching multiple players or encodings against the same source simultaneously. Quartet (alpha) extends this to four-panel layouts.
- Accessible to non-programmers. The whole surface is a web UI — click a fault type, drag a throttle slider, flip a Content-tab toggle, watch the bitrate / buffer / FPS charts react in real time. No Python addons, no YAML, no CLI. A QA analyst, a producer, or a support engineer can run real experiments and see cause-and-effect without writing any code.
- Everything is a REST API. No UI-only controls — anything a tester can do, a CI job can do.
Use cases:
- Regression-test a player release against a known fault schedule before shipping.
- Reproduce a customer-reported stall by replaying the exact fault sequence.
- Compare HLS.js vs Shaka vs Video.js vs native on identical content under identical faults.
- Differential testing via session grouping. Link N independent sessions so one fault or throughput change hits all of them at the same moment — while each session keeps its own player, platform, codec, ladder, or live-offset. See Session grouping.
- Validate loop-boundary and discontinuity handling.
- Characterize a player's ABR algorithm under controlled bandwidth steps.
- Pre-validate ladder and HLS parameter decisions ("what if we dropped the 4K rung?", "what if we removed AVERAGE-BANDWIDTH?") without re-encoding.
- Smoke-test a new encoding ladder before promoting to staging.
Comparison to alternatives:
| Tool / approach | Good at | Why InfiniteStream instead |
|---|---|---|
| Public test streams (Apple, DASH-IF, Bitmovin demos) | Quick playback sanity checks | Not deterministic; no failure injection; no looping on your schedule |
| Production origins (Wowza, AWS Elemental, Unified Streaming) | Serving real viewers | Heavy, costly; not built for faults or side-by-side QA |
| FFmpeg + nginx-rtmp / nginx-vod (DIY) | Full control of the stack | Days of setup; no shared-clock LL-HLS + LL-DASH; no fault UI |
| Shaka Streamer | Packaging pipelines | Not a live test server; no looping, no faults, no dashboard |
| toxiproxy, tc, Chaos Mesh | Generic network faults | Protocol-agnostic — no awareness of segments, partials, playlists |
| Charles, mitmproxy | Per-request rewriting | Manual, per-operator; not scripted or repeatable across runs |
| mediamtx, SRS, OvenMediaEngine | Live ingest and serving | Not looping-VOD focused; not QA-focused; no fault injection |
When not to use it:
- You need a production streaming origin.
- You need full standards conformance on every LL-HLS / LL-DASH edge case (see `PRD.md` for the list of known limitations).
- You need DRM.
- You need scale beyond a handful of concurrent test sessions.
- Docker (and Docker Compose).
- A media directory on the host (`CONTENT_DIR`) for source files and encoded output. It will be mounted into the container as `/media`.
- TLS certificates in `$CONTENT_DIR/certs/`. Self-signed certs are auto-generated on first startup if none exist. To provide your own, drop `localhost.pem` and `localhost-key.pem` into `$CONTENT_DIR/certs/` before starting.
```sh
git clone https://github.com/jonathaneoliver/infinite-streaming.git
cd infinite-streaming
cp .env.example .env      # edit CONTENT_DIR
docker compose up -d      # first run builds the image
```

Open http://localhost:30000/ in a browser. That's the dashboard — everything else happens there.
Other deployment options (pre-built images, single container, k3s) are in Advanced deployment at the bottom.
All of the testing features are accessible through the built-in web dashboard. A common first pass:
- Load some content. Open the dashboard → Upload Content, pick a video and encoding options, and submit. Or copy a file straight into `$CONTENT_DIR/originals/` and hit refresh on the Source Library page.
- Watch it play. Open Playback, pick a protocol (HLS / DASH), segment duration (LL / 2s / 6s), codec, and player engine (HLS.js / Shaka / Video.js / Native / Auto). The player starts immediately.
- Open a Mosaic comparison. Mosaic (Grid) shows multiple tiles filtered by protocol / codec / segment, all playing the same source, all in sync. Useful for spotting per-player differences at a glance.
- Open a Testing Session. Right-click any tile in Mosaic → Open in Testing Window. The testing page binds your browser session to a dedicated proxy port, so failures and shaping you configure here only affect your session.
- Break things on purpose. Inside the Testing Session card:
  - Fault Injection → Segment / Manifest / Master tabs — inject 404s, timeouts, hangs, corrupted payloads on a repeatable schedule.
  - Fault Injection → Transport tab — drop or RST the port via nftables.
  - Fault Injection → Content tab — rewrite the HLS master (strip CODECs, hide rungs, overstate bandwidth, change live offset).
  - Network Shaping — delay / loss / throughput sliders or scripted patterns (square wave, ramp up/down, pyramid).
- Watch the reaction. The Bitrate Chart stacks three time-series charts (bitrate, buffer depth, FPS) on a shared 10-minute timeline. A bandwidth dip, buffer drop, and FPS stutter all line up visually, so root-causing is a matter of eyeballing the three charts together.
- Group two sessions for a side-by-side. Open a second Testing Session for the same source but with a different player engine, live offset, or device — then link the two as a group. From then on, any fault or throughput change applies to both simultaneously, while each session keeps its own player config. See Session grouping below for the canonical use cases.
That walkthrough exercises the majority of the system. The sections below describe each part in depth.
- Playback — single-stream view with protocol, codec, segment, and player selection. Auto-plays on change.
- Mosaic (Grid) — multi-tile view with filters (protocol / codec / segment). Right-click a tile to open a Testing Session.
- Quartet (alpha) — four-panel side-by-side comparison across encodings or players.
- Live Offset (alpha) — compares live offset, buffer depth, and seekable ranges across variants.
- Testing Session — per-session failure injection, traffic shaping, content manipulation, metrics charts. See below.
- Incidents — browse recorded HAR snapshots from every session. Click any incident to render its waterfall inline; bulk-delete cleanup; reason filters distinguish auto-captures (stall / segment-stall / 911) from manual `Save HAR` clicks.
- Go-Monitor — active workers, request counts, last-request time, idle timeout, tick timings.
- Upload Content — web upload + encoding job tracking.
- Source Library — list of `$CONTENT_DIR/originals/`. Click to kick off re-encodes.
Selected content and URL persist across pages in localStorage (`ismSelected*`).
Open directly if you don't want to come from Mosaic:
```
http://localhost:30000/dashboard/testing-session.html?player_id=<uuid>&url=<encoded-stream-url>
```
The `player_id` is required. go-proxy uses it to allocate a session-specific port (`30181..30881`) so that failure injection and shaping stay scoped to your session.
- Retry Fetch — re-issue the current stream request without resetting the player.
- Restart Playback — destroy and rebuild the player, then reload the URL.
- Reload Page — full page reload with current query params.
- Player selector — HLS.js / Shaka / Video.js / Native / Auto.
The Fault Injection card has separate tabs for All, Segment, Manifest, Master, Transport, and Content.
- All (default-selected, override) — one rule applied to every HTTP request kind. Same controls as the Segment tab; when its Failure Type is anything other than `none`, the per-kind tabs disable with an "All override active" banner. Use this when you want a blunt-instrument blast across segments, media manifests, and master at once.
- Segment / Manifest / Master — per-kind rules with independent config. Same control shape:
  - Failure Type (must be non-`none` to activate).
  - Units: Requests / Seconds / Failures-per-Second.
  - Consecutive: how wide the fault window is.
  - Frequency: spacing between fault windows (full cycle length, not the gap after recovery).
  - Scope: which variants the rule applies to (defaults to `All`).
Changes auto-save. The full fault matrix (status codes, socket misbehavior, corruption) is in `docs/FAULT_INJECTION.md`.
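In Requests units, the Consecutive/Frequency pair reads as a repeating cycle. A toy sketch of one plausible interpretation (Frequency as full cycle length, Consecutive as window width) — the authoritative semantics are in `docs/FAULT_INJECTION.md`:

```python
def faulted_requests(total, consecutive, frequency):
    """Request indices that fall inside a fault window, assuming the fault
    fires for the first `consecutive` requests of every `frequency`-long cycle.
    Illustrative model only, not the proxy's actual scheduler."""
    return [i for i in range(total) if i % frequency < consecutive]

# Consecutive=2, Frequency=5: fail 2 requests, serve 3, repeat.
print(faulted_requests(10, consecutive=2, frequency=5))  # → [0, 1, 5, 6]
```

The point of the "full cycle length" definition is determinism: the schedule depends only on the request index, not on when recovery happened.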
- Fault Type: None / Drop / Reject (via nftables).
- Units: Seconds or Packets / Seconds.
- Consecutive: duration (seconds mode) or packet threshold (packets mode).
- Frequency (secs): cycle spacing. `0` means one-shot.
- Counters: UI shows current/last `Drop pkts` and `Reject pkts`.
Linux-only (macOS Docker Desktop can't do this).
- Delay / Loss / Throughput sliders — steady-state shaping (0–250 ms, 0–10%, 0–50 Mbps). Throughput is disabled when a pattern is active.
- Pattern mode: `sliders` (static), `square_wave`, `ramp_up`, `ramp_down`, `pyramid`. Non-`sliders` modes drive throughput through a scripted sequence of steps.
- Step duration: `6s` / `12s` / `18s` / `24s` — how long each pattern step holds.
- Margin: `Exact` / `+10%` / `+25%` / `+50%` — headroom added on top of each ladder bitrate when picking preset rates.
- Throughput presets are generated from the current manifest's variants (video + audio, deduped). Effective rate: `shaping_mbps = variant_mbps × (1 + margin_pct/100) + overhead_mbps`, where `overhead_mbps` is computed from the audio playlist bandwidth plus a fixed `0.05 Mbps` playlist allowance. Presets below the lowest video variant (stall-risk threshold) are flagged.
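The preset formula can be checked numerically. A small sketch, assuming an illustrative three-rung ladder and a 0.128 Mbps audio rung (both invented for the example):

```python
def preset_mbps(variant_mbps, margin_pct, audio_mbps):
    """Effective shaping preset per the formula above; 0.05 Mbps is the
    fixed playlist allowance."""
    overhead_mbps = audio_mbps + 0.05  # audio playlist bandwidth + playlist allowance
    return round(variant_mbps * (1 + margin_pct / 100) + overhead_mbps, 3)

ladder = [0.8, 3.5, 7.1]  # example video rungs, Mbps (illustrative)
presets = [preset_mbps(v, margin_pct=25, audio_mbps=0.128) for v in ladder]
stall_risk = [p for p in presets if p < min(ladder)]  # flagged below lowest video rung
print(presets)
```

With any positive margin the preset for a rung always sits above that rung, so the stall-risk flag only matters for presets derived from rungs below the one currently playing.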
The Content tab rewrites the HLS master playlist on the fly — no re-encoding. A fast feedback loop for testing ladder decisions and optional HLS attribute choices.
| Control | Effect | Typical impact |
|---|---|---|
| Strip CODEC | Remove `CODECS=` from `EXT-X-STREAM-INF` | Players can't do "chunkless prepare" — longer startup, extra probing fetches |
| Strip AVERAGE-BANDWIDTH | Remove `AVERAGE-BANDWIDTH=` | Players that weight average over peak may pick a different initial rung |
| Overstate Bandwidth | Inflate `BANDWIDTH` by 10% | Simulates over-conservative encoder declaration; players may sit on lower rungs |
| Allowed variants | Allowlist specific rungs — others removed | Sparse / top-heavy / single-rung ladder experiments |
| Live offset | None / 6s / 18s / 24s hold-back hints | Trade live-edge latency vs rebuffer risk on jitter |
HLS only; DASH is a placeholder. Two-phase: play once to populate the variant list, then replay with the same `player_id`. Full reference in `docs/FAULT_INJECTION.md`.
ATS-style transfer timeouts the proxy enforces against the client — useful for testing player behavior when the server is slow or stalls mid-body. Two independent values, each gated by a per-kind Apply To checkbox:
- Active timeout (s) — total wall-clock budget from request received → response complete. Fires whether bytes are flowing or not. Trips pre-headers as 504, mid-body as a transfer cut.
- Idle timeout (s) — gap timer reset on every successful write to the client. Fires when the proxy → player flow goes silent for the configured window — i.e., the player stopped draining bytes.
- Apply To: `Segments` / `Media manifests` / `Master manifest`. Off for a kind = no timeout for that kind. Most testing leaves segments-only checked since manifests / master are too small to meaningfully time out.
When a timeout fires, the network-log waterfall renders the row with `!⏱` so you can tell it apart from a fault-injection cut (`!✂`) or a player abort (`!↩`).
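The interaction between the two timers can be sketched as a toy model. This is an assumption about ordering for illustration, not the proxy's actual code:

```python
def first_timeout(write_times_s, active_s, idle_s):
    """Given write-completion times (seconds since request received), report
    which timeout would fire first. Simplified model of the rules above."""
    last = 0.0
    for t in write_times_s:
        # Idle timer: gap since the previous successful write to the client.
        if t - last > idle_s:
            return ("idle", last + idle_s)
        # Active timer: total wall-clock budget, regardless of byte flow.
        if t > active_s:
            return ("active", active_s)
        last = t
    return None

# Writes at 1 s intervals, then the player stops draining: idle (5 s) fires first.
print(first_timeout([1, 2, 3, 30], active_s=20, idle_s=5))  # → ('idle', 8)
```

The key distinction the model captures: a slowly-but-steadily draining client never trips the idle timer, yet still hits the active budget eventually.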
The session card has a collapsible Bitrate Chart that stacks an events timeline + up to three time-series charts, all sharing a 10-minute rolling window and unified zoom/pan. Legend entries toggle series, scroll zooms, drag pans, ⏸ pauses live updates. The four panels share an x-axis so a bandwidth dip lines up visually with its buffer / FPS / variant-shift impact.
- Events timeline (top) — swim-lane visualization of what the player and server are doing right now:
  - PLAYER variants — one lane per ladder rung (e.g. `1920×1080:7.1Mbps`, `1280×720:3.5Mbps`). Coloured blocks show which variant the player was on at each moment. The variant shift on a throughput collapse is visible as a downstep across lanes.
  - DISPLAY RES — the actual rendered resolution per moment (green = matches the highest available, red = downstepped).
  - PLAYER STATE — `playing` / `paused` / `stalled` lane.
  - PLAYBACK + IMPAIRMENT — markers for restarts, stalls, error events, fault windows.
  - SERVER LOOP — content-loop boundaries from go-live, so wraparound effects line up with player behaviour.
- Bitrate chart — up to ten series:
  - Server metrics: `mbps_shaper_rate` (100 ms), `mbps_shaper_avg` (6 s rolling), `mbps_transfer_rate` (250 ms, byte-gated), `mbps_transfer_complete` (per segment). See the Metrics reference below.
  - Player metrics: `Player avg_network_bitrate` (averaged ABR bandwidth estimate — iOS `observedBitrate`, Android `DefaultBandwidthMeter`), `Player network_bitrate` (short-window instantaneous throughput — iOS only, from LocalHTTPProxy wire-byte accounting), `Rendition` (bitrate of the current playing variant).
  - Reference lines: `Limit` (shaping ceiling, stepped when a pattern is active), `Server Rendition` (what the server believes it delivered), one line per ladder `Variant` (hidden by default).
  - Events: `STALL` and `RESTART` markers annotate player stalls and restarts.
  - Y-axis: `Auto` or fixed `5 / 10 / 20 / 30 / 40 / 50 / 100` Mbps — pin the scale when comparing two sessions side by side.
- Buffer depth chart — player `buffered` TimeRanges (`player_metrics_buffer_depth_s`) on the left axis; Wall-Clock Offset (player playhead vs encoder PDT) on the right axis.
- FPS chart — rendered and dropped frames/s from `player_metrics_frames_displayed` / `_dropped_frames`, 2 s sliding window, exponential smoothing (α = 0.15). Series: `FPS (smoothed)`, `Low FPS` (red below threshold), `FPS Baseline` (75th percentile), `Low Threshold` (0.75 × baseline), `Dropped Frames/s` (right Y-axis).
The four panels' plot areas all align on the same right edge so vertical x-axis ticks line up across every chart and the events timeline above.
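The FPS chart's exponential smoothing (α = 0.15) can be sketched as follows; the dashboard's exact windowing may differ, this only shows the smoothing step itself:

```python
def smooth_fps(samples, alpha=0.15):
    """Exponential smoothing as described for the FPS chart (alpha = 0.15).
    Seeded with the first sample so the series starts at the measured rate."""
    out, ema = [], samples[0]
    for s in samples:
        ema = alpha * s + (1 - alpha) * ema
        out.append(round(ema, 2))
    return out

# A one-frame dip barely moves the smoothed line; stutter shows up in
# Dropped Frames/s (and the Low FPS threshold) before the smoothed series.
print(smooth_fps([60, 60, 30, 60, 60]))
```

This is why the chart pairs the smoothed series with a 0.75 × baseline threshold: the EMA alone hides short stutters.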
Every request the proxy serves to the player is captured as a HAR-format network log entry. The dashboard renders them in a Chrome DevTools-style waterfall with a brushable overview pane: the brush at the top shows the full session at small scale; drag it to pick a time window, and the bars in the main view position themselves within that window.
Each row is `time | flags | method | path | bytes | Mbps | status | bar`. The flags column is the diagnostic disambiguator — when a row's status code can't tell the whole story, the glyph does:
| Glyph | What it means | Status code on the row |
|---|---|---|
| `!` | HTTP fault (404 / 500 / 504 / etc.) — the wire response was the error | matches the fault (404, 500, 504, 429, …) |
| `!✂` | Socket fault inject — proxy deliberately tore down the connection (`request_body_*` / `request_first_byte_*` / `request_connect_*`) | 200 (chunked headers went out) or — (connect-time abort) |
| `!⏱` | Server transfer timeout — proxy enforced active or idle timeout | upstream's status (typ. 200) for mid-body, 504 if pre-headers |
| `!↩` | Client gave up — player aborted mid-transfer (broken pipe / `ECONNRESET`) | upstream's status (typ. 200) |
| `↻` | Quick retry — same URL re-fetched faster than half the segment duration | — |
A row that looks like a successful 200 download but has `!↩` or `!⏱` next to its method tells you the player abandoned it or the proxy gave up — exactly the signal you'd otherwise miss.
Capture controls:
- Save HAR — manual snapshot. Writes a HAR file to the Incidents browser with `reason=manual`.
- 911 button — every player (iOS / iPadOS / tvOS / Android TV) has a "911" button right of Reload. One tap fires a `user_marked` event the server picks up, which triggers an automatic HAR snapshot of the current session timeline (10-minute clip, includes all plays). Files in the Incidents browser are tagged `reason="user 911"`. Cross-layer "911" log lines on the Apple device console, `adb logcat`, and docker logs make it grep-friendly across all three layers.
- Auto-snapshot — go-proxy detects extended player stalls / segment-stalls and triggers a HAR snapshot automatically with `reason=stall` / `reason=segment_stall`.
The Incidents page on the dashboard browses every saved HAR; click an incident to render its waterfall inline without leaving the page.
Link two or more independent Testing Sessions into a group. Faults and network shaping applied to any group member are applied to all members simultaneously — while everything else stays independent per session. This is the single highest-leverage feature for differential testing: change exactly one variable and watch several targets react to an identical stimulus.
| Propagates to all group members | Stays per-session |
|---|---|
| HTTP fault configuration (segment / manifest / master — type, mode, consecutive, frequency, URL allowlist) | Player engine (HLS.js / Shaka / Video.js / Native) |
| Transport faults (DROP / REJECT, timing) | Stream URL (protocol / codec / segment duration) |
| Network shaping — delay, loss, throughput, pattern mode, step duration, margin | Content-tab manipulations (strip CODEC, allowed variants, live offset, etc.) |
| Shaping patterns (square wave, ramps, pyramid) and their step schedules | Player selector and URL parameters |
Max 10 sessions per group. Endpoints: `POST /api/session-group/link`, `POST /api/session-group/unlink`, `GET /api/session-group/{groupId}` — see `docs/FAULT_INJECTION.md`.
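A linking call might be scripted like this sketch. The `player_ids` field name is an assumption for illustration — check `docs/FAULT_INJECTION.md` for the real payload shape:

```python
import json
from urllib import request

def link_sessions(host, player_ids):
    """Build (not send) a group-link request. The JSON body shape here is a
    guess; only the endpoint path comes from the docs above."""
    assert 2 <= len(player_ids) <= 10, "groups are 2-10 sessions"
    body = json.dumps({"player_ids": player_ids}).encode()
    return request.Request(f"http://{host}:30000/api/session-group/link",
                           data=body, method="POST",
                           headers={"Content-Type": "application/json"})

req = link_sessions("localhost", ["ios-sim-01", "android-rig-03"])
print(req.full_url, req.method)
```

Sending it is one `request.urlopen(req)` away; building the request separately keeps the sketch runnable without a server.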
- Same stream, two live offsets. Session A at `live_offset=6s`, session B at `live_offset=24s`. Apply a 3-second throughput collapse to the group. A rebuffers, B absorbs it — now you know exactly how much headroom the larger offset is buying you.
- iOS vs Android under identical conditions. Open one testing window driving an iOS simulator, another driving the Android player, group them, and drive a `ramp_down` pattern. The Bitrate / Buffer / FPS charts side by side show how each platform's ABR algorithm responds to the same wire-rate curve, with zero variance from network timing.
- HLS.js vs Shaka on the same packet-loss schedule. One click, two engines, identical 2% loss + 80 ms delay — does one rebuffer and the other ride it out?
- H.264 vs HEVC under identical throttle. Same variants on the ladder, different codec, grouped sessions — how does each codec's bitrate-for-quality affect rebuffer frequency when throughput drops below the 720p rung?
- Sparse ladder vs full ladder. Drop rungs with the Content tab's `Allowed variants` allowlist and compare. This matters most when the gaps are in the low or middle of the ladder — a player with only 360p and 4K and no steps in between has nowhere to land when throughput drops through that gap. On a choppy network the ABR logic will overshoot or undershoot, and the player can stall or thrash between the two available rungs instead of glide-downshifting. Group a full-ladder session against a gap-ladder session, drive `ramp_down` with oscillation, and watch the gap-ladder session rebuffer while the full-ladder one rides it out. Critical for validating that your production ladder actually survives poorly-behaved networks, not just smooth downshifts.
- Same player, different Content-tab settings. Identical engine, identical stream, grouped — but A has `Strip CODEC` on and B doesn't. Watch startup time diverge under the same network conditions.
The pattern is always: change one variable per session, apply the same fault or shaping change to all of them, compare charts side by side. The Bitrate / Buffer / FPS charts share a timeline across sessions, so you see effect alignment by eye.
The Testing Session flow isn't limited to the built-in browser players. Any HTTP client — a native iOS/tvOS/Android app, a Roku channel, a set-top-box player, a CI harness — can be the video engine inside a session, get the same faults and shaping applied to it, and optionally stream metrics back into the dashboard's charts.
Inside the dashboard, right-click any tile in Mosaic (Grid) → Open in Testing Window. That builds the session URL:
```
http://<host>:30000/dashboard/testing-session.html?url=<stream-url>&player_id=<uuid>&nav=1
```

For 3rd-party integrations you want the `player_id` and `url` values — those are the only two pieces of state a session needs. Paste the stream URL into your player and use the `player_id` on everything described below.
A native app integrates in four steps, all plain HTTP:
- List content. `GET /api/content` returns the encoded library — content names, protocol flags (`has_hls` / `has_dash`), codec metadata. Stream URLs are not embedded in the response; you build them from the content name.
- Generate a `player_id`. Any stable string unique to your session — a UUID, `"roku_<timestamp>"`, `"android-rig-03"`. Reuse the same id across the session's lifetime.
- Build the stream URL and play it. Append `?player_id=<id>` to the manifest path:

  ```
  http://<host>:30081/go-live/<content>/master.m3u8?player_id=<id>      # HLS LL
  http://<host>:30081/go-live/<content>/master_2s.m3u8?player_id=<id>   # HLS 2s
  http://<host>:30081/go-live/<content>/master_6s.m3u8?player_id=<id>   # HLS 6s
  http://<host>:30081/go-live/<content>/manifest.mpd?player_id=<id>     # LL-DASH
  ```

  The proxy allocates a dedicated session port on first request and responds with `302` to the allocated port (e.g. `30081` → `30281`). Follow the redirect — all subsequent segment and playlist requests must stay on that port so faults and shaping apply. Every HLS.js / Shaka / AVPlayer / ExoPlayer / Roku player handles 302 redirects natively; this is zero work for the client.
- Optional: subscribe to session updates. Open a Server-Sent Events stream at `GET /api/sessions/stream` to receive live updates when the dashboard operator changes faults, shaping, or group membership affecting your session. The iOS app uses this to mirror the operator's actions into its own testing UI.
That's it. The app is now a first-class Testing Session — any operator who opens the dashboard sees the session appear in the session list and can inject faults / shape bandwidth / group it alongside other sessions.
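The URL-building step can be sketched as a small helper. The helper itself is hypothetical; the paths and the shared entry port 30081 come from the list above:

```python
import uuid

def stream_url(host, content, variant, player_id):
    """Append player_id to a manifest path on the shared entry port.
    The proxy's 302 then moves the client to its dedicated session port."""
    paths = {
        "hls_ll": "master.m3u8",
        "hls_2s": "master_2s.m3u8",
        "hls_6s": "master_6s.m3u8",
        "dash_ll": "manifest.mpd",
    }
    return f"http://{host}:30081/go-live/{content}/{paths[variant]}?player_id={player_id}"

pid = f"android-rig-{uuid.uuid4().hex[:8]}"  # any stable string works
print(stream_url("localhost", "demo", "hls_2s", pid))
```

Because every mainstream player follows the 302 automatically, this one function is the entire client-side integration surface for playback.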
To make your 3rd-party player show up on the Bitrate / Buffer / FPS charts alongside the built-in web players, `POST /api/session/{player_id}/metrics` periodically with a `set` payload:
```json
{
  "set": {
    "player_metrics_video_bitrate_mbps": 4.2,
    "player_metrics_avg_network_bitrate_mbps": 5.1,
    "player_metrics_network_bitrate_mbps": 4.9,
    "player_metrics_buffer_depth_s": 18.4,
    "player_metrics_frames_displayed": 12345,
    "player_metrics_dropped_frames": 7,
    "player_metrics_last_event": "playing",
    "player_metrics_loop_count_player": 2,
    "player_metrics_profile_shift_count": 4
  }
}
```

Post cadence: 1–2 Hz is fine. The endpoint doesn't bump the session's control revision (it's observational data), so you won't fight with the operator's configuration changes. Report what your platform can measure — any subset is fine.
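A post body can be assembled like this sketch; the `set` envelope matches the example above, while the helper function itself is hypothetical:

```python
import json

def metrics_payload(**measured):
    """Wrap whatever this platform can measure in the `set` envelope.
    Any subset of the documented fields is fine."""
    return json.dumps({"set": measured})

body = metrics_payload(
    player_metrics_buffer_depth_s=18.4,
    player_metrics_dropped_frames=7,
    player_metrics_last_event="playing",
)
print(body)  # POST this to /api/session/<player_id>/metrics at 1-2 Hz
```

Since the endpoint is observational, a dumb timer loop posting this body is enough; there is no handshake or revision bookkeeping on the client side.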
The full field catalogue and payload reference is in `docs/API.md`.
The iOS, tvOS, Android, and Roku apps in this repo are reference implementations of the pattern above:
| App | Content list | URL build | Session redirect | SSE | Metrics |
|---|---|---|---|---|---|
| `apple/InfiniteStreamPlayer/` — iOS + tvOS | yes (`/api/content`) | `?player_id=UUID` | auto-follow | yes | yes |
| `android/InfiniteStreamPlayer/` | yes (`/api/content`) | `?player_id=UUID` | auto-follow (ExoPlayer) | — | — |
| `roku/InfiniteStreamPlayer/` | yes (`/api/content`) | `?player_id=roku_<ts>` | auto-follow | — | — |
They're all intentionally simple video-consumption clients, each with slightly different ABR characteristics and error-recovery implementations. The web players (HLS.js / Shaka / Video.js) are the simplest to use and very handy for confirming everything is wired up — but they're also the least reliable for stress testing. Most of the validation of the server's content generation is done against iOS and tvOS.
Use any of them as a starting template for a new platform.
| Service | Port | Role |
|---|---|---|
| `go-live` | 8010 | LL-HLS + LL-DASH generator (2s / 6s / LL segment variants) |
| `go-upload` | 8003 | Upload API, encoding job orchestration, content discovery |
| `go-proxy` | 30081 (+ per-session ports) | Failure injection, traffic shaping, SSE session stream |
| `nginx` | 30000 | Routing, static dashboard |
| `memcached` | 11211 | Session state (internal) |
- On-demand: the first request for a piece of content starts a per-content worker.
- Single worker, shared clock: each worker generates all HLS + DASH manifests (LL + 2s + 6s) in sync.
- Low-latency: LL-HLS and LL-DASH update on partial boundaries (default 200 ms).
- Segment variants: 2s and 6s update on their segment boundaries only.
- Sliding window: a fixed-length window (e.g. 36 s) that moves forward and wraps on loop boundaries.
- Auto shutdown: workers stop after an idle timeout when no requests are active.
Host-mounted volume at `/media` inside the container:
- `/media/originals/` — source files (MP4, MOV, etc.)
- `/media/dynamic_content/{content}/` — encoded outputs
- `/media/certs/` — TLS certs (auto-generated if missing)
Three ways to add content:
- Upload via the dashboard. Open Upload Content, pick a file and encoding options. The server writes the source to `/media/originals/` and the encoded ladder to `/media/dynamic_content/`.
- Drop a source file in and encode from the UI. Copy into `$CONTENT_DIR/originals/`, refresh Source Library, and trigger an encode from the UI.
- Drop pre-encoded ladders in directly. If you've already run the pipeline offline (locally or on a build machine), copy the whole `{content}/` directory into `$CONTENT_DIR/dynamic_content/`. It appears in the dashboard immediately — no import step.
To encode outside the dashboard (offline, in CI, or on a build box):
- Run the pipeline locally with `generate_abr/create_abr_ladder.sh`. See `generate_abr/QUICKSTART.md` for common invocations and `generate_abr/HARDWARE_ENCODING_QUICKREF.md` for hardware-accelerated encodes.
- Offload to AWS EC2 spot instances via `docs/CLOUD_ENCODING.md` — the cloud runner produces the same `{content}_h264/` and `{content}_hevc/` directory layout, so the output drops straight into `/media/dynamic_content/`.
HLS:
- `http://localhost:30000/go-live/{content}/master.m3u8` (LL)
- `http://localhost:30000/go-live/{content}/master_2s.m3u8`
- `http://localhost:30000/go-live/{content}/master_6s.m3u8`
DASH:
- `http://localhost:30000/go-live/{content}/manifest.mpd` (LL)
- `http://localhost:30000/go-live/{content}/manifest_2s.mpd`
- `http://localhost:30000/go-live/{content}/manifest_6s.mpd`
Full API (`/api/content`, `/api/jobs`, `/api/sessions/*`, `/api/nftables/*`, etc.) is in `docs/API.md`.
| Metric | Update cadence | What it measures |
|---|---|---|
| `mbps_shaper_rate` | 100 ms | Instantaneous shaped rate during active queue drain (1 s contiguous backlog-active run) |
| `mbps_shaper_avg` | 100 ms | Rolling 6 s average of `mbps_shaper_rate` values |
| `mbps_transfer_rate` | 250 ms | Byte-change-gated rate during segment transfer, aligned to HTB burst edges. Reports at drain/refill boundaries |
| `mbps_transfer_complete` | per segment | Total bytes / total time for one completed segment transfer (backlog drained to 0) |
- Limit value (`nftables` shaping rate): configured ceiling for the session port; a control target, not a measured throughput.
- Shaper metrics (`mbps_shaper_rate`, `mbps_shaper_avg`): from TC class byte counters on the 100 ms `updatePort` loop. Only active when TC shaping is configured (backlog > 0). `shaper_rate` goes to 0 on drain; `shaper_avg` smooths across segments.
- Transfer metrics (`mbps_transfer_rate`, `mbps_transfer_complete`): from the 10 ms `awaitSocketDrain` goroutine. `transfer_rate` aligns to actual TC burst edges (250 ms min gap). `transfer_complete` is the ground-truth per-segment rate.
- Player averaged bandwidth (`player_metrics_avg_network_bitrate_mbps`): player-side averaged ABR estimate; slow-moving, model-based; intended for ladder analysis, initial variant pick, and comparison against the shaper average. Populated by iOS (`AVPlayer` `observedBitrate`), Android (`DefaultBandwidthMeter.bitrateEstimate`), and browser players (HLS.js / Shaka / native) — i.e., the one signal every player can provide.
- Player instantaneous bandwidth (`player_metrics_network_bitrate_mbps`): short-window near-instantaneous wire throughput; reacts quickly to sudden rate drops. Requires per-request wire visibility, so currently iOS-only (via LocalHTTPProxy); null on clients without that plumbing.
- Wall-clock offset (`player_metrics_true_offset_s`): how far behind live the player is, computed server-side and independent of the client's clock. Players post `player_metrics_playhead_wallclock_ms` (the encoder's PDT at the playhead); the server timestamps the receive moment with its own clock and reports the difference: `true_offset_s = (server_received_at_ms − playhead_wallclock_ms) / 1000`. Survives clock skew on the client device, drift between phone NTP and laptop NTP, and any offset the player engine applies. Surfaced as a session-item field, on the buffer-depth chart's right Y-axis, and as the basis for cross-client comparison on the Live Offset page.
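The offset arithmetic is simple enough to sanity-check by hand. A minimal sketch with made-up timestamps (not values from a real session; shell integer math shown for illustration only, the server computes this in Go):

```shell
# Hypothetical timestamps, milliseconds since epoch.
playhead_wallclock_ms=1717171717000   # encoder PDT at the playhead, posted by the player
server_received_at_ms=1717171723000   # server's own clock when the post arrived

# Same formula as above; client clock skew cancels out because both inputs
# are stamped by the encoder and the server, never by the device's clock.
true_offset_s=$(( (server_received_at_ms - playhead_wallclock_ms) / 1000 ))
echo "true_offset_s=${true_offset_s}"   # prints true_offset_s=6
```

Note the device could be minutes off NTP and the result is unchanged, which is the whole point of computing it server-side.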
Under steady conditions, `shaper_rate` and `transfer_rate` track near the configured limit; `transfer_complete` is the most trustworthy single number. The player averaged bandwidth broadly tracks the wire metrics but is smoother; the instantaneous version catches rate drops fastest.
Implementation details (netlink counters, caching, scope of overhead inclusion) are in `docs/ARCHITECTURE.md`.
Driven by `generate_abr/create_abr_ladder.sh` (ffmpeg + Shaka Packager v3.4.2, bundled in the container).
Defaults: segment duration 6 s, partial duration 200 ms, GOP duration 1 s.
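Those defaults imply fixed part and GOP counts per segment; trivial arithmetic, spelled out only to make the relationship explicit:

```shell
# The three default durations from above, in milliseconds.
segment_ms=6000   # segment duration
part_ms=200       # LL-HLS partial duration
gop_ms=1000       # GOP duration

echo "parts per segment: $(( segment_ms / part_ms ))"   # 30 partials per segment
echo "GOPs per segment:  $(( segment_ms / gop_ms ))"    # 6 GOPs per segment
```

Every partial boundary therefore falls inside a GOP, and every segment starts on a GOP boundary, which is what keeps variant switches clean.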
Audio: source audio is normalized to AAC during transcode (always-AAC). Source tracks in non-AAC codecs are re-encoded so every variant on the ladder has a uniform audio layer, eliminating an entire class of "this segment plays on iOS but not Android" debugging.
Synthetic source content: `make test-pattern` generates a 4K test pattern clip you can use as a controlled source — no copyrighted material, deterministic visuals, useful when testing ABR / ladder behaviour without confounding "is this just because of the content?" questions.
See `generate_abr/README.md` for the pipeline, `generate_abr/QUICKSTART.md` for common commands, and `docs/CLOUD_ENCODING.md` for offloading encodes to AWS EC2 spot instances.
Common LL-HLS/LL-DASH features that are not fully implemented:
- Blocking playlist reload (`_HLS_msn`, `_HLS_part`) and skip logic (`_HLS_skip`)
- `#EXT-X-RENDITION-REPORT` and `#EXT-X-PRELOAD-HINT`
- Chunked CMAF transfer for LL-DASH partials
Full list in PRD.md.
Native client apps for device testing:
- iOS / iPadOS / tvOS — SwiftUI app in `apple/InfiniteStreamPlayer/`
- Android TV / Google TV — Jetpack Compose app in `android/InfiniteStreamPlayer/`
- Roku — BrightScript channel in `roku/InfiniteStreamPlayer/README.md`
The Apple and Android TV apps share a cinematic dark UI built around a "Now Playing" hero + LIVE preview tiles, a slide-from-right Settings drawer, and a developer-mode HUD overlay during playback. Roku is intentionally minimal (basic playback channel).
On first launch the app routes to the Server Picker. It auto-lists servers it discovers on the network (rendezvous heartbeat), and offers Pair with code (cross-network) / Scan QR (same-LAN) / Add by URL (manual) as fallbacks. Once a server is saved, subsequent launches go straight to Home.
The hero "NOW PLAYING" tile carries the most recently played stream; the LIVE row below shows live preview thumbnails for every other available stream. On capable hardware the tiles run real LL-HLS previews (capped per the device's decode budget), so you see actual frames, not stale stills. Tapping a tile pushes it into the hero / starts playback.
Toggle Developer mode in Settings → Advanced to overlay a live HUD on the player. It reads the same metrics the dashboard's Bitrate Chart shows — wall-clock timestamp, AVG / PEAK Mbps observed, current codec / resolution / framerate, and which bandwidth meter the player is reporting from (SW = software / OS-reported, JEO = JS engine override). Useful when the dashboard isn't open and you just want to see what the player thinks at 10 feet from the screen.
Slide-from-right drawer lists Server / Stream / Protocol / Segment length / Codec / Advanced. The first row jumps to the Server Picker; the rest are pickers that push in over the main list. Same pattern on iPad and Apple TV (D-pad on tvOS, touch on iPad).
Advanced exposes the per-session toggles that affect playback behaviour: 4K (allow renditions above 1080p), Local Proxy (route through go-proxy for wire-byte metrics), Auto-Recovery (retry on player error), Go Live (snap to live edge), Live offset (seconds behind live), Skip Home on launch (auto-resume), Mute audio, Preview video (live tile decode budget), Developer mode (the HUD above).
The destructive Reset All Settings at the bottom wipes every persisted preference (server list, Advanced flags, playback history) and routes back to the Server Picker — equivalent to a fresh install but without losing the app itself. Available on iOS, iPadOS, tvOS, and Android TV.
The native client apps need a way to find your server URL on first launch — they can't ship a hardcoded address. There are four ways to add a server, each suited to a different network situation:
| Method | When to use | What it needs |
|---|---|---|
| Same-WAN auto-discovery | Phone/TV on the same Wi-Fi as the server | Server announces to the rendezvous; client lists it |
| QR scan (iOS only) | Same-LAN, you have a phone with a camera | Open Server Info on the dashboard, scan the QR |
| Pair with code | Cellular, VPN, hotel Wi-Fi — TV on a different network than the dashboard | Dashboard publishes its URL keyed by a 6-character code shown on the TV |
| Manual entry | Always works as a fallback | Type `host:port` |
A small Cloudflare Worker (`cloudflare/pair-rendezvous/`) acts as a public rendezvous point. Every server with `INFINITE_STREAM_ANNOUNCE_URL` set heartbeats `{server_id, url, label}` to the Worker on boot, again every 12 hours, and on demand when the dashboard's Server Info modal is opened.
Each announce is keyed by the server's public IP (hashed for privacy). When a client app opens its "+ Add server" / "Pair…" screen, it asks the Worker which servers are visible from its public IP. Cloudflare's edge sees both sides' IPs and only returns servers that match — so the list is implicitly scoped to "servers on the same WAN as you".
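The scoping check reduces to "do the two hashed public IPs match". A sketch of that comparison (the hash scheme and truncation here are illustrative, not the Worker's actual implementation):

```shell
# Illustrative only: the rendezvous keys each announce by a hash of the
# server's public IP, and answers a client query by hashing the client's
# public IP and returning only the servers whose key matches.
hash_ip() { printf '%s' "$1" | sha256sum | cut -c1-16; }

server_wan_ip="203.0.113.42"   # as seen by Cloudflare when the server announces
client_wan_ip="203.0.113.42"   # as seen by Cloudflare when the app queries

if [ "$(hash_ip "$server_wan_ip")" = "$(hash_ip "$client_wan_ip")" ]; then
  echo "same WAN: include server in discovery list"
else
  echo "different WAN: hide server"
fi
```

Hashing means the Worker never stores raw IPs; the match still works because both sides are hashed the same way.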
Why not Bonjour/mDNS? The obvious LAN-discovery answer is mDNS (`_infinitestream._tcp`), and we tried it. It doesn't work when the server runs inside a Docker container: Docker's default bridge network filters multicast, so the advertisement never reaches the host's network. Even host-network mode is fragile across Linux/macOS/Windows. Cloudflare's same-public-IP check gives us the same "show me servers on my network" answer without depending on multicast working through the container.
Camera-less TVs (Apple TV, Android TV) can't scan a QR. Instead the TV shows a 6-character code; the user types it into the Pair with code widget on the dashboard, which publishes the dashboard's URL keyed by that code. The TV polls the rendezvous for that code and picks up the URL within a couple of seconds. Cross-network pairing only works if the dashboard URL is reachable from the TV (the Server Info modal warns you when the URL looks LAN-only).
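The pair-code handshake can be sketched with a local directory standing in for the Worker's KV (the real clients speak HTTPS to the rendezvous; the URL and code below are placeholders):

```shell
kv=$(mktemp -d)    # stand-in for Workers KV
code="A1B2C3"      # 6-character code displayed on the TV

# Dashboard side: user types the code, dashboard publishes its URL under it.
printf '%s' "http://192.168.1.10:30000" > "$kv/$code"

# TV side: poll for the code until the URL appears (the real app polls the Worker).
for _ in 1 2 3; do
  if [ -f "$kv/$code" ]; then
    server_url=$(cat "$kv/$code")
    break
  fi
  sleep 1
done
echo "paired: $server_url"
```

The code flows TV → human → dashboard, and the URL flows dashboard → rendezvous → TV, so neither device ever needs to reach the other directly during pairing.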
- Cloudflare Workers + KV for the rendezvous. Fits comfortably in the free plan: each server costs ~2 KV writes/day at the default cadence (1,000 writes/day total budget, account-wide). The default Worker is at `https://pair-infinitestream.jeoliver.com` — to self-host, follow `cloudflare/pair-rendezvous/README.md` and override `INFINITE_STREAM_RENDEZVOUS_URL` (server) plus `InfiniteStreamRendezvousURL` (UserDefaults / SharedPreferences on the clients).
Forking? That default URL is the upstream maintainer's personal Worker. Please deploy your own Worker and change the `defaultURL` constant in `apple/.../RendezvousService.swift` and `android/.../RendezvousService.java` (plus the `routes` block in `wrangler.toml`) before shipping builds — otherwise your users hammer someone else's free-tier KV budget.
That's it — no third-party services beyond a Cloudflare account.
| Var | Purpose |
|---|---|
| `INFINITE_STREAM_RENDEZVOUS_URL` | Rendezvous Worker base URL. Required to enable any pairing. |
| `INFINITE_STREAM_ANNOUNCE_URL` | URL that clients should use to reach this server (e.g. `http://lenovo.local:30000`). When set, this server appears in same-WAN auto-discovery. |
| `INFINITE_STREAM_ANNOUNCE_LABEL` | Optional friendly label. Defaults to host:port from the announce URL. |
| `INFINITE_STREAM_SERVER_ID` | Optional explicit announce ID (4–64 chars, `[A-Za-z0-9_-]`). Defaults to a stable random ID persisted at `<data_dir>/server_id`. Set this when multiple deployments share the same data directory (e.g. dev + release pods on the same k3s node); otherwise their announces overwrite each other on the rendezvous. |
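Putting the four together, a hedged example (hostname, label, and ID are placeholders, not values from the repo):

```shell
# Example values only; substitute your own hostnames and IDs.
export INFINITE_STREAM_RENDEZVOUS_URL="https://pair-infinitestream.jeoliver.com"
export INFINITE_STREAM_ANNOUNCE_URL="http://lenovo.local:30000"
export INFINITE_STREAM_ANNOUNCE_LABEL="Dev laptop"
export INFINITE_STREAM_SERVER_ID="dev_lenovo_01"

# Sanity-check the ID against the documented constraint: 4-64 chars of [A-Za-z0-9_-].
printf '%s' "$INFINITE_STREAM_SERVER_ID" | grep -Eq '^[A-Za-z0-9_-]{4,64}$' \
  && echo "server id ok"
```

Pass these through your compose file's `environment:` block or an `.env` file, the same way `CONTENT_DIR` is handled in the Quick start.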
The server defaults to plain HTTP on its dashboard / API / playback ports. That's fine for LAN use and HTTPS-fronted public deployments, but plain HTTP to a public hostname trips the platform-specific cleartext-traffic rules baked into modern OSes:
| Client | Cleartext to LAN (localhost, `*.local`, RFC1918) | Cleartext to public hostname |
|---|---|---|
| iOS / tvOS | ✅ via `NSAllowsLocalNetworking` in Info.plist | ❌ — rejected by App Transport Security unless the domain is in `NSExceptionDomains` |
| Android | ✅ via `usesCleartextTraffic="true"` (currently set) | ✅ same flag covers all hosts |
| Roku | ✅ | ✅ no cleartext restriction |
| Browser dashboard | ✅ | ✅ |
The iOS/tvOS `Info.plist` files in this repo include an explicit `NSExceptionDomains` entry for `infinitestreaming.jeoliver.com` (the upstream maintainer's public domain). If you fork and ship apps that talk to a different public-HTTP hostname (your own server, a Tailscale MagicDNS name like `*.ts.net`, a Tailscale CGNAT IP in `100.64.0.0/10`, etc.) you must add it to both `Info.plist`s or those clients will silently fail to load anything.
The cleaner long-term answer is to terminate TLS at the server so all clients use HTTPS and no per-domain ATS / cleartext exceptions are needed. The k3s manifests already mount a `certs-vol` for this; flipping the nginx template to `listen … ssl` and pointing it at a Let's Encrypt cert (or whichever cert lives in `K3S_CERTS_DIR`) gets you there.
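A sketch of what that flip could look like in the nginx server block (cert paths and port are illustrative, and the repo's actual template will differ):

```nginx
server {
    listen 30000 ssl;                           # was: listen 30000;
    ssl_certificate     /certs/fullchain.pem;   # mounted from certs-vol / K3S_CERTS_DIR
    ssl_certificate_key /certs/privkey.pem;
    # existing dashboard / API / playback routing stays unchanged
}
```

Once every port serves HTTPS, the ATS exception and the Android cleartext flag can both be removed.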
Most users should stick with Docker Compose from the Quick start. These variants are for specific scenarios.
```shell
export CONTENT_DIR=/path/to/your/media
docker run -d --name infinite-streaming \
  --cap-add NET_ADMIN --privileged \
  -p 30000:30000 \
  -p 30081:30081 \
  -p 30181:30181 -p 30281:30281 -p 30381:30381 -p 30481:30481 \
  -p 30581:30581 -p 30681:30681 -p 30781:30781 -p 30881:30881 \
  -v $CONTENT_DIR:/media \
  ghcr.io/jonathaneoliver/infinite-streaming:latest \
  /sbin/launch.sh 1
```

Ports 30181–30881 are the per-session proxy ports that testing sessions get redirected to. Without mapping them, testing-session.html works but segments never load because the allocated session port is unreachable from the host.
macOS / Docker Desktop note: Network shaping (TC/nftables) works on Docker Desktop for Mac with `--cap-add NET_ADMIN`, but the TC stats polling (every 100 ms per session) spawns processes through the Linux VM layer, which causes significantly higher CPU usage and fan noise compared to native Linux. This is Docker Desktop VM overhead, not a code issue. For sustained testing with shaping, use a native Linux host.
```shell
mkdir infinite-streaming && cd infinite-streaming
curl -fsSL https://raw.githubusercontent.com/jonathaneoliver/infinite-streaming/main/docker-compose.ghcr.yml \
  -o docker-compose.yml
echo "CONTENT_DIR=/path/to/your/media" > .env
docker compose up -d
```

See `docs/DEPLOYMENT.md` for running in a k3s cluster (release + dev side by side), pinning immutable release tags, and configuring GHCR publishing from a fork.
Captured from the live dashboard; files live in `docs/screenshots/`.
Reference:
- `docs/ARCHITECTURE.md` — services, routing, port map, request flow
- `docs/API.md` — HTTP endpoints across go-live, go-upload, go-proxy
- `docs/FAULT_INJECTION.md` — full fault and shaping reference
- `docs/DEPLOYMENT.md` — k3s, release tagging, GHCR publishing
- `docs/TROUBLESHOOTING.md` — common issues and fixes
- `PRD.md` — product behavior source of truth
- `CONTRIBUTING.md` — development workflow
Encoding:
- `generate_abr/README.md`, `generate_abr/QUICKSTART.md`
- `docs/CLOUD_ENCODING.md` — AWS EC2 spot offload
- `generate_abr/HARDWARE_ENCODING_QUICKREF.md`
- `generate_abr/PACKAGER_COMPARISON.md`, `generate_abr/DASH_PACKAGING_COMPARISON.md`
Subsystems:
- `go-live/IMPLEMENTATION_SUMMARY.md`, `go-live/PLAN.md`
- `tests/integration/README.md`, `tests/integration/PLAYER_CHARACTERIZATION_PYTEST.md`
This project is primarily an AI No-Code build. The Go services and web dashboard were generated using Codex / OpenCode, GitHub Copilot, and Claude Code, with human direction and iterative testing.
See LICENSE for attribution, internal-use, and redistribution terms.