whisplay-talk

Chinese version: README_CN.md

A P2P voice intercom app for the Whisplay HAT, designed for real-time voice broadcasting between multiple Whisplay devices.

Core flow:

Runs as a whisplay-daemon app
Discovers online devices through Tailscale MagicDNS, using the whisplay-talk- hostname prefix
While one device holds the talk button, microphone audio is compressed and streamed to all online peers over TCP
Other devices play the audio in real time, highlight the active speaker, and show a receive icon in the status box
While idle, the screen shows the device list with online state and heartbeat latency

Screenshots

Interface Overview

Header: Shows the WhisplayTalk title plus VPN, Wi-Fi signal, and battery status icons
Status card: Shows the current app state, the local device name, and a talk icon on the right while receiving audio
Device list: Keeps showing the peer list even while talking or receiving, with online / offline markers and heartbeat latency such as kitchen (42ms)
Active speaker highlight: Highlights the device currently speaking in yellow
Footer: Shows the current action hint such as Hold button to talk, Release to stop, or Listening...

Current Implementation

The project currently uses the following design:

Discovery: Polls tailscale status --json for devices whose hostname starts with whisplay-talk-, then probes the app TCP port on each device before marking it online and recording heartbeat latency
Transport: All devices listen on fixed TCP port 24680 for audio streams
Audio: Uses arecord / aplay, with default 16kHz / 16-bit / mono capture, Opus voice encoding, a small receive jitter buffer, and one-frame redundant resend
Display: Uses Pillow to render a 240x280 UI into the framebuffer provided by whisplay-daemon, including header VPN / Wi-Fi / battery icons and a live peer list
Input: Uses whisplay-daemon button events for push-to-talk
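
The discovery step described above can be sketched roughly as follows. This is only an illustration, not the project's actual code: the function names are hypothetical, and it assumes the `tailscale status --json` output has a top-level `Peer` mapping whose entries carry `DNSName` and `HostName` fields, with the first DNS label taken as the MagicDNS name.

```python
import socket
import time

PREFIX = "whisplay-talk-"  # default prefix from this README
PORT = 24680               # default TCP port from this README


def peer_name(peer: dict) -> str:
    """First label of the MagicDNS name, falling back to the OS hostname (assumed fields)."""
    dns_label = (peer.get("DNSName") or "").split(".")[0]
    return dns_label or peer.get("HostName", "")


def talk_peers(status: dict, prefix: str = PREFIX) -> list:
    """Return the sorted names of peers whose name starts with the prefix."""
    peers = status.get("Peer") or {}  # "Peer" may be null when there are no peers
    names = (peer_name(p) for p in peers.values())
    return sorted(n for n in names if n.startswith(prefix))


def probe_latency_ms(host: str, port: int = PORT, timeout_ms: int = 3000):
    """Open a TCP connection; return the connect time in ms, or None on failure."""
    start = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout_ms / 1000):
            return (time.monotonic() - start) * 1000.0
    except OSError:  # refused, unreachable, or timed out
        return None
```

A caller would parse the JSON output once per poll, pass the resulting dict to `talk_peers`, and then call `probe_latency_ms` for each name; a `None` result counts as one failed heartbeat.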

Project Structure

whisplay-talk/
├── main.py
├── application.py
├── config.py
├── audio/
├── display/
├── hardware/
├── network/
├── install.sh
├── run.sh
├── startup.sh
├── requirements.txt
└── .env.template

Installation

git clone <this-repo>
cd whisplay-talk
bash install.sh

install.sh will:

Install Python / ALSA utils / curl / libopus0
Create a venv
Install Pillow and python-dotenv
Download the NotoSansSC-Bold.ttf font
Auto-register the app if whisplay-daemon is detected

Tailscale Setup

Every device must join the same Tailscale tailnet before whisplay-talk can discover peers.

Install Tailscale on Raspberry Pi:

curl -fsSL https://tailscale.com/install.sh | sh
sudo tailscale up

After sudo tailscale up, open the login URL shown in the terminal and complete the device login in your browser.

You can verify the connection with:

tailscale status

If Tailscale is not installed, not logged in, or not running, the app will show a matching reminder on screen.
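
One way to tell these three situations apart is sketched below. It is an assumption-laden illustration rather than the app's real logic: the function names and return strings are hypothetical, and it relies on the `BackendState` field of `tailscale status --json`, treating `Running` as healthy and `NeedsLogin` as logged out.

```python
import json
import shutil
import subprocess
from typing import Optional


def classify_tailscale(installed: bool, status: Optional[dict]) -> str:
    """Map the install flag and parsed status JSON to a reminder key (hypothetical names)."""
    if not installed:
        return "not_installed"
    if status is None:  # the CLI failed or produced no usable JSON
        return "not_running"
    state = status.get("BackendState", "")
    if state == "Running":
        return "ok"
    if state == "NeedsLogin":
        return "not_logged_in"
    return "not_running"  # e.g. Stopped or Starting


def read_tailscale_state() -> str:
    """Run the tailscale CLI, if present, and classify the result."""
    path = shutil.which("tailscale")
    if path is None:
        return classify_tailscale(False, None)
    try:
        result = subprocess.run(
            [path, "status", "--json"],
            capture_output=True, text=True, timeout=5, check=True,
        )
        return classify_tailscale(True, json.loads(result.stdout))
    except (subprocess.SubprocessError, OSError, json.JSONDecodeError):
        return classify_tailscale(True, None)
```

Keeping the classification separate from the subprocess call makes it possible to test the reminder logic without Tailscale installed.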

Configuration

First copy the config file:

cp .env.template .env

Important settings:

WHISPLAY_TALK_DEVICE_PREFIX: Default whisplay-talk-.
WHISPLAY_TALK_DEVICE_NAME: If empty, the system hostname is used. If the name does not already start with the prefix, the prefix is added automatically.
WHISPLAY_TALK_TCP_PORT: Default 24680.
WHISPLAY_TALK_APP_HEARTBEAT_TIMEOUT_MS: Default 3000. Timeout for peer online probing and latency measurement.
WHISPLAY_TALK_APP_HEARTBEAT_FAILS_BEFORE_OFFLINE: Default 5. Number of consecutive failed heartbeat probes allowed before a peer is marked offline.
ALSA_INPUT_DEVICE: Recording device. If unset, the app auto-detects whisplaysound, wm8960soundcard, or compatible Whisplay cards before falling back to default.
ALSA_OUTPUT_DEVICE: Playback device. If unset, the app auto-detects whisplaysound, wm8960soundcard, or compatible Whisplay cards before falling back to default.
AUDIO_CODEC: Default opus. Recommended for the current real-time talkback.
AUDIO_FRAME_MS: Default 40. Longer frames lower the packet rate, which usually helps continuity on weaker links.
AUDIO_REDUNDANCY_FRAMES: Default 1. Resends the previous compressed frame to help recover a single lost packet.
AUDIO_OPUS_BITRATE: Default 16000. Tuned for mono intercom voice, favoring continuity.
AUDIO_OPUS_COMPLEXITY: Default 6. Still light enough for a Raspberry Pi while improving encode quality a bit.
AUDIO_OPUS_PACKET_LOSS_PERC: Default 15. Tells the Opus encoder how much network loss to expect.
AUDIO_OPUS_ENABLE_FEC: Default 1. Enables Opus in-band forward error correction.
WHISPLAY_TALK_RECEIVE_PREBUFFER_FRAMES: Default 24. Roughly one second of audio at the current 40ms Opus frame size (24 × 40 ms = 960 ms).
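
To show how a few of these settings relate to each other, here is a minimal sketch of reading them from the environment and deriving the buffer sizes they imply. The `TalkSettings` class and `load_settings` function are hypothetical, not names from config.py, and the sketch assumes the values have already been loaded into the environment (for example by python-dotenv).

```python
import os
from dataclasses import dataclass

SAMPLE_RATE_HZ = 16000  # default capture rate from this README
BYTES_PER_SAMPLE = 2    # 16-bit mono


@dataclass(frozen=True)
class TalkSettings:
    tcp_port: int
    frame_ms: int
    prebuffer_frames: int
    heartbeat_timeout_ms: int
    heartbeat_fails_before_offline: int

    @property
    def prebuffer_ms(self) -> int:
        """Audio held back before playback starts."""
        return self.frame_ms * self.prebuffer_frames

    @property
    def pcm_bytes_per_frame(self) -> int:
        """Raw PCM bytes in one frame before Opus encoding."""
        return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * self.frame_ms // 1000


def _int(env, key: str, default: int) -> int:
    """Read an integer setting, treating a missing or empty value as the default."""
    raw = env.get(key, "").strip()
    return int(raw) if raw else default


def load_settings(env=None) -> TalkSettings:
    env = os.environ if env is None else env
    return TalkSettings(
        tcp_port=_int(env, "WHISPLAY_TALK_TCP_PORT", 24680),
        frame_ms=_int(env, "AUDIO_FRAME_MS", 40),
        prebuffer_frames=_int(env, "WHISPLAY_TALK_RECEIVE_PREBUFFER_FRAMES", 24),
        heartbeat_timeout_ms=_int(env, "WHISPLAY_TALK_APP_HEARTBEAT_TIMEOUT_MS", 3000),
        heartbeat_fails_before_offline=_int(
            env, "WHISPLAY_TALK_APP_HEARTBEAT_FAILS_BEFORE_OFFLINE", 5
        ),
    )
```

With the defaults, `load_settings({}).prebuffer_ms` is 960 and `pcm_bytes_per_frame` is 1280; halving AUDIO_FRAME_MS to 20 without changing the prebuffer frame count halves the prebuffer to 480 ms.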

Device Naming

Peer discovery is based on Tailscale MagicDNS hostnames. Devices are only considered talk peers when their hostname starts with whisplay-talk-.

Recommended naming pattern:

whisplay-talk-kitchen
whisplay-talk-room1
whisplay-talk-office

The UI strips the whisplay-talk- prefix when showing device names, so whisplay-talk-kitchen is displayed as kitchen.
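
The prefix handling described here and in the WHISPLAY_TALK_DEVICE_NAME setting amounts to two small string operations. The helper names below are hypothetical and the sketch makes no claim about how the app normalizes case or whitespace beyond trimming.

```python
import socket

PREFIX = "whisplay-talk-"  # default prefix from this README


def normalize_device_name(name: str = "", prefix: str = PREFIX) -> str:
    """Use the system hostname when the name is empty, then ensure the prefix."""
    name = name.strip() or socket.gethostname()
    return name if name.startswith(prefix) else prefix + name


def display_name(hostname: str, prefix: str = PREFIX) -> str:
    """Strip the prefix for on-screen display; other names pass through unchanged."""
    return hostname[len(prefix):] if hostname.startswith(prefix) else hostname
```

For example, `normalize_device_name("kitchen")` gives `whisplay-talk-kitchen`, and `display_name("whisplay-talk-kitchen")` gives `kitchen`.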

The recommended way to rename devices is directly in the Tailscale admin console, by editing each device name to match the whisplay-talk-<name> pattern.

For example:

whisplay-talk-kitchen
whisplay-talk-room1

After renaming a device in Tailscale, wait for the updated MagicDNS name to propagate to peers.

WHISPLAY_TALK_DEVICE_NAME can still be used as a local override when needed.

Also make sure all devices have joined the same Tailscale tailnet.

Run

bash run.sh

If whisplay-daemon is running on the system, it is recommended to launch Talk from the daemon app list.

For systems that do not use whisplay-daemon, you can configure boot startup with:

bash startup.sh

startup.sh installs a systemd service for this app. If it detects whisplay-daemon, it exits without making changes.

Interaction

While idle: The screen shows the device list, including self, online / offline markers, and peer heartbeat latency
If Tailscale is not installed: The screen shows an install reminder
If Tailscale is installed but not logged in or not running: The screen shows the matching login/start hint
While holding the button: The local device enters Speaking and stops local playback to avoid echo
After releasing the button: Sending stops and an end packet is broadcast
When remote audio is received: The device enters Receiving, plays audio, shows who is speaking, and displays the talk icon on the right side of the status box
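
The talk states above can be read as a small state machine. The sketch below is one possible reading, not the app's actual code: the event names are hypothetical, the footer mapping pairs each state with the hints listed under Interface Overview, and ignoring remote audio while Speaking is an assumption based on local playback being stopped.

```python
from enum import Enum


class State(Enum):
    IDLE = "Idle"
    SPEAKING = "Speaking"
    RECEIVING = "Receiving"


# Footer hints from the Interface Overview, paired with states by assumption.
FOOTER_HINT = {
    State.IDLE: "Hold button to talk",
    State.SPEAKING: "Release to stop",
    State.RECEIVING: "Listening...",
}


def next_state(state: State, event: str) -> State:
    """Return the state after an event (event names are hypothetical)."""
    if event == "button_down":
        return State.SPEAKING  # talking takes over; local playback stops
    if event == "button_up":
        return State.IDLE if state is State.SPEAKING else state
    if event == "remote_start":
        # Assumption: remote audio is not played while the local user is speaking.
        return State.RECEIVING if state is State.IDLE else state
    if event == "remote_end":
        return State.IDLE if state is State.RECEIVING else state
    raise ValueError(f"unknown event: {event}")
```

Because there is no arbitration between devices, a `remote_start` that arrives while the local device is Speaking is simply ignored in this sketch.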

Stream Packet Format

The current implementation uses a small custom packet header over a TCP stream:

magic: WT01
type: 1
flags: 1 = start, 2 = end
sender name
stream id
sequence
codec id
compressed audio payload, typically Opus
optional redundant payload for the previous frame

This makes it easy to evolve later toward:

Unicast priority
Push-to-talk arbitration
Half-duplex / full-duplex strategy
Stronger packet loss handling

Known Limits

This is still an MVP, so a few practical limitations remain:

The transport is still a custom TCP framing layer, not a standard voice/media protocol stack
There is no explicit channel lock or arbitration yet; overlapping talk attempts are not coordinated
Peer identity is still derived from the Tailscale hostname prefix, not from a separate nickname or contact system
The best experience still assumes whisplay-daemon. startup.sh only helps boot the app on systems without the daemon; it does not recreate the daemon UI/runtime model

License

This project is licensed under the GPL-3.0 license. See LICENSE.
