Aloud Text To Speech Obsidian Plugin

Highlight and speak text from your Obsidian notes. Converts text to audio using lifelike voices from various providers.

tts-interaction-audio.mp4

Just add your API key from a supported provider. Choose from available voices.

Supported TTS Models:

OpenAI: (e.g., tts-1, tts-1-hd, gpt-4o-mini-tts). OpenAI charges $0.015 per 1,000 characters for audio.
Google Gemini: (Gemini 2.5 series)
Hume AI: (Hume voices with customization)
ElevenLabs: (Model selection, voice selection, stability/similarity options)
Azure Speech Services: (Region, voice and output format selection)
MiniMax: (speech-2.5-hd-preview, speech-2.5-turbo-preview, speech-02-hd, speech-02-turbo, speech-01-hd, speech-01-turbo)
AWS Polly: (Region, voice, neural/standard engine, output format)
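
To get a feel for what a note costs to read aloud, the per-character rate quoted above for OpenAI can be turned into a rough estimate. This is a minimal sketch, not part of the plugin: `estimateCostUsd` and `USD_PER_1K_CHARS` are hypothetical names, and the provider may count characters differently from JavaScript's `string.length`.

```typescript
// Rate taken from the text above: $0.015 per 1,000 characters.
// Assumption: this rate applies to the model you have selected.
const USD_PER_1K_CHARS = 0.015;

// Hypothetical helper: estimates the cost of converting `text` to audio.
// Uses string.length (UTF-16 code units) as an approximation of the
// provider's character count.
function estimateCostUsd(
  text: string,
  usdPer1kChars: number = USD_PER_1K_CHARS,
): number {
  return (text.length / 1000) * usdPer1kChars;
}

// A 20,000-character note costs roughly 30 cents at this rate.
const sampleNote = "a".repeat(20000);
console.log(estimateCostUsd(sampleNote).toFixed(2)); // prints "0.30"
```

Cached audio is not billed again, so repeated playback of the same text should cost less than this estimate suggests.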

You can also configure a custom API endpoint if you have an OpenAI-compatible API server that exposes a /v1/audio/speech endpoint, for example openedai-speech.

Features:

Visual Feedback: Active sentence is highlighted and updated as playback progresses.

Listen immediately: Audio is streamed sentence-by-sentence. Jump back and forth by skipping by sentence.
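
Sentence-by-sentence streaming implies splitting the text into sentences and tracking which one is playing. The following is a simplified sketch of that idea, not the plugin's actual implementation: `splitSentences` and `skip` are hypothetical names, and the regular expression is naive (it will split on abbreviations such as "e.g.").

```typescript
// Hypothetical helper: splits text into sentences on ., !, or ?,
// keeping any closing quotes or brackets that follow the terminator.
// A trailing run without a terminator is kept as a final sentence.
function splitSentences(text: string): string[] {
  const matches = text.match(/[^.!?]+[.!?]+["')\]]*|[^.!?]+$/g) || [];
  return matches.map((s) => s.trim()).filter((s) => s.length > 0);
}

// Hypothetical helper: moves the playback position by `delta` sentences,
// clamped to the valid range [0, total - 1].
function skip(current: number, delta: number, total: number): number {
  const last = Math.max(total - 1, 0);
  return Math.min(Math.max(current + delta, 0), last);
}

const sentences = splitSentences("Hello world. How are you? Fine");
console.log(sentences); // [ 'Hello world.', 'How are you?', 'Fine' ]
console.log(skip(0, 1, sentences.length)); // 1
```

Each sentence can then be requested and played independently, which is what lets playback start before the whole selection has been converted.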

Variable Speeds: Playback rate is adjusted on the device for improved audio quality.

Caching: Audio is cached to reduce costs, and expired audio is removed automatically. Cache duration is configurable. Audio can be cached locally on the device or in a vault directory.
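
The caching behaviour described above amounts to keying audio by what was requested and dropping entries older than the configured duration. This is an illustrative sketch under assumptions: `CacheEntry`, `cacheKey`, and `pruneExpired` are hypothetical names, and the plugin's real cache format and key scheme may differ.

```typescript
// Hypothetical cache record: what was requested and when it was stored.
interface CacheEntry {
  key: string;
  createdAtMs: number;
}

// Hypothetical key: the same model, voice, and text always map to the
// same key, so repeated playback can reuse stored audio.
function cacheKey(model: string, voice: string, text: string): string {
  return JSON.stringify([model, voice, text]);
}

// Returns only the entries that are still within the configured duration.
function pruneExpired(
  entries: CacheEntry[],
  nowMs: number,
  maxAgeMs: number,
): CacheEntry[] {
  return entries.filter((e) => nowMs - e.createdAtMs <= maxAgeMs);
}

const entries: CacheEntry[] = [
  { key: cacheKey("tts-1", "alloy", "Old sentence."), createdAtMs: 0 },
  { key: cacheKey("tts-1", "alloy", "New sentence."), createdAtMs: 9000 },
];
// With a 5-second duration at t = 10 s, only the newer entry survives.
console.log(pruneExpired(entries, 10000, 5000).length); // 1
```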

Export and Embed Audio: Export audio files from a selection, or embed audio by pasting text from your clipboard.

Play text from anywhere: Many commands are available, including playing text to speech directly from your clipboard.

OS Integration: Keeps playing on mobile while your phone is locked. Pause and play with OS media controls on desktop.

Alternate TTS Models

You can also run alternate models if you have an OpenAI‑compatible API server that exposes /v1/audio/speech (for example, openedai-speech). Configure the URL and API key in the plugin settings under “OpenAI Compatible (Advanced)”.
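
For reference, a request to an OpenAI-compatible /v1/audio/speech endpoint is a JSON POST with a bearer token. The sketch below only builds the request so you can check that a server accepts it; it is not the plugin's code. `buildSpeechRequest` is a hypothetical helper, and the URL, API key, model, and voice shown are placeholder values that depend on your server.

```typescript
// Shape of the request this sketch produces; it can be passed to fetch().
interface SpeechRequest {
  url: string;
  init: {
    method: string;
    headers: Record<string, string>;
    body: string;
  };
}

// Hypothetical helper: builds a POST to <baseUrl>/v1/audio/speech.
// The body fields (model, voice, input) follow the OpenAI speech API.
function buildSpeechRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  voice: string,
  input: string,
): SpeechRequest {
  // Strip trailing slashes so the path is not doubled.
  const url = baseUrl.replace(/\/+$/, "") + "/v1/audio/speech";
  return {
    url,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model, voice, input }),
    },
  };
}

// Placeholder values; substitute your own server URL and key.
const req = buildSpeechRequest(
  "http://localhost:8000/",
  "sk-example",
  "tts-1",
  "alloy",
  "Hello from Obsidian.",
);
console.log(req.url); // http://localhost:8000/v1/audio/speech
// To send it: const res = await fetch(req.url, req.init);
```

If the server responds with audio bytes to a request of this shape, it should work with the "OpenAI Compatible (Advanced)" setting.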

adrianlyjak/obsidian-aloud-tts
