Date: 2025-10-03
Version: 0.3.5
- Comprehensive code review completed (see CODE_REVIEW.md)
- 61 tests passing, 6 test suites
- Zero security vulnerabilities
- Zero TODO/FIXME/HACK comments
- TypeScript strict mode
- Clean architecture
- Fully MCP compliant
- Zero stdout/stderr pollution
- All 6 tools implemented
- Resource access working
- Tested in Cursor ✅
- Tested in Claude Desktop ✅
- README.md comprehensive
- GETTING_STARTED.md for beginners
- MCP_SETUP.md for MCP integration
- CLAUDE_DESKTOP_SETUP.md for Claude Desktop
- INSTALL.md for detailed installation
- CHANGELOG.md with all versions
- LICENSE (MIT)
- Published to npm as
audio-transcription-mcp - Works with npx (zero installation)
- Semantic versioning
- Keywords for discoverability
- Repository linked
- prepublishOnly script ensures quality
- Code pushed to GitHub
- Add badges to README:
[](https://badge.fury.io/js/audio-transcription-mcp) [](https://github.com/pmerwin/audio-transcription-mcp/actions) [](https://opensource.org/licenses/MIT)
- Enable GitHub Discussions
- Add topics:
mcp,audio,transcription,whisper,openai,claude,cursor
- Submit to official MCP servers registry
- Repository: https://github.com/modelcontextprotocol/servers
- Create PR adding your server to the list
🎉 Just released audio-transcription-mcp - Real-time audio transcription for @cursor and @anthropicAI Claude Desktop!
✨ Features:
- Works with OpenAI Whisper
- Zero installation (uses npx)
- Complete session isolation
- 61 tests, production ready
Try it: npx audio-transcription-mcp
#MCP #AI #OpenAI
Title: [Release] Audio Transcription MCP Server - Real-time transcription for Cursor & Claude Desktop
Body:
I've built an MCP server that brings real-time audio transcription to Cursor and Claude Desktop using OpenAI's Whisper API.
**Features:**
- 🎙️ Real-time system audio capture (macOS)
- 📝 Timestamped markdown transcripts
- 🔒 Complete session isolation for privacy
- ✅ 61 passing tests, zero vulnerabilities
- 🚀 Works with npx (no installation!)
**Quick Start:**
Just add to your MCP config and restart:
```json
{
"command": "npx",
"args": ["-y", "audio-transcription-mcp"],
"env": {
"OPENAI_API_KEY": "your-key",
"INPUT_DEVICE_NAME": "BlackHole"
}
}
GitHub: https://github.com/pmerwin/audio-transcription-mcp npm: https://www.npmjs.com/package/audio-transcription-mcp
Feedback welcome!
#### Hacker News (Show HN)
Title: Show HN: Audio Transcription MCP Server for Cursor and Claude Desktop
Text: I built an MCP server that adds real-time audio transcription to Cursor and Claude Desktop using OpenAI Whisper.
It captures system audio (via BlackHole on macOS), transcribes it in configurable chunks, and saves timestamped markdown transcripts. Each session gets a unique file for privacy.
The interesting technical challenge was avoiding stdout/stderr contamination - MCP uses stdio for JSON-RPC, so any console output corrupts the protocol. We route all logging to a debug file instead.
Installation is just npx audio-transcription-mcp - no npm install needed. It's well-tested (61 tests) and works in production.
GitHub: https://github.com/pmerwin/audio-transcription-mcp
Happy to answer questions about building MCP servers or the implementation!
#### Dev.to / Medium Blog Post
Title: Building an MCP Server: Real-time Audio Transcription for AI Assistants
Outline:
- Introduction to Model Context Protocol
- Architecture: Clean separation of concerns
- The stdout/stderr Challenge (critical for MCP)
- Testing Strategy (61 tests, session isolation)
- Supporting Multiple Platforms (Cursor vs Claude Desktop)
- Publishing to npm
- Lessons Learned
- Future Plans
### 4. Documentation Additions
Consider adding:
- [ ] CONTRIBUTING.md (how to contribute)
- [ ] SECURITY.md (security policy)
- [ ] GitHub Actions CI/CD workflow
- [ ] Video demo / GIF for README
### 5. Future Enhancements (Optional)
From CODE_REVIEW.md:
- [ ] Update dependencies (dotenv v17, consider OpenAI SDK v6)
- [ ] Add `.editorconfig` for consistent formatting
- [ ] Consider JSON schema for config validation
- [ ] Windows support (WASAPI)
- [ ] Linux support (ALSA/PulseAudio)
---
## 📊 Current Metrics
- **npm downloads:** Track at https://npmtrends.com/audio-transcription-mcp
- **GitHub stars:** Monitor repository
- **Issues opened:** Community engagement indicator
- **Test pass rate:** 100% (61/61)
- **Security vulnerabilities:** 0
---
## 🎯 Success Criteria
### Week 1
- [ ] 100+ npm downloads
- [ ] 10+ GitHub stars
- [ ] Listed in MCP servers directory
- [ ] At least 1 community PR or issue
### Month 1
- [ ] 500+ npm downloads
- [ ] 50+ GitHub stars
- [ ] Active community discussions
- [ ] Blog post published
### Long Term
- [ ] 1000+ npm downloads/month
- [ ] Active contributors
- [ ] Featured in MCP showcase
- [ ] Conference talk opportunity
---
## 💡 Talking Points
When discussing the project, highlight:
1. **Zero Installation** - Works with npx, no setup needed
2. **Production Ready** - 61 tests, comprehensive docs, zero vulnerabilities
3. **Privacy First** - Session isolation, unique files per session
4. **Dual Platform** - Works in both Cursor and Claude Desktop
5. **MCP Best Practices** - No stdout pollution, proper error handling
6. **Real-time** - Configurable chunk sizes for live transcription
---
## 🙏 Acknowledgments
Consider thanking:
- Model Context Protocol team for the excellent SDK
- OpenAI for Whisper API
- Anthropic (Claude Desktop) and Cursor teams
- Early testers and contributors
---
## 📞 Contact & Support
- **GitHub Issues:** Best for bug reports
- **GitHub Discussions:** Best for questions
- **Twitter/X:** For announcements
- **Email:** For security issues
---
**Status:** ✅ Ready to share with the community!
**Last Updated:** 2025-10-03
**Next Review:** After first 100 downloads