new-contrib: Audio Whisper API with Local Device Microphones by yishangupenn · Pull Request #49 · perf-ai-example-org/openai-cookbook

yishangupenn · 2026-03-10T02:52:28Z

Copied from upstream: openai/openai-cookbook#1271
Original author: @CarlKho-Minerva
Originally opened: 2024-07-06

Summary

This PR adds a new notebook that demonstrates how to use the Whisper API to transcribe speech captured by your device's microphone. The notebook includes steps to record audio, transcribe it using the Whisper API, and copy the transcription to the clipboard. It aims to provide a practical guide for users who want to integrate speech-to-text functionality into their applications.
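For readers skimming the PR, the record, transcribe, and copy flow described above could look roughly like the sketch below. It is not taken from the notebook: the use of `sounddevice` for recording and `pyperclip` for the clipboard, the 16 kHz mono format, and all function names are assumptions made for illustration. Only the `whisper-1` transcription call follows the public OpenAI Python SDK.

```python
import io
import wave


def pcm_to_wav_bytes(pcm: bytes, sample_rate: int = 16000, channels: int = 1) -> bytes:
    """Wrap raw 16-bit PCM samples in an in-memory WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


def record_pcm(seconds: float, sample_rate: int = 16000) -> bytes:
    """Record mono 16-bit audio from the default microphone."""
    # Assumption: `sounddevice` is the recording library; the notebook may use another.
    import sounddevice as sd

    frames = sd.rec(int(seconds * sample_rate), samplerate=sample_rate,
                    channels=1, dtype="int16")
    sd.wait()  # block until the recording has finished
    return frames.tobytes()


def transcribe_wav(client, wav_bytes: bytes, model: str = "whisper-1") -> str:
    """Send WAV bytes to the transcription endpoint and return the text."""
    result = client.audio.transcriptions.create(
        model=model,
        file=("recording.wav", wav_bytes),  # (filename, content) tuple
    )
    return result.text


def main() -> None:
    """Record five seconds, transcribe them, and copy the text to the clipboard."""
    from openai import OpenAI  # reads OPENAI_API_KEY from the environment
    import pyperclip  # assumption: clipboard helper chosen for this sketch

    client = OpenAI()
    wav_bytes = pcm_to_wav_bytes(record_pcm(seconds=5))
    text = transcribe_wav(client, wav_bytes)
    pyperclip.copy(text)
    print(text)
```

Calling `main()` runs the whole flow. The third-party imports sit inside the functions so that the WAV helper can be tried without a microphone or an API key.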

*This pull request description was written by ChatGPT and reviewed by a human. The article itself, however, was written by a human.

Motivation

This tutorial was created because transcribing speech to text from a microphone is not well documented. I found the mic speech-to-text option in the ChatGPT apps (not the websites) extremely helpful for day-to-day work and wanted to save others from having to learn several different audio processing modules.

For new content

When contributing new content, read through our contribution guidelines, and mark the following action items as completed:

I have added a new entry in registry.yaml (and, optionally, in authors.yaml) so that my content renders on the cookbook website.
I have conducted a self-review of my content based on the contribution guidelines (my previous PR message went into detail on every one of these 😅):
- Relevance: This content is related to building with OpenAI technologies and is useful to others.
- Uniqueness: I have searched for related examples in the OpenAI Cookbook and verified that my content offers new insights or unique information compared to existing documentation.
- Spelling and Grammar: I have checked for spelling or grammatical mistakes.
- Clarity: I have done a final read-through and verified that my submission is well-organized and easy to understand.
- Correctness: The information I include is correct, and all of my code executes successfully.
- Completeness: I have explained everything fully, including all necessary references and citations.

- Refactor: Separate transcribe and translate functions
- Refactor: Clarify prompt usage in demos (example-based)
- Refactor: Add 5-second limit to Spanish translation demo
- Docs: Improve formatting and clarity of audio recording details
- Docs: Add note about prompt usage with links to API docs
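As a rough illustration of the refactor listed above, separate transcribe and translate helpers with an optional prompt might be shaped like this. The function names, the `clamp_duration` helper, and the `MAX_DEMO_SECONDS` constant are hypothetical and not copied from the notebook. The two endpoint calls follow the public OpenAI Python SDK, where transcription keeps the spoken language and translation returns English text.

```python
from typing import Optional

# Hypothetical constant mirroring the 5-second limit mentioned for the translation demo.
MAX_DEMO_SECONDS = 5


def clamp_duration(seconds: float, limit: float = MAX_DEMO_SECONDS) -> float:
    """Keep a requested recording length within [0, limit] seconds."""
    return max(0.0, min(seconds, limit))


def _prompt_kwargs(prompt: Optional[str]) -> dict:
    """Only pass `prompt` to the API when the caller supplied one."""
    return {"prompt": prompt} if prompt else {}


def transcribe(client, audio_file, prompt: Optional[str] = None,
               model: str = "whisper-1") -> str:
    """Transcribe speech, keeping the language that was spoken."""
    result = client.audio.transcriptions.create(
        model=model, file=audio_file, **_prompt_kwargs(prompt)
    )
    return result.text


def translate(client, audio_file, prompt: Optional[str] = None,
              model: str = "whisper-1") -> str:
    """Translate speech into English text."""
    result = client.audio.translations.create(
        model=model, file=audio_file, **_prompt_kwargs(prompt)
    )
    return result.text
```

Keeping the two helpers separate makes it explicit which endpoint a demo uses, and an example-based prompt (for instance a few correctly spelled names) can be passed to either one.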

CarlKho-Minerva added 17 commits July 6, 2024 06:01

new-contrib: submission - Audio Whisper API with Device Microphones

9c75686

chore: Updated yaml info + and typo correcting

117523f

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

687d905

feat: Heavily revise article for device microphone transcription in Whisper API

aced6cd

chore: update authors and registry yaml files

c2eaf92

chore: *correctly* update yaml files

7b53a86

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

3ab74f5

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

5318114

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

64a9070

indent fix

7786f07

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

75e8e52

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

587df2b

fix: clean up merged sections and remove conflict markers in registry.yaml

e81cc13

Merge branch 'carl-kho/Whisper_API-device_mic_transcription' of https://github.com/CarlKho-Minerva/openai-cookbook into carl-kho/Whisper_API-device_mic_transcription

e5d4800

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

776d55f

Merge branch 'main' into carl-kho/Whisper_API-device_mic_transcription

e81a4f7

yishangupenn wants to merge 17 commits into main from upstream-pr-1271


2 participants
