new-contrib: Audio Whisper API with Local Device Microphones #49
Open
yishangupenn wants to merge 17 commits into
Open
new-contrib: Audio Whisper API with Local Device Microphones #49yishangupenn wants to merge 17 commits into
yishangupenn wants to merge 17 commits into
Conversation
…://github.com/CarlKho-Minerva/openai-cookbook into carl-kho/Whisper_API-device_mic_transcription
- Refactor: Separate transcribe and translate functions - Refactor: Clarify prompt usage in demos (example-based) - Refactor: Add 5-second limit to Spanish translation demo - Docs: Improve formatting and clarity of audio recording details - Docs: Add note about prompt usage with links to API docs
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
This PR adds a new notebook that demonstrates how to use the Whisper API to transcribe text from your device's microphone. The notebook includes steps to record audio, transcribe it using the Whisper API, and copy the transcription to the clipboard. It aims to provide a practical guide for users who want to integrate speech-to-text functionality into their applications.
*This pull request was written by Chat GPT and reviewed by a human. The article, however, is made by a human.
Motivation
This tutorial was created because the functionality to transcribe speech to text from a microphone is not well-documented. I found the mic speech-to-text option in the ChatGPT apps (not websites) extremely helpful for day-to-day operations and wanted to save others from having to learn about different audio processing modules.
For new content
When contributing new content, read through our contribution guidelines, and mark the following action items as completed: