The file-processing-transcription library is an extension of the file-processing library, designed to add transcription functionality to the core file processing capabilities. This library is built as a decorator, enabling it to wrap around relevant audio and video file types to transcribe spoken content to text using OpenAI's Whisper model.
- Audio Transcription: Converts spoken content in audio or video files to text.
- Language Detection: Automatically detects the language of transcribed content.
- Decorator Pattern: Seamlessly integrates with the
Fileclass fromfile-processingto add transcription functionality.
To install the file-processing-transcription library and its dependencies, use the following command:
pip install -r requirements.txtAll required packages, including Whisper, are listed in the requirements.txt file.
Note: ffmpeg must be installed on your system for Whisper to process audio and video files. You can install it using the following command:
sudo apt-get update && sudo apt-get install -y ffmpegOn macOS with Homebrew:
brew install ffmpegFor Windows, download from https://ffmpeg.org/download.html and follow the installation instructions.
Contributions are welcome! Please follow these steps:
- Fork the Repository: Create your fork on GitHub.
- Create a Feature Branch: Work on your feature in a separate branch.
- Write Tests: Ensure any changes are covered by tests.
- Submit a Pull Request: When ready, submit a PR for review.
This project is licensed under the MIT License.
For questions or support, please contact:
- Email: ocdo-bdpd@hc-sc.gc.ca
Enhance your file processing suite with transcription capabilities for audio and video content!