Version: 2.0
Author: Harish & Team
Date: February 2025
- Introduction
- Features
- Installation
- Usage Guide
- Gesture & Voice Commands
- Error Handling
- Project Structure
- Dependencies
- Future Enhancements
- Contributing
- License
This project allows users to control PowerPoint presentations using hand gestures and voice commands. It enhances accessibility and improves user experience by eliminating the need for a traditional clicker.
✅ Supports .pptx PowerPoint files
✅ Hand Gestures to navigate slides
✅ Voice Commands for hands-free control
✅ Real-time gesture visualization
✅ PyQt5 GUI for easy interaction
✅ Built-in error handling for a smoother experience
To run the project, install the required dependencies:

    pip install opencv-python mediapipe pyqt5 comtypes speechrecognition pyaudio

Ensure you have Python 3.x installed.
Run the main script:

    python main.py

- Click "Open Presentation" in the GUI.
- Select a `.pptx` file.
- The PowerPoint will load in slideshow mode.
- Click "Activate Gesture Control" in the GUI.
- Position your hand within the webcam’s view.
- Use predefined hand gestures to navigate slides.
- Click "Activate Voice Control" in the GUI.
- Speak predefined commands to navigate slides.
| Gesture | Action |
|---|---|
| Swipe Right | Next Slide |
| Swipe Left | Previous Slide |
| Open Hand | Pause/Resume |
| Show Five Fingers | Go to First Slide |
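
The swipe rows in the table above can be illustrated with a minimal sketch. It assumes the gesture detector already yields a short history of normalized horizontal hand positions (0.0 at the left edge of the frame, 1.0 at the right), as MediaPipe hand landmarks do. The function name `classify_swipe` and the `min_travel` threshold are hypothetical and are not taken from `gesture_control.py`; a mirrored webcam preview would also flip the direction.

```python
from typing import Optional, Sequence


def classify_swipe(xs: Sequence[float], min_travel: float = 0.25) -> Optional[str]:
    """Classify a horizontal swipe from normalized x-positions (0.0 to 1.0).

    Returns "next" for a rightward swipe, "previous" for a leftward swipe,
    and None when the hand has not travelled far enough to count.
    The threshold is an illustrative assumption, not a tuned value.
    """
    if len(xs) < 2:
        return None  # not enough samples to measure movement
    travel = xs[-1] - xs[0]  # net horizontal displacement over the window
    if travel >= min_travel:
        return "next"  # Swipe Right -> Next Slide
    if travel <= -min_travel:
        return "previous"  # Swipe Left -> Previous Slide
    return None
```

Comparing only the first and last samples keeps the sketch short; a real detector would likely also check the elapsed time and smooth out jitter before acting.
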
| Command | Action |
|---|---|
| "Next" | Move to the next slide |
| "Previous" | Move to the previous slide |
| "Slide X" | Jump to slide X (e.g., "Slide 5") |
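
As a rough illustration of how the recognized phrases above might be mapped to actions, the sketch below parses a transcript string into an action name and an optional slide number. The function `parse_command` and the action names are hypothetical, not the project's actual API. The sketch also assumes the recognizer returns the slide number as digits ("slide 5"); spelled-out numbers ("slide five") would need extra handling.

```python
import re
from typing import Optional, Tuple

# Matches "slide" followed by a number written in digits, e.g. "slide 5".
_SLIDE_RE = re.compile(r"^slide\s+(\d+)$")


def parse_command(text: str) -> Optional[Tuple[str, Optional[int]]]:
    """Turn a recognized phrase into (action, slide_number) or None."""
    phrase = text.strip().lower()
    if phrase == "next":
        return ("next", None)
    if phrase == "previous":
        return ("previous", None)
    match = _SLIDE_RE.match(phrase)
    if match:
        number = int(match.group(1))
        if number >= 1:  # slide numbers start at 1
            return ("goto", number)
    return None  # unrecognized phrase: the caller can ignore it
```

Returning `None` for anything unrecognized lets the caller skip background speech instead of raising an error.
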
- If the microphone is unavailable, an error message is displayed.
- If no hand is detected, the program will wait until a gesture is visible.
- If PowerPoint crashes, the application will attempt to restart it.
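
The restart behaviour described in the last item could be structured along the following lines. This is only a sketch under stated assumptions: `with_restart`, its parameters, and the retry count are hypothetical, and the real `ppt_controller.py` may recover differently.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def with_restart(
    action: Callable[[], T],
    restart: Callable[[], None],
    attempts: int = 3,
    delay: float = 0.0,
) -> T:
    """Run `action`; on failure call `restart` and try again.

    Raises RuntimeError once `attempts` tries have all failed.
    """
    last_error: Optional[Exception] = None
    for attempt in range(attempts):
        try:
            return action()
        except Exception as exc:  # broad on purpose for this sketch
            last_error = exc
            if attempt < attempts - 1:
                restart()  # e.g. relaunch PowerPoint and reopen the file
                time.sleep(delay)  # give the application time to come back
    raise RuntimeError(f"gave up after {attempts} attempts") from last_error
```

A caller would pass the slide operation as `action` and a function that relaunches PowerPoint as `restart`. Capping the number of attempts avoids an endless restart loop when PowerPoint cannot be started at all.
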
gesture_voice_ppt/
├── main.py # Main entry point
├── gui.py # PyQt5 GUI
├── voice_control.py # Handles speech recognition
├── gesture_control.py # Detects hand gestures
├── ppt_controller.py # Controls PowerPoint operations
├── README.md # Project overview
├── DOCUMENTATION.md # Detailed documentation
└── assets/ # Icons, images, etc.
- PyQt5 (Graphical User Interface)
- OpenCV (Webcam access)
- MediaPipe (Hand gesture detection)
- SpeechRecognition (Voice commands)
- PyAudio (Microphone input)
- comtypes (PowerPoint automation)
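
Before launching the application, it can help to confirm that the packages listed above are importable. The sketch below is one possible check and is not part of the project; `missing_packages` and `REQUIRED_MODULES` are hypothetical names. It maps each pip package name to the module name it is imported under and looks that module up without importing it.

```python
import importlib.util
from typing import Dict, List

# pip package name -> module name used in `import` statements
REQUIRED_MODULES: Dict[str, str] = {
    "opencv-python": "cv2",
    "mediapipe": "mediapipe",
    "pyqt5": "PyQt5",
    "comtypes": "comtypes",
    "speechrecognition": "speech_recognition",
    "pyaudio": "pyaudio",
}


def missing_packages(modules: Dict[str, str] = REQUIRED_MODULES) -> List[str]:
    """Return the pip names of packages whose module cannot be found."""
    return [
        package
        for package, module in modules.items()
        if importlib.util.find_spec(module) is None  # lookup only, no import
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: pip install " + " ".join(missing))
    else:
        print("All dependencies found.")
```

Using `find_spec` keeps the check fast and free of side effects, since heavy modules such as OpenCV are located but never loaded.
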
🚀 Add AI-powered gesture customization
🚀 Support for Google Slides & Keynote
🚀 Multiple language support for voice control
Contributions are welcome! Fork the repo, make changes, and submit a pull request.
This project is open-source under the MIT License.