Skip to content

Comments

fix : VAD system stabilization with respeaker class with auto detection added#1

Open
dudududukim wants to merge 10 commits intoKaist-ICLab:mainfrom
dudududukim:feature/respeaker
Open

fix : VAD system stabilization with respeaker class with auto detection added#1
dudududukim wants to merge 10 commits intoKaist-ICLab:mainfrom
dudududukim:feature/respeaker

Conversation

@dudududukim
Copy link

Problem

  • .db in repo
  • VAD not properly working (keep slicing user's very short utterence)

Changes

0. Improve terminal logging

1. src/audio/player.py

  • Added ReSpeaker hardware support
  • Added performance profiling

2. src/dialog_manager/faster_whisper_recognizer.py

  • Added ReSpeaker VAD (Voice Activity Detection) support
  • Improved buffer-based VAD system
  • Added performance profiling

3. src/dialog_manager/llm_chatgpt.py

  • Disabled local emotion_recognition
  • Added performance profiling

4. src/lib/microphone.py

  • Completely added ReSpeaker hardware support
  • Added 6-channel audio support
  • Added echo cancellation functionality

5. src/main.py

  • Added audio system initialization/termination logic
  • Added performance profiling initialization
  • Changed PyAudio termination method

Test

Read README.md

Optional

2025 summer internship (July)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant