Skip to content

Latest commit

 

History

History

README.md

BSP: ESP-VoCat

HW Reference HOW TO USE API Component Registry maintenance-status

Overview

ESP-VoCat is an intelligent AI development kit. It is suitable for voice interaction products that require large model capabilities, such as toys, smart speakers, and smart central control systems. The device is equipped with a 1.85-inch QSPI circular touch screen, dual microphone array, and supports offline voice wake-up and sound source localization algorithms. Combined with the large model capabilities provided by OpenAI,Xiaozhi AI, Gemini, etc., ESP-VoCat can achieve full-duplex voice interaction, multimodal recognition, and intelligent agent control, providing a solid foundation for developers to create complete edge-side AI application experiences.

Capabilities and dependencies

Available Capability Controller/Codec Component Version
✔️ 📟 DISPLAY st77916 idf
espressif/esp_lcd_st77916
>=5.5
*
✔️ ⚫ LVGL_PORT espressif/esp_lvgl_port ^2
✔️ ☝️ TOUCH cst816s espressif/esp_lcd_touch_cst816s *
✔️ 🔘 BUTTONS espressif/button ^4
⚪ KNOB
✔️ 🎵 AUDIO espressif/esp_codec_dev ~1.5
✔️ 🔈 AUDIO_SPEAKER es8311
✔️ 🎤 AUDIO_MIC es7210
✔️ 💾 SDCARD idf >=5.5
✔️ 💡 LED idf
espressif/led_indicator
>=5.5
^2
📷 CAMERA
🔋 BAT
🎮 IMU

Compatible BSP Examples

Example Description Try with ESP Launchpad
Display Example Show an image on the screen with a simple startup animation (LVGL) Flash Example
Display, Audio and Photo Example Complex demo: browse files from filesystem and play/display JPEG, WAV, or TXT files (LVGL) Flash Example
LVGL Demos Example Run the LVGL demo player - all LVGL examples are included (LVGL) Flash Example
Display Rotation Example Rotate screen using buttons or an accelerometer (BSP_CAPS_IMU, if available) Flash Example
Display SD card Example Example of mounting an SD card using SD-MMC/SPI with display interaction. This example is also supported on boards without a display. Flash Example