You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+7-1Lines changed: 7 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,7 +21,7 @@ An AI-powered knowledge management application that lets you create intelligent
21
21
22
22
## ✨ Features
23
23
24
-
- 📚 **Multiple Source Types** - Upload PDFs, text files, Markdown, DOCX, HTML documents, and video URLs (YouTube, Bilibili with automatic subtitle extraction)
24
+
- 📚 **Multiple Source Types** - Upload PDFs, text files, Markdown, DOCX, HTML documents, audio files (MP3, WAV, M4A, etc.), and video URLs (YouTube, Bilibili with automatic subtitle extraction)
25
25
- 🤖 **AI-Powered Chat** - Ask questions and get answers based on your sources
26
26
- ✨ **Multiple Transformations** - Generate summaries, FAQs, study guides, outlines, timelines, glossaries, quizzes, mindmaps, infographics and podcast scripts
27
27
- 📊 **Infographic Generation** - Create beautiful, hand-drawn style infographics from your content using Google's Gemini Nano Banana
@@ -38,6 +38,7 @@ An AI-powered knowledge management application that lets you create intelligent
38
38
- An LLM API key (OpenAI) or Ollama running locally
39
39
-[markitdown](https://github.com/microsoft/markitdown) (optional, for better document conversion)
40
40
-[yt-dlp](https://github.com/yt-dlp/yt-dlp) (optional, for extracting subtitles from YouTube and Bilibili videos)
41
+
-[vosk-transcriber](https://github.com/alphacep/vosk-transcriber) (optional, for transcribing audio files to text)
41
42
42
43
### Installation
43
44
@@ -171,6 +172,7 @@ You can add content to your notebook in three ways:
171
172
- Click the "+" button in the Sources panel
172
173
- Drag and drop or browse for files
173
174
- Supported: PDF, TXT, MD, DOCX, HTML
175
+
- Audio files: MP3, WAV, M4A, AAC, FLAC, OGG, WMA, OPUS (auto-transcribed to text)
174
176
175
177
**Paste Text**
176
178
@@ -230,6 +232,10 @@ CHUNK_OVERLAP=200 # Overlap between chunks
230
232
# Document Conversion
231
233
ENABLE_MARKITDOWN=true # Use Microsoft markitdown for better PDF/DOCX conversion
0 commit comments