Releases: BBC-Esq/VectorDB-Plugin
v2.5 - amazing effort
Update a lot so make sure and review the user manual.
Added settings for the transcription feature. Also, it will only load the whisper model when a user clicks "start recording" and immediately delete it from memory afterwards, addressing the issue that some people don't want it taking up memory.
Added settings for the database itself. Lots of fun to experiment with!
Improved print statements so you can see the number of chunks being created among other inner-workings of the program.
Reduced space on main GUI that buttons took to conserve space overall and display settings better.
Refactored some code for future expansion.
v2.3 - glorious
Added a slew of settings.
Significantly upgraded user guide.
After research, removed some embedding models and added others.
Refactored code to more heavily rely on config.yaml.
v2.2.3 - add config settings
Added the ability to modify the port and maximum context settings when receiving a response from LM Studio.
v2.2.2 - faster database
Greatly increased the speed of vector database creation when processing PDF files specifically...as well as when querying...as well as reduced the compute resources all of the foregoing.
Added "tips" tab with helpful information to maximize VRAM usage and overall speedup things.
v2.2.1 - Whisper easier
Since the transcriber is set to "auto" to select best available of gpu or cpu acceleration. Therefore, I added 8 threads default (when using cpu) instead of 4, which is the default from faster-whisper library when not otherwise specified.
Added instructions specific to BGE embedding models, which was accidentally overlooked previously, which will greatly increase the effectiveness of these models (now rivaling or possibly exceeding the instructor embeddings models' quality).
v2.2 - Whisper quants
Simplified the process to download and use different Whisper/Ctranslate2 models automatically.
Revised instructions.
v2.1 - transcribe question!
Added audio transcription for your question to the text box!
Refactored multiple scripts for further expansion and modularity.
v2.0.1 - refactor
Refactored to use initialize.py in order to shorten gui.py somewhat.
Commented out a portion of the html file displayed in the tab. Didn't want to offend the hard work of the ggml/gguf/gptq people unless/until I can more thoroughly test ctranslate2 and their implementations.
Stopped the left pane from hogging the GUI's space, which had made the question and answer boxes too small.
v2.0 - PySide6!
1 - Major refactoring of all scripts.
2 - Upgrade GUI to use PySide6.
3 - Revamp GUI to start using tabs.
4 - CHECK OUT the "Instructions" tab for what's to come.
v1.7 - Important
Removed releases between 1.4.2 and 1.6 because the newer version of chromadb was causing problems; somehow has a limit of ~5600 chunks, which prevents ingesting very large PDF document. The only thing I could think of is that the newer ChromaDB relies on Sqlite3 while the older version relies on DuckDB + Parquet/Clickhouse...So rather than figure out how to make the newer ChromaDB work, reverted, works fine.
This release includes all prior improvements from releases I deleted, except the newer ChromaDB version. Thus, new GUI, new layout, refactoring of scripts in preparation for expansion, numerous additional embedding models, etc.