v5.0.0 #3578
t83714 announced in Announcements
We are excited to announce the release of Magda v5.0.0, a major update introducing groundbreaking features to enhance data discovery, querying, and exploration. This release includes three powerful new capabilities:
💬 In-Browser LLM Chatbot: AI-Powered Data Exploration
Understanding and analyzing datasets has never been easier. Magda v5.0.0 introduces an LLM-powered chatbot that operates entirely within your web browser, making data exploration more intuitive than ever.
🧠 Key Features
Conversational Dataset Search — Ask the chatbot to find datasets using natural language queries.
Automated Data Analysis — Upload tabular data, and the chatbot will analyze, visualize, and summarize key insights.
SQL Query Generation — Generate SQL queries dynamically from chat prompts, which can be executed in Magda’s new SQL Console.
🔹 Why In-Browser AI?
Unlike traditional server-based AI solutions, Magda’s chatbot runs directly in the browser using WebGPU, reducing infrastructure costs and enhancing privacy by keeping inference local to the user’s device.
Learn more about Magda Chatbot in our Intro Document.
📊 SQL Console: Powerful Data Querying in Your Browser
With Magda v5.0.0, we’re introducing SQL Console, a built-in, browser-based SQL tool for querying datasets, including Excel spreadsheets and CSV, TSV, and other tab-separated data files, regardless of whether the data is stored and managed by Magda or by a third-party platform. Queries are executed on the client side using browser resources, which makes the tool scalable and efficient.
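To illustrate the general idea of running SQL against a tabular file locally, without a round trip to a server, here is a minimal sketch. It is an analogy only: it uses Python's standard `sqlite3` and `csv` modules rather than Magda's in-browser engine, and the `query_csv` helper, the `data` table name, and the sample CSV are all hypothetical.

```python
import csv
import io
import sqlite3


def query_csv(csv_text: str, sql: str, table: str = "data") -> list[tuple]:
    """Load CSV text into an in-memory SQLite table and run one SQL query.

    Hypothetical helper: a stand-in for "query a file locally with SQL",
    not Magda's implementation.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, records = rows[0], rows[1:]
    conn = sqlite3.connect(":memory:")
    try:
        columns = ", ".join(f'"{name}"' for name in header)
        placeholders = ", ".join("?" for _ in header)
        # Columns are created untyped, so values are stored as text.
        conn.execute(f'CREATE TABLE "{table}" ({columns})')
        conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', records)
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


# Made-up sample data, for illustration only.
SAMPLE_CSV = "city,population\nSydney,5300000\nHobart,250000\nDarwin,150000\n"

result = query_csv(
    SAMPLE_CSV,
    "SELECT city FROM data WHERE CAST(population AS INTEGER) > 200000 ORDER BY city",
)
print(result)
```

The whole file is parsed and queried in local memory, which mirrors the client-side execution model described above: the only network cost is fetching the file itself.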
⚡ Key Capabilities
🖥️ How to Access SQL Console
You can launch the SQL Console using a simple keyboard shortcut:
Additionally, other Magda features (such as the LLM-powered chatbot) may open the SQL Console automatically when needed.
🔎 Querying Data with source() Function
Magda extends SQL with the source() function, allowing you to query datasets without needing to worry about underlying storage details or file formats.
Basic Usage:
The source() function accepts one of the following as its parameter:
🚀 Performance Benchmark
Tested on a 16GB M1 MacBook Pro using a 1-million-record (174.2MB) CSV dataset:
Query time: 2.458s (average of 3 runs, excluding file download time)
Query time: 3.470s (average of 3 runs, excluding file download time)
Learn more about SQL Console in our Quick Guide.
🚀 Hybrid Search Engine: The Best of Keyword & AI-Powered Search
Magda v5.0.0 introduces a hybrid search engine that combines keyword-based (lexical) search and semantic (vector) search to improve search relevance and precision.
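One common way to combine a lexical result list with a vector result list is reciprocal rank fusion (RRF). The announcement does not say which fusion method Magda uses, so the sketch below is an assumption chosen purely to make the idea concrete; the function name, the constant `k=60`, and the dataset IDs are all hypothetical.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one ranked list.

    Each document earns 1 / (k + rank) from every list it appears in,
    so items ranked well by both searches rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Made-up result lists: one from keyword matching, one from embeddings.
lexical_hits = ["rainfall-2020", "rainfall-stations", "flood-maps"]
vector_hits = ["precipitation-grid", "rainfall-2020", "flood-maps"]

fused = reciprocal_rank_fusion([lexical_hits, vector_hits])
print(fused)
```

In this example, `rainfall-2020` and `flood-maps` appear in both lists and therefore outrank the documents found by only one search. `precipitation-grid` is a semantic match with no shared keyword, and it still makes the fused list.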
🔍 Lexical vs. Vector Search
✅ Why Hybrid Search?
By combining lexical and vector search, Magda’s hybrid engine:
Learn more about the Magda Hybrid Search Engine in our Intro Document.
Changes since v4.2.4
#3573 Make JVM MaxRAMPercentage option configurable for all scala services via helm charts
#3575 Remove unused & obsolete isAdmin field on users table
auth-api-client typescript definition in authentication-plugin-sdk to prevent TS2345 type error
Migration
If you are on version < v4.0.0, you should upgrade to the latest v4 version before upgrading to v5.0.0.
For more detailed migration notes, see: https://github.com/magda-io/magda/blob/main/docs/docs/migration/5.0.0.md
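The upgrade rule above can be expressed as a small check, for example in a deployment script. This is a sketch under stated assumptions: both helper names are hypothetical, and only plain `major.minor.patch` version strings are handled (pre-release suffixes are not).

```python
def parse_version(text):
    """Turn 'v4.2.4' or '4.2.4' into a comparable tuple such as (4, 2, 4)."""
    return tuple(int(part) for part in text.lstrip("v").split("."))


def needs_v4_first(current_version):
    """True when the notes advise upgrading to the latest v4 before v5.0.0."""
    return parse_version(current_version) < (4, 0, 0)


# Example version strings; the v3.x value is made up for illustration.
for version in ["v3.0.4", "v4.0.0", "v4.2.4"]:
    step = "upgrade to latest v4 first" if needs_v4_first(version) else "upgrade to v5.0.0"
    print(f"{version}: {step}")
```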
Full Changelog: v4.2.4...v5.0.0
This discussion was created from the release v5.0.0.