MULLER provides a comprehensive suite of query functionalities tailored for AI data lakes:
- Comparison Operators: Supports exact and range matching using `>`, `<`, `>=`, and `<=` for numerical types (`int`/`float`) where the tensor htype is `generic`.
- Equality and Inequality: Supports `==` and `!=` for `int`, `float`, `str`, and `bool` types (`generic` or `text` htypes). Users can optionally build inverted indexes to significantly accelerate retrieval performance.
- Range Queries: Supports the `BETWEEN` keyword for numerical types (`int`/`float`). This feature requires an inverted index.
- Full-Text Search: Supports the `CONTAINS` operator for `str` types (`text` htype), backed by an inverted index. For Chinese text, tokenization is handled by the open-source Jieba tokenizer.
- Pattern Matching: Supports `LIKE` for regular expression matching on `str` types (`text` htype).
- Boolean Logic: Supports complex query compositions using `AND`, `OR`, and `NOT` logical connectors.
- Pagination: Supports query results with `OFFSET` and `LIMIT` clauses for efficient data sampling.
- Data Aggregation: Supports standard SQL-like aggregation workflows, including `SELECT`, `GROUP BY`, and `ORDER BY`, alongside aggregate functions such as `COUNT`, `AVG`, `MIN`, `MAX`, and `SUM`.
- Vector Similarity Search: Supports high-dimensional vector similarity retrieval based on IVFPQ, HNSW and DISKANN for AI-centric embedding analysis.
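
To make the role of the inverted index more concrete, here is a minimal, self-contained Python sketch of how equality, `BETWEEN`, `CONTAINS`-style token lookup, `AND`, and `OFFSET`/`LIMIT` could be served from one. This is not MULLER's API or implementation: `ToyInvertedIndex`, `build_value_index`, `build_token_index`, `paginate`, and the sample data are all hypothetical names invented for illustration, and plain whitespace splitting stands in for a real tokenizer such as Jieba.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


class ToyInvertedIndex:
    """Hypothetical index: maps each value (or token) to the row ids holding it."""

    def __init__(self):
        self.postings = defaultdict(list)

    def add(self, row_id, key):
        self.postings[key].append(row_id)

    def equals(self, key):
        # `==` lookup: one dictionary probe instead of a full column scan.
        return list(self.postings.get(key, []))

    def between(self, low, high):
        # Inclusive range lookup: binary-search the sorted distinct keys,
        # then merge the posting lists of every key inside the range.
        keys = sorted(self.postings)
        start, stop = bisect_left(keys, low), bisect_right(keys, high)
        return sorted(r for k in keys[start:stop] for r in self.postings[k])


def build_value_index(column):
    """Index a numeric column by its raw values."""
    index = ToyInvertedIndex()
    for row_id, value in enumerate(column):
        index.add(row_id, value)
    return index


def build_token_index(column):
    """Index a text column by token (assumption: whitespace tokenization)."""
    index = ToyInvertedIndex()
    for row_id, text in enumerate(column):
        for token in set(text.lower().split()):
            index.add(row_id, token)
    return index


def paginate(row_ids, offset=0, limit=None):
    """Apply OFFSET / LIMIT semantics to an ordered list of row ids."""
    end = None if limit is None else offset + limit
    return row_ids[offset:end]


# Made-up sample columns, purely for illustration.
ages = [31, 25, 47, 25, 38]
captions = ["a red car", "blue sky", "red apple on table", "a green car", "sky at night"]

age_index = build_value_index(ages)
caption_index = build_token_index(captions)

eq_rows = age_index.equals(25)            # age == 25
range_rows = age_index.between(30, 40)    # age BETWEEN 30 AND 40
red_rows = caption_index.equals("red")    # caption CONTAINS 'red'

# AND is a set intersection of the two posting lists.
red_and_car = sorted(set(red_rows) & set(caption_index.equals("car")))

# OFFSET 1 LIMIT 2 over all rows matched by a wide range query.
page = paginate(age_index.between(0, 100), offset=1, limit=2)
```

The sketch re-sorts the keys on every `between` call to stay short; a real index would presumably keep them in a sorted or tree-based structure so that range lookups avoid that cost.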