This document serves as a protocol for AI Agents assisting with Dremio-related tasks. Follow these guidelines to ensure accuracy, security, and high-quality output.
This repository contains a structured, offline index of Dremio documentation to help you find the source of truth quickly without guessing URLs.
- Root Index:
dremio_sitemaps/README.md- This file lists all top-level documentation sections (e.g., Cloud, software versions).
- Sub-Indices: Each top-level section is a directory (e.g.,
dremio_sitemaps/cloud/) containing:README.md: The index for that section.*.mdfiles: Specific sub-topics (e.g.,arctic.md).
Protocol:
- Identify the Context: Determine if the user is using Dremio Cloud or a specific Software version (e.g., 24.3.x).
- Locate the Index: Open the relevant
README.mdindremio_sitemaps/. - Find the specific Page: Search the markdown index for the topic (e.g., "SQL Functions", "Rest API").
- Reference the URL: Use the actual Dremio documentation URL provided in the sitemap to verify syntax or parameters if you have browsing capabilities, or cite it for the user.
Dremio's SQL dialect and feature set are specific. Never assume generic SQL or standard behavior.
- Check the Docs: Before writing a SQL query, consult the "SQL Reference" section found via the sitemap.
- Verify Function Signatures: Ensure function names and arguments match the Dremio version being used.
- Data Types: Verify supported data types and casting rules.
- Mandatory Check: Dremio has a list of reserved keywords that cannot be used as identifiers without quoting.
- Action: detailed in
dremio_sitemaps/.../reserved_keywords.md(or similar). - Best Practice: Always double-quote identifiers (e.g.,
"timestamp","table","date") if there is any doubt.
NEVER ask the user to provide credentials (usernames, passwords, tokens) directly in the chat or code snippets.
Instruct the user to create a .env file in their project root.
Requested Variables:
# Connection Details
DREMIO_URL="https://app.dremio.cloud" # or their software URL
DREMIO_USER="user@example.com"
DREMIO_PASSWORD="secure_password"
DREMIO_PAT="personal_access_token" # Preferred over password for Cloud
DREMIO_PROJECT_ID="project_id" # For Dremio CloudPython Implementation Pattern:
When writing validtion scripts or tools, always use python-dotenv:
import os
from dotenv import load_dotenv
load_dotenv()
dremio_url = os.getenv("DREMIO_URL")
dremio_pat = os.getenv("DREMIO_PAT")
if not dremio_pat:
raise ValueError("DREMIO_PAT is missing from .env file")- Research: Use
dremio_sitemaps/to locate relevant documentation. - Plan: Check against Reserved Words and SQL support.
- Secure: Ensure any code proposed uses
.envfor secrets. - Execute: Generate the code or answer.
- Validate: Review generated code against the documentation constraints one last time.
- If using Dremio Cloud, use
dremio_sitemaps/dremio-cloud/as the root. - If using Dremio Software, use
dremio_sitemaps/current/as the root. - For the Dremio Cloud API, use
dremio_sitemaps/dremio-cloud/api.md - For Dremio Software API, use
dremio_sitemaps/current/reference.md - For Dremio SQL Syntax use, use
dremio_sitemaps/dremio-cloud/sql/readme.md - For Dremio Sources on Cloud, use
dremio_sitemaps/dremio-cloud/bring-data.md - For Dremio Sources on Software, use
dremio_sitemaps/current/data-sources.md
Before writing custom Python scripts from scratch, verify if an existing tool library handles the use case.
Use dremioframe for:
- Data Engineering: ETL/ELT pipelines, managing tables/views, Iceberg table management.
- Administrative Tasks: User/role management, reflection management.
- Documentation Link:
dremio_sitemaps/dremioframe/README.md
⚠️ Dremio Cloud ONLY. This is the official CLI maintained by Dremio. It does NOT support Dremio Software (self-hosted).
- Package:
dremio-clion PyPI - Source: github.com/dremio/cli
- Install:
pipx install dremio-cli(recommended) oruv tool install dremio-cli - Invoked as:
dremio <command> - Auth: Requires a PAT (
DREMIO_TOKEN) and a Project ID (DREMIO_PROJECT_ID). - Best for: Query execution, schema inspection, reflections, jobs, folder management — all scoped to a Dremio Cloud project.
- Documentation Link:
dremio_sitemaps/dremio-cli/README.md
✅ Supports BOTH Dremio Cloud and Dremio Software. This is a community-maintained CLI that covers the full Dremio REST API surface area.
- Package:
alt-dremio-clion PyPI - Source: github.com/developer-advocacy-dremio/dremio-python-cli
- Install:
pip install alt-dremio-cli - Invoked as:
alt-dremio-cli <command> - Auth: Profile-based via
alt-dremio-cli profile create, supporting both username/password (Software) and PAT (Cloud). - Best for: Full catalog management, SQL execution, source/view/table management, grants, users, roles — spanning Cloud and Software.
- Documentation Link:
dremio_sitemaps/alt-dremio-cli/README.md