
feat: add OpenAI's async batch API support for vectorizer #564


Draft
alejandrodnm wants to merge 1 commit into main from adn/batch-api-follow

Conversation

Contributor

@alejandrodnm alejandrodnm commented Mar 14, 2025

Adds support for OpenAI's async batch API to process large numbers of embeddings at lower cost and with higher rate limits.

Key features:

  • New AsyncBatchEmbedder interface for handling async batch operations
  • Support for OpenAI's batch API implementation
  • New database tables for tracking batch status and chunks
  • Configurable polling interval for batch status checks
  • Automatic retry mechanism for failed batches
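The PR names an AsyncBatchEmbedder interface but does not show its definition. A minimal sketch of what such an interface might look like (the method names, signatures, and BatchStatus type here are assumptions, not the PR's actual code):

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass

@dataclass
class BatchStatus:
    """Hypothetical status record for a submitted batch."""
    batch_id: str
    state: str  # e.g. "validating", "in_progress", "completed", "failed"

class AsyncBatchEmbedder(ABC):
    """Sketch of the interface named in the PR; all members are assumptions."""

    @abstractmethod
    def submit_batch(self, chunks: list[str]) -> str:
        """Submit chunks for embedding; return a provider batch id."""

    @abstractmethod
    def poll_batch(self, batch_id: str) -> BatchStatus:
        """Check the current status of a submitted batch."""

    @abstractmethod
    def fetch_results(self, batch_id: str) -> list[list[float]]:
        """Retrieve the embeddings once the batch has completed."""
```

An OpenAI-backed implementation would map these methods onto the Batches API (file upload, batch creation, status retrieval, and result download).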

Database changes:

  • New async_batch_queue_table for tracking batch status
  • New async_batch_chunks_table for storing chunks pending processing
  • Added async_batch_polling_interval column to vectorizer table
  • New SQL functions for managing async batch operations

API changes:

  • New async_batch_enabled parameter in ai.embedding_openai()
  • New ai.vectorizer_enable_async_batches() and ai.vectorizer_disable_async_batches() functions
  • Extended vectorizer configuration to support async batch operations

The async batch workflow:

  1. Chunks are collected and submitted as a batch to OpenAI
  2. Batch status is monitored through polling
  3. When ready, embeddings are retrieved and stored
  4. Batch resources are cleaned up after successful processing
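The four steps above can be sketched as a polling loop. This is a simplified sketch, not the PR's implementation: `embedder` is any object following an AsyncBatchEmbedder-style interface, and the `polling_interval`, retry policy, and `cleanup` method model the configurable polling interval and automatic retry mechanism the description mentions.

```python
import time

def run_async_batch(embedder, chunks, polling_interval=1.0, max_retries=3):
    """Submit chunks, poll until done, fetch embeddings, clean up.

    Hypothetical sketch of the workflow described in the PR; names and
    retry behavior are assumptions.
    """
    for _attempt in range(max_retries + 1):
        batch_id = embedder.submit_batch(chunks)           # 1. submit batch
        while True:
            status = embedder.poll_batch(batch_id)         # 2. poll status
            if status.state in ("completed", "failed"):
                break
            time.sleep(polling_interval)
        if status.state == "completed":
            embeddings = embedder.fetch_results(batch_id)  # 3. retrieve
            embedder.cleanup(batch_id)                     # 4. clean up
            return embeddings
        # batch failed: fall through and retry the whole submission
    raise RuntimeError(f"batch failed after {max_retries} retries")
```

In the actual vectorizer the pending chunks and batch state would live in the new database tables rather than in memory, so the worker can resume polling across restarts.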

https://www.loom.com/share/09ff363c52204bf6851a797d4c4c4d50?sid=79e95d06-017d-4de5-a740-aa5af916b971

@alejandrodnm force-pushed the adn/batch-api-follow branch from 0edcb8e to 846322f on March 14, 2025 15:53
@alejandrodnm force-pushed the adn/batch-api-follow branch from 846322f to f26b879 on March 14, 2025 15:57
@alejandrodnm
Contributor Author

@kolaente this is a first draft of the PR. We have some pending changes to our interfaces, and we need to see how they'll affect this work.
