Skip to content

Commit f96a9a4

Browse files
authored
Merge pull request coleam00#213 from coleam00/fix/consolidate-concurrency-settings
Fix crawler concurrency configuration to prevent memory crashes
2 parents e2e51f3 + e98f52a commit f96a9a4

File tree

4 files changed

+312
-225
lines changed

4 files changed

+312
-225
lines changed

.env.example

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -33,4 +33,9 @@ EMBEDDING_DIMENSIONS=1536
3333
# - OPENAI_API_KEY (encrypted)
3434
# - MODEL_CHOICE
3535
# - TRANSPORT settings
36-
# - RAG strategy flags (USE_CONTEXTUAL_EMBEDDINGS, USE_HYBRID_SEARCH, etc.)
36+
# - RAG strategy flags (USE_CONTEXTUAL_EMBEDDINGS, USE_HYBRID_SEARCH, etc.)
37+
# - Crawler settings:
38+
# * CRAWL_MAX_CONCURRENT (default: 10) - Max concurrent pages per crawl operation
39+
# * CRAWL_BATCH_SIZE (default: 50) - URLs processed per batch
40+
# * MEMORY_THRESHOLD_PERCENT (default: 80) - Memory % before throttling
41+
# * DISPATCHER_CHECK_INTERVAL (default: 0.5) - Memory check interval in seconds

0 commit comments

Comments
 (0)