| DEFAULT_REASONING_EFFORT_MEDIUM_THINKING_BUDGET | Default medium reasoning effort thinking budget. Default is 2048
-| DEFAULT_REDIS_SYNC_INTERVAL | Default Redis synchronization interval in seconds. Default is 1
-| DEFAULT_REPLICATE_GPU_PRICE_PER_SECOND | Default price per second for Replicate GPU. Default is 0.001400
-| DEFAULT_REPLICATE_POLLING_DELAY_SECONDS | Default delay in seconds for Replicate polling. Default is 1
-| DEFAULT_REPLICATE_POLLING_RETRIES | Default number of retries for Replicate polling. Default is 5
-| DEFAULT_SLACK_ALERTING_THRESHOLD | Default threshold for Slack alerting. Default is 300
-| DEFAULT_SOFT_BUDGET | Default soft budget for LiteLLM proxy keys. Default is 50.0
-| DEFAULT_TRIM_RATIO | Default ratio of tokens to trim from prompt end. Default is 0.75
 | DIRECT_URL | Direct URL for service endpoint
 | DISABLE_ADMIN_UI | Toggle to disable the admin UI
 | DISABLE_SCHEMA_UPDATE | Toggle to disable schema updates
 | DOCS_DESCRIPTION | Description text for documentation pages
 | DOCS_FILTERED | Flag indicating filtered documentation
 | DOCS_TITLE | Title of the documentation pages
 | DOCS_URL | The path to the Swagger API documentation. **By default this is "/"**
-| EMAIL_LOGO_URL | URL for the logo used in emails
 | EMAIL_SUPPORT_CONTACT | Support contact email address
 | EXPERIMENTAL_MULTI_INSTANCE_RATE_LIMITING | Flag to enable new multi-instance rate limiting. **Default is False**
-| FIREWORKS_AI_4_B | Size parameter for Fireworks AI 4B model. Default is 4
-| FIREWORKS_AI_16_B | Size parameter for Fireworks AI 16B model. Default is 16
-| FIREWORKS_AI_56_B_MOE | Size parameter for Fireworks AI 56B MOE model. Default is 56
-| FIREWORKS_AI_80_B | Size parameter for Fireworks AI 80B model. Default is 80
-| FIREWORKS_AI_176_B_MOE | Size parameter for Fireworks AI 176B MOE model. Default is 176
-| FUNCTION_DEFINITION_TOKEN_COUNT | Token count for function definitions. Default is 9
-| GALILEO_BASE_URL | Base URL for Galileo platform
-| GALILEO_PASSWORD | Password for Galileo authentication
-| GALILEO_PROJECT_ID | Project ID for Galileo usage
-| GALILEO_USERNAME | Username for Galileo authentication
 | GCS_BUCKET_NAME | Name of the Google Cloud Storage bucket
 | GCS_PATH_SERVICE_ACCOUNT | Path to the Google Cloud service account JSON file
 | GCS_FLUSH_INTERVAL | Flush interval for GCS logging (in seconds). Specify how often you want a log to be sent to GCS. **Default is 20 seconds**
@@ -453,7 +402,6 @@ router_settings:
 | GOOGLE_CLIENT_ID | Client ID for Google OAuth
 | GOOGLE_CLIENT_SECRET | Client secret for Google OAuth
 | GOOGLE_KMS_RESOURCE_NAME | Name of the resource in Google KMS
-| HEALTH_CHECK_TIMEOUT_SECONDS | Timeout in seconds for health checks. Default is 60
 | HF_API_BASE | Base URL for Hugging Face API
 | HCP_VAULT_ADDR | Address for [Hashicorp Vault Secret Manager](../secret.md#hashicorp-vault)
 | HCP_VAULT_CLIENT_CERT | Path to client certificate for [Hashicorp Vault Secret Manager](../secret.md#hashicorp-vault)
@@ -463,13 +411,9 @@ router_settings:
 | HCP_VAULT_CERT_ROLE | Role for [Hashicorp Vault Secret Manager Auth](../secret.md#hashicorp-vault)
 | HELICONE_API_KEY | API key for Helicone service
 | HOSTNAME | Hostname for the server, this will be [emitted to `datadog` logs](https://docs.litellm.ai/docs/proxy/logging#datadog)
-| HOURS_IN_A_DAY | Hours in a day for calculation purposes. Default is 24
 | HUGGINGFACE_API_BASE | Base URL for Hugging Face API
 | HUGGINGFACE_API_KEY | API key for Hugging Face API
-| HUMANLOOP_PROMPT_CACHE_TTL_SECONDS | Time-to-live in seconds for cached prompts in Humanloop. Default is 60
 | IAM_TOKEN_DB_AUTH | IAM token for database authentication
-| INITIAL_RETRY_DELAY | Initial delay in seconds for retrying requests. Default is 0.5
-| JITTER | Jitter factor for retry delay calculations. Default is 0.75
 | JSON_LOGS | Enable JSON formatted logging
 | JWT_AUDIENCE | Expected audience for JWT tokens
 | JWT_PUBLIC_KEY_URL | URL to fetch public key for JWT verification
@@ -490,7 +434,6 @@ router_settings:
 | LANGSMITH_PROJECT | Project name for Langsmith integration
 | LANGSMITH_SAMPLING_RATE | Sampling rate for Langsmith logging
 | LANGTRACE_API_KEY | API key for Langtrace service
-| LENGTH_OF_LITELLM_GENERATED_KEY | Length of keys generated by LiteLLM. Default is 16
 | LITERAL_API_KEY | API key for Literal integration
 | LITERAL_API_URL | API URL for Literal service
 | LITERAL_BATCH_SIZE | Batch size for Literal operations
@@ -511,21 +454,6 @@ router_settings:
 | LITELLM_TOKEN | Access token for LiteLLM integration
 | LITELLM_PRINT_STANDARD_LOGGING_PAYLOAD | If true, prints the standard logging payload to the console - useful for debugging
 | LOGFIRE_TOKEN | Token for Logfire logging service
-| MAX_EXCEPTION_MESSAGE_LENGTH | Maximum length for exception messages. Default is 2000
-| MAX_IN_MEMORY_QUEUE_FLUSH_COUNT | Maximum count for in-memory queue flush operations. Default is 1000
-| MAX_LONG_SIDE_FOR_IMAGE_HIGH_RES | Maximum length for the long side of high-resolution images. Default is 2000
-| MAX_REDIS_BUFFER_DEQUEUE_COUNT | Maximum count for Redis buffer dequeue operations. Default is 100
-| MAX_SHORT_SIDE_FOR_IMAGE_HIGH_RES | Maximum length for the short side of high-resolution images. Default is 768
-| MAX_SIZE_IN_MEMORY_QUEUE | Maximum size for in-memory queue. Default is 10000
-| MAX_SIZE_PER_ITEM_IN_MEMORY_CACHE_IN_KB | Maximum size in KB for each item in memory cache. Default is 512 or 1024
-| MAX_SPENDLOG_ROWS_TO_QUERY | Maximum number of spend log rows to query. Default is 1,000,000
-| MAX_TEAM_LIST_LIMIT | Maximum number of teams to list. Default is 20
-| MAX_TILE_HEIGHT | Maximum height for image tiles. Default is 512
-| MAX_TILE_WIDTH | Maximum width for image tiles. Default is 512
-| MAX_TOKEN_TRIMMING_ATTEMPTS | Maximum number of attempts to trim a token message. Default is 10
-| MAX_RETRY_DELAY | Maximum delay in seconds for retrying requests. Default is 8.0
-| MIN_NON_ZERO_TEMPERATURE | Minimum non-zero temperature value. Default is 0.0001
-| MINIMUM_PROMPT_CACHE_TOKEN_COUNT | Minimum token count for caching a prompt. Default is 1024
 | MISTRAL_API_BASE | Base URL for Mistral API
 | MISTRAL_API_KEY | API key for Mistral API
 | MICROSOFT_CLIENT_ID | Client ID for Microsoft services
@@ -534,12 +462,10 @@ router_settings:
 | MICROSOFT_SERVICE_PRINCIPAL_ID | Service Principal ID for Microsoft Enterprise Application. (This is an advanced feature if you want litellm to auto-assign members to Litellm Teams based on their Microsoft Entra ID Groups)
 | NO_DOCS | Flag to disable documentation generation
 | NO_PROXY | List of addresses to bypass proxy
-| NON_LLM_CONNECTION_TIMEOUT | Timeout in seconds for non-LLM service connections. Default is 15
 | OAUTH_TOKEN_INFO_ENDPOINT | Endpoint for OAuth token info retrieval
 | OPENAI_BASE_URL | Base URL for OpenAI API
 | OPENAI_API_BASE | Base URL for OpenAI API
 | OPENAI_API_KEY | API key for OpenAI services
-| OPENAI_FILE_SEARCH_COST_PER_1K_CALLS | Cost per 1000 calls for OpenAI file search. Default is 0.0025
 | OPENAI_ORGANIZATION | Organization identifier for OpenAI
 | OPENID_BASE_URL | Base URL for OpenID Connect services
 | OPENID_CLIENT_ID | Client ID for OpenID Connect authentication
@@ -561,37 +487,21 @@ router_settings:
 | PREDIBASE_API_BASE | Base URL for Predibase API
 | PRESIDIO_ANALYZER_API_BASE | Base URL for Presidio Analyzer service
 | PRESIDIO_ANONYMIZER_API_BASE | Base URL for Presidio Anonymizer service
-| PROMETHEUS_BUDGET_METRICS_REFRESH_INTERVAL_MINUTES | Refresh interval in minutes for Prometheus budget metrics. Default is 5
-| PROMETHEUS_FALLBACK_STATS_SEND_TIME_HOURS | Fallback time in hours for sending stats to Prometheus. Default is 9
 | PROMETHEUS_URL | URL for Prometheus service
 | PROMPTLAYER_API_KEY | API key for PromptLayer integration
 | PROXY_ADMIN_ID | Admin identifier for proxy server
 | PROXY_BASE_URL | Base URL for proxy service
-| PROXY_BATCH_WRITE_AT | Time in seconds to wait before batch writing spend logs to the database. Default is 10
-| PROXY_BUDGET_RESCHEDULER_MAX_TIME | Maximum time in seconds to wait before checking database for budget resets. Default is 605
-| PROXY_BUDGET_RESCHEDULER_MIN_TIME | Minimum time in seconds to wait before checking database for budget resets. Default is 597
 | PROXY_LOGOUT_URL | URL for logging out of the proxy service
 | LITELLM_MASTER_KEY | Master key for proxy authentication
 | QDRANT_API_BASE | Base URL for Qdrant API
 | QDRANT_API_KEY | API key for Qdrant service
-| QDRANT_SCALAR_QUANTILE | Scalar quantile for Qdrant operations. Default is 0.99
 | QDRANT_URL | Connection URL for Qdrant database
-| QDRANT_VECTOR_SIZE | Vector size for Qdrant operations. Default is 1536
-| REDIS_CONNECTION_POOL_TIMEOUT | Timeout in seconds for Redis connection pool. Default is 5
 | REDIS_HOST | Hostname for Redis server
 | REDIS_PASSWORD | Password for Redis service
 | REDIS_PORT | Port number for Redis server
-| REDIS_SOCKET_TIMEOUT | Timeout in seconds for Redis socket operations. Default is 0.1
 | REDOC_URL | The path to the Redoc Fast API documentation. **By default this is "/redoc"**
-| REPEATED_STREAMING_CHUNK_LIMIT | Limit for repeated streaming chunks to detect looping. Default is 100
-| REPLICATE_MODEL_NAME_WITH_ID_LENGTH | Length of Replicate model names with ID. Default is 64
-| REPLICATE_POLLING_DELAY_SECONDS | Delay in seconds for Replicate polling operations. Default is 0.5
-| REQUEST_TIMEOUT | Timeout in seconds for requests. Default is 6000
-| ROUTER_MAX_FALLBACKS | Maximum number of fallbacks for router. Default is 5
-| SECRET_MANAGER_REFRESH_INTERVAL | Refresh interval in seconds for secret manager. Default is 86400 (24 hours)
 | SERVER_ROOT_PATH | Root path for the server application
 | SET_VERBOSE | Flag to enable verbose logging
-| SINGLE_DEPLOYMENT_TRAFFIC_FAILURE_THRESHOLD | Minimum number of requests to consider "reasonable traffic" for single-deployment cooldown logic. Default is 1000
 | SLACK_DAILY_REPORT_FREQUENCY | Frequency of daily Slack reports (e.g., daily, weekly)
 | SLACK_WEBHOOK_URL | Webhook URL for Slack integration
 | SMTP_HOST | Hostname for the SMTP server
@@ -608,17 +518,7 @@ router_settings:
 | SUPABASE_KEY | API key for Supabase service
 | SUPABASE_URL | Base URL for Supabase instance
 | STORE_MODEL_IN_DB | If true, enables storing model + credential information in the DB.
-| SYSTEM_MESSAGE_TOKEN_COUNT | Token count for system messages. Default is 4
 | TEST_EMAIL_ADDRESS | Email address used for testing purposes
-| TOGETHER_AI_4_B | Size parameter for Together AI 4B model. Default is 4
-| TOGETHER_AI_8_B | Size parameter for Together AI 8B model. Default is 8
-| TOGETHER_AI_21_B | Size parameter for Together AI 21B model. Default is 21
-| TOGETHER_AI_41_B | Size parameter for Together AI 41B model. Default is 41
-| TOGETHER_AI_80_B | Size parameter for Together AI 80B model. Default is 80
-| TOGETHER_AI_110_B | Size parameter for Together AI 110B model. Default is 110
-| TOGETHER_AI_EMBEDDING_150_M | Size parameter for Together AI 150M embedding model. Default is 150
-| TOGETHER_AI_EMBEDDING_350_M | Size parameter for Together AI 350M embedding model. Default is 350
-| TOOL_CHOICE_OBJECT_TOKEN_COUNT | Token count for tool choice objects. Default is 4
 | UI_LOGO_PATH | Path to the logo image used in the UI