You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<3> Increases the token quota limit for the cluster by 100,000 over a period of 30 days.
56
58
<4> Defines the number of seconds that the scheduler waits and then checks if the period interval is over. When the period interval is over, the scheduler stores the timestamp and resets or increases the quota limit.
57
59
58
-
. Apply the `OLSConfigMap` file so that the token limit quota takes effect by running the following command:
60
+
. Apply the `ConfigMap` file so that the token limit quota takes effect by running the following command:
Tokens are small chunks of text, which can be as short as one character or as long as one word. Tokens are the units of measurement used to quantify the amount of text that the {ols-long} Service sends to, or receives from, a large language model (LLM) - every interaction with the LLM is counted in tokens.
8
+
Tokens are small chunks of text, which can be as small as one character or as large as one word. Tokens are the units of measurement used to quantify the amount of text that the {ols-long} Service sends to, or receives from, a large language model (LLM). Every interaction with the Service and the LLM is counted in tokens.
9
9
10
-
Token quota limits define the number of tokens that can be used in a certain timeframe. Implementing token quota limits helps control costs, encourages more efficient use of queries, and helps regulate system demands. In multi-user configurations, token quota limits can provide equal access to all users ensuring everyone has an opportunity to submit queries.
10
+
Token quota limits define the number of tokens that can be used in a certain timeframe. Implementing token quota limits helps control costs, encourage more efficient use of queries, and regulate system demands. In a multi-user configuration, token quota limits help provide equal access to all users ensuring everyone has an opportunity to submit queries.
11
11
12
-
Token quota limits can be defined for an {ocp-short-name}cluster or for an individual user account.
12
+
You can define token quota limits for {ocp-short-name}clusters or {ocp-short-name}user accounts.
0 commit comments