Documentation Index
Fetch the complete documentation index at: https://docs.coralos.ai/llms.txt
Use this file to discover all available pages before exploring further.
Retry attempts
default0If configured to be a value greater than 0, failed proxied LLM requests made to the Coral Server will be retried up to the specified number of times. The delay between retries is exponential, see:
Retry base delay
default1sSee here for more information on the duration format. If an LLM proxy request fails to reach the configured LLM provider, the request may be retried. This value serves as the initial delay for the first retry, and, for every subsequent retry, the delay will be increased exponentially by Retry delay exponent. With the following configuration:
- Retry base delay of
1s - Retry delay exponent of
2.0 - Retry max delay of
10s
1s2s4s8s10s
Retry delay exponent
default2.0The exponent applied to Retry base delay between retries. The exponent is applied after the first retry. The value must be a floating point number.
Retry max delay
default10sSee here for more information on the duration format. The maximum delay between retries. This caps the exponential delay configured with Retry base delay and Retry delay exponent.
Max request size
default20MiBA ByteSize value representing the maximum size of a request that can be proxied to an LLM provider. Requests made larger than this size will be rejected.
Max response size
default80MiBA ByteSize value representing the maximum size of a response that can be read from a proxied LLM provider. Responses larger than this size will be rejected, and the LLM request will receive an error.
Max stream size
default80MiBA ByteSize value representing the maximum size of a response that can be read from a proxied LLM provider via an SSE stream. This value is cumulative across the lifetime of the SSE stream. Responses larger than this size will be rejected, and the LLM request will receive an error.