Retry attempts

default 0
If set to a value greater than 0, failed proxied LLM requests made to the Coral Server are retried up to the specified number of times. With the default of 0, failed requests are not retried. The delay between retries grows exponentially; see:

Retry base delay
Retry delay exponent
Retry attempts
Retry max delay.

Retry base delay

default 1s
See here for more information on the duration format. If an LLM proxy request fails to reach the configured LLM provider, the request may be retried. This value is the delay before the first retry. For every subsequent retry, the delay grows exponentially according to Retry delay exponent, up to Retry max delay. With the following configuration:

Retry base delay of 1s
Retry delay exponent of 2.0
Retry max delay of 10s

Delays between retries will be:

1s
2s
4s
8s
10s
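The schedule above can be reproduced with a short sketch. It assumes the delay before retry n is the base delay multiplied by the exponent raised to n - 1, capped at the max delay, which matches the listed values; `retry_delays` is a hypothetical helper for illustration, not part of Coral Server.

```python
def retry_delays(base_delay: float, exponent: float,
                 max_delay: float, attempts: int) -> list[float]:
    """Return the delay (in seconds) before each retry.

    Hypothetical helper: assumes delay = base_delay * exponent**n,
    capped at max_delay, where n is 0 for the first retry.
    """
    delays = []
    for retry_index in range(attempts):
        # The first retry waits exactly base_delay (exponent ** 0 == 1).
        uncapped = base_delay * (exponent ** retry_index)
        # Retry max delay caps the exponential growth.
        delays.append(min(uncapped, max_delay))
    return delays


# Base delay 1s, exponent 2.0, max delay 10s, five retries.
print(retry_delays(1.0, 2.0, 10.0, 5))  # [1.0, 2.0, 4.0, 8.0, 10.0]
```

The fifth delay would be 16s without the cap, so it is clamped to the 10s maximum.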

Retry delay exponent

default 2.0
The exponent applied to Retry base delay between retries. It is applied after the first retry, so the first retry always waits exactly Retry base delay. The value must be a floating-point number.

Retry max delay

default 10s
See here for more information on the duration format. The maximum delay between retries. This caps the exponential delay configured with Retry base delay and Retry delay exponent.

Max request size

default 20MiB
A ByteSize value representing the maximum size of a request that can be proxied to an LLM provider. Requests larger than this size will be rejected.

Max response size

default 80MiB
A ByteSize value representing the maximum size of a response that can be read from a proxied LLM provider. Responses larger than this size will be rejected, and the LLM request will receive an error.

Max stream size

default 80MiB
A ByteSize value representing the maximum size of a response that can be read from a proxied LLM provider via an SSE stream. This value is cumulative across the lifetime of the SSE stream. Responses larger than this size will be rejected, and the LLM request will receive an error.
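To illustrate what "cumulative across the lifetime of the SSE stream" means, here is a minimal sketch of a running byte counter over stream chunks. The names `limit_stream` and `StreamTooLarge` are hypothetical, and this is not Coral Server's implementation; it only shows that the limit applies to the total bytes read so far, not to any single chunk.

```python
from typing import Iterable, Iterator


class StreamTooLarge(Exception):
    """Raised when the cumulative stream size exceeds the limit (hypothetical)."""


def limit_stream(chunks: Iterable[bytes], max_bytes: int) -> Iterator[bytes]:
    """Yield chunks until the running total exceeds max_bytes."""
    total = 0
    for chunk in chunks:
        # The counter is never reset: it spans the whole stream.
        total += len(chunk)
        if total > max_bytes:
            raise StreamTooLarge(f"stream exceeded {max_bytes} bytes")
        yield chunk


# Each chunk is small, but the third one pushes the total past the limit.
try:
    for chunk in limit_stream([b"abcd", b"efgh", b"ij"], max_bytes=8):
        print(chunk)
except StreamTooLarge as error:
    print("rejected:", error)
```

With the 80MiB default, `max_bytes` would be 80 * 1024 * 1024.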

Providers

default empty array

Setting a Coral Cloud API key will automatically configure Coral Server LLM proxies for supported Coral Cloud LLM providers and models.

LLM Proxy providers cannot be configured from the command line.

[[llm-proxy.providers]]
name = "My OpenAI Provider"
format.type = "OpenAI"
api_key = "..."
base_url = "https://api.openai.com/v1"
models = ["gpt-4.1-mini", "gpt-5-mini"]

[[llm-proxy.providers]]
name = "Default Anthropic"
format.type = "Anthropic"
api_key = "..."
base_url = "https://api.anthropic.com"
models = ["claude-opus-4-6", "claude-sonnet-4-0"]

An array of LlmProxyProviderConfig objects.
