Language Model Info
2 min read
In any space or module, the language model is configured as described in this page.
For Azure and LiteLLM-deployed models, only the model name is required. For custom-deployed models, additional parameters must be provided in JSON format to fully define the model configuration.
Overview
The Language Model Info schema is used to configure different language models within our tool.
For Azure OpenAI models or models deployed via LiteLLM, setup requires only the model name, no additional configuration is needed. For custom models, a full configuration is required as a JSON, as outlined in the sections below.
If you are using Azure or LiteLLM, the rest of this documentation can be skipped and you can can refer directly to the LLM Overview.
Example Configuration
Azure Model Configuration
For Azure models, a simple configuration can be achieved by specifying the model name.
{
"language_model": "AZURE_GPT_4_0613"
}Custom Model Configuration
Custom models require a full configuration setup, detailing all necessary parameters:
{
"language_model": {
"name": "Custom_Model_Name",
"version": "1.0",
"provider": "CUSTOM",
"encoder_name": "o200k_base",
"token_limits": {
"token_limit_input": 5000,
"token_limit_output": 1500
},
"capabilities": ["function_calling", "streaming"],
"info_cutoff_at": "2023-10-01",
"published_at": "2023-09-01",
"retirement_at": null,
"deprecated_at": null,
"retirement_text": null
}
}Fields Documentation
The following table describes the different parameters and which once are required.
Field Name | Description | Type | Default Value | Required |
|---|---|---|---|---|
| The name of the language model, can be an Azure model name or a custom name | string | N/A | Yes |
| The version of the language model | string | N/A | Yes |
| The provider of the language model, either AZURE or CUSTOM | string | N/A | Yes |
| The encoder name used for the model | string [enum] |
| No |
| Defines input and output token limits for the model | integer |
| No |
| Lists the capabilities of the model. See below for available capabilities. | array |
| No |
| Date when information cutoff occurs | string |
| No |
| Date when the model was published | string |
| No |
| Date when the model is retired | string |
| No |
| Date when the model is deprecated | string |
| No |
| Text describing the retirement details | string |
| No |
Nested Types
token_limits
Field Name | Type | Description | Default Value | Required |
|---|---|---|---|---|
token_limit_input | integer | Maximum number of input tokens | 7000 | Yes |
token_limit_output | integer | Maximum number of output tokens | 1000 | Yes |
Enums and Constants
Enumerated types and constants define fixed values to standardize data inputs.
Enum Name: The identifier for the enumeration.
Possible Values: All potential values the enum can represent.
Description: Explanation of the enum's role.
Enum Name | Possible Values | Description |
|---|---|---|
EncoderName |
| Defines the encoder used for the model |
LanguageModelName | Lists available Azure language models | |
LanguageModelProvider |
| Specifies the provider of the language model |
ModelCapabilities | json | Enumerates the capabilities that the model can support |
Full Json Schema
{
"$defs": {
"EncoderName": {
"enum": [
"o200k_base",
"cl100k_base"
],
"title": "EncoderName",
"type": "string"
},
"LanguageModelName": {
"enum": [
"AZURE_GPT_35_TURBO_0125",
"AZURE_GPT_4o_2024_0513",
"AZURE_GPT_4o_2024_0806",
"AZURE_GPT_4o_MINI_2024_0718",
"AZURE_o1_PREVIEW_2024_0912",
"AZURE_o1_2024_1217",
"AZURE_o1_MINI_2024_0912",
"AZURE_o3_MINI_2025_0131",
"AZURE_GPT_45_PREVIEW_2025_0227"
],
"title": "LanguageModelName",
"type": "string"
},
"LanguageModelProvider": {
"enum": [
"AZURE",
"CUSTOM"
],
"title": "LanguageModelProvider",
"type": "string"
},
"LanguageModelTokenLimits": {
"properties": {
"token_limit_input": {
"title": "Token Limit Input",
"type": "integer"
},
"token_limit_output": {
"title": "Token Limit Output",
"type": "integer"
}
},
"required": [
"token_limit_input",
"token_limit_output"
],
"title": "LanguageModelTokenLimits",
"type": "object"
},
"ModelCapabilities": {
"enum": [
"function_calling",
"parallel_function_calling",
"reproducible_output",
"structured_output",
"vision",
"streaming",
"reasoning"
],
"title": "ModelCapabilities",
"type": "string"
}
},
"properties": {
"name": {
"anyOf": [
{
"$ref": "#/$defs/LanguageModelName"
},
{
"type": "string"
}
],
"title": "Name"
},
"version": {
"title": "Version",
"type": "string"
},
"provider": {
"$ref": "#/$defs/LanguageModelProvider"
},
"encoder_name": {
"$ref": "#/$defs/EncoderName",
"default": "cl100k_base"
},
"token_limits": {
"$ref": "#/$defs/LanguageModelTokenLimits",
"default": {
"token_limit_input": 7000,
"token_limit_output": 1000
}
},
"capabilities": {
"default": [
"streaming"
],
"items": {
"$ref": "#/$defs/ModelCapabilities"
},
"title": "Capabilities",
"type": "array"
},
"info_cutoff_at": {
"anyOf": [
{
"format": "date",
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"title": "Info Cutoff At"
},
"published_at": {
"anyOf": [
{
"format": "date",
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"title": "Published At"
},
"retirement_at": {
"anyOf": [
{
"format": "date",
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"title": "Retirement At"
},
"deprecated_at": {
"anyOf": [
{
"format": "date",
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"title": "Deprecated At"
},
"retirement_text": {
"anyOf": [
{
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"title": "Retirement Text"
}
},
"required": [
"name",
"version",
"provider"
],
"title": "LanguageModelInfo",
"type": "object"
}