LLM Availability Overview

8 min read

Overview

Unique can connect to almost any Large Language Model (LLM), whether it is an open-source model or a proprietary one. Depending on the LLM and the deployment model of Unique, using a non-standard model (e.g. Azure OpenAI) may have certain implications.

Unique can tailor the selection of LLMs based on each Space and use case, ensuring you have access to the most suitable models for your specific needs to increase accuracy.

Proprietary LLM API Providers

We can connect your Unique deployment to any AI platform of your choice. Examples include OpenAI, Anthropic, and Google Cloud.

note

Connecting a third-party AI platform means data will leave your tenant or on-premise system. Our AI Governance team reviews each request to connect a third-party provider and ensures that the provider adheres to our strict data privacy and AI safety guidelines.

info

For customers on our US tenant we have vetted and support

  • Anthropic (Claude Opus, Sonnet, and Haiku )

  • OpenAI (GPT-5 and later)

  • Together AI (LLaMA, DeepSeek, Qwen, and other open-source models)

  • Google (Gemini 3.0+ Pro and Flash)

You can use any model supported by these platforms. If the account on the platform is managed by Unique, you will benefit from our negotiated rate limits and pricing.

We also optionally support a bring-your-own-API-key model, allowing you to sign up and manage the relationship with the third-party provider yourself. This option is only available for single-tenant, customer-managed tenant, and on-premise deployments.

Open-Source LLM Inference Providers

We also support a number of inference platforms for open-source models. You can use any model they serve via Unique. As with proprietary providers, we offer both Unique-managed accounts and bring-your-own-API-key options.

How To Set Up Spaces With Different LLMs

When configuring a Unique AI space, choose an AI model from the dropdown menu. For specific models listed in the following Model Availability Matrix, go to Language Model under Advanced Settings. Refer to the first column of the Model Availability Matrix and copy the corresponding Model Name into the LanguageModelName text field.

info

Additional configuration details and exceptions (e.g., for open‑source models) are available here: Unique AI Space

Please contact our support team or refer to your contract to confirm which models are activated for your deployment.


Model Availability Matrix

Legend for support in UniqueAI and Translation:
βšͺ Not yet validated

πŸ”΄ Not suited

🟑 Functionally working - No quality support

🟒 Passed extensive Quality testing

Azure Deployment

Unique Enum

LanguageModelName

(to be used in configurations)

Model

Version / Release Date

Retirement Date

Availability in Unique (release)

Supported Models for Unique AI Chat

Supported Models for Translation

Standard CH
Quota in
[tokens/min]

Standard SWE
Quota in [tokens/min]

Data Zone Standard EU

Quota in
[tokens/min]

Available Global Standard

Quota in
[tokens/min]

Additional information

AZURE_GPT_4o_2024_0513

gpt-4o

2024-05-13

2026-10-01

before
2025.12

🟑 (use AZURE_GPT_4o_2024_1120 as replacement)

🟑

❌

βœ…

500k

βœ…

βœ…

AZURE_GPT_4o_2024_0806

gpt-4o

2024-08-06

2026-10-01

before
2025.12

🟑 (use AZURE_GPT_4o_2024_1120 as replacement)

🟑

❌

βœ…

200k

βœ…

10M

βœ…

AZURE_GPT_4o_2024_1120

gpt-4o

2024-11-20

2026-10-01

2025.18

🟒

🟒

βœ…

1M

βœ…

βœ…

10M

βœ…

AZURE_GPT_4o_MINI_2024_0718

gpt-4o-mini

2024-07-18

2026-10-01

before
2025.12

πŸ”΄

🟑

❌

βœ…

700k

βœ…

20M

βœ…

AZURE_o1_2024_1217

o1

2024-12-17

2026-07-15

2025.14

🟑

🟑

❌

βœ…

βœ…

6M

βœ…

2.5M

  • System prompt is not allowed.

  • temperature and topP are hardcoded to 1

AZURE_o3_MINI_2025_0131

o3-mini

2025-01-31

2026-08-02

2025.14

πŸ”΄

🟑

❌

❌

βœ…

20M

βœ…

2.5M

  • System prompt is not allowed.

  • temperature and topP are hardcoded to 1

AZURE_GPT_41_2025_0414

gpt-4.1

2025-04-14

2026-10-14

2025.14

🟒

🟑

❌

❌

βœ…

2M

βœ…

5M

AZURE_GPT_41_MINI_2025_0414

gpt-4.1-mini

2025-04-14

2026-10-14

2025.24

βšͺ

🟑

❌

❌

βœ…

βœ…

5M

AZURE_GPT_41_NANO_2025_0414

gpt-4.1-nano

2025-04-14

2026-10-14

2025.24

βšͺ

🟑

❌

❌

βœ…

2M

βœ…

5M

AZURE_o3_2025_0416

o3

2025-04-16

2026-10-16

2025.20

🟑

🟑

❌

❌

βœ…

3M

βœ…

5M

AZURE_o4_MINI_2025_0416

o4-mini

2025-04-16

2026-10-16

2025.20

🟑

set temperature to 1.0 in configs

🟑

❌

❌

βœ…

3M

βœ…

5M

AZURE_GPT_5_2025_0807

gpt-5

2025-08-07

2027-02-06

2025.34

🟒

🟒

❌

❌

βœ…
3M

βœ…
10M

❗ Needs request via Dynamics 365 Customer Voice

AZURE_GPT_5_MINI_2025_0807

gpt-5-mini

2025-08-07

2027-02-06

2025.34

🟑

🟑

❌

❌

βœ…
3M

βœ…
10M

AZURE_GPT_5_NANO_2025_0807

gpt-5-nano

2025-08-07

2027-02-06

2025.34

🟑

🟑

❌

❌

βœ…

50M

βœ…

150M

AZURE_GPT_5_CHAT_2025_0807

gpt-5-chat

2025-08-07

2026-06-22

2025.34

πŸ”΄

🟑

❌

❌

❌

βœ…
5M

AZURE_GPT_51_2025_1113

gpt-5.1

2025-11-13

2027-05-25

2025.44

🟒

🟑

❌

❌

βœ…

βœ…

AZURE_GPT_52_2025_1211

gpt-5.2

2025-12-11

2026-12-12

2025.52

🟒

🟑

❌

❌

❌

βœ…

5M

AZURE_GPT_54_2026_0305

gpt-5.4

2026-03-05

2027-03-03

2026.12

🟒

🟑

❌

❌

βœ…

βœ…

AZURE_GPT_54_PRO_2026_0305

gpt-5.4-pro

2026-03-05

2027-03-05

2026.12

πŸ”΄

πŸ”΄

❌

❌

❌

βœ…

AZURE_GPT_55_2026_0424

gpt-5.5

2026-04-24

2027-04-23

2026.18

🟒

🟑

❌

❌

βœ…

βœ…

LiteLLM Deployment

Unique Enum

LanguageModelName

(to be used in configurations)

Provider

Model

Version / Release Date

Retirement Date

Availability in Unique (release)

Supported Models for Unique AI Chat

Supported Models for Translation

litellm:anthropic-claude-haiku-4-5

Anthropic

claude-haiku-4-5

20251001

Not sooner than October 15, 2026

2025.44

To be tested

To be tested

litellm:anthropic-claude-opus-4-1

Anthropic

claude-opus-4-1

20250805

Not sooner than August 5, 2026

2025.40

🟑

🟑

litellm:anthropic-claude-opus-4-5

Anthropic

claude-opus-4-5

20251101

Not sooner than November 24, 2026

2025.50

🟑

🟑

litellm:anthropic-claude-opus-4-6

Anthropic

claude-opus-4-6

Not sooner than February 5, 2027

2026.08

🟒

🟑

litellm:anthropic-claude-opus-4-7

Anthropic

claude-opus-4-7

Not sooner than April 16, 2027

2026.18

🟒

🟑

litellm:anthropic-claude-opus-4-8

Anthropic

claude-opus-4-8

Not sooner than May 28, 2027

2026.24

🟑

🟑

litellm:anthropic-claude-sonnet-4-5

Anthropic

claude-sonnet-4-5

20250929

Not sooner than September 29, 2026

2025.40

🟑

🟑

litellm:anthropic-claude-sonnet-4-6

Anthropic

claude-sonnet-4-6

Not sooner than February 17, 2026

2026.10

🟑

🟑

litellm:gemini-2-5-flash

Gemini

gemini-2-5-flash

June 17, 2025

October 16, 2026

2025.20

🟑

🟑

litellm:gemini-2-5-flash-lite

Gemini

gemini-2-5-flash-lite

June 17, 2025

October 16, 2026

2025.46

🟑

🟑

litellm:gemini-2-5-pro

Gemini

gemini-2-5-pro

June 17, 2025

October 16, 2026

2025.28

🟒

🟑

litellm:gemini-3-1-pro-preview

Gemini

gemini-3-1-pro-preview

February 19, 2026

n/a

2026.10

🟑

🟑

litellm:gemini-3-flash-preview

Gemini

gemini-3-flash-preview

December, 2025

n/a

2025.52

🟒

🟑

litellm:openai-gpt-4-1-mini

OpenAI

gpt-4-1-mini

2025-04-14

2026-10-14

2025.20

🟑

πŸ”΄

litellm:openai-gpt-4-1-nano

OpenAI

gpt-4-1-nano

2025-04-14

2026-10-14

2025.20

πŸ”΄

πŸ”΄

litellm:openai-gpt-5

OpenAI

gpt-5

2025-08-07

No earlier than August 7, 2026

2025.34

🟑 Use Azure deployment

πŸ”΄

litellm:openai-gpt-5-chat

OpenAI

gpt-5-chat

2025-08-07

2026-03-01

2025.34

πŸ”΄ tool calls not allowed

πŸ”΄

litellm:openai-gpt-5-mini

OpenAI

gpt-5-mini

2025-08-07

2027-02-06

2025.34

🟑 Use Azure deployment

🟑

litellm:openai-gpt-5-nano

OpenAI

gpt-5-nano

2025-08-07

2027-02-06

2025.34

🟑 Use Azure deployment

🟑

litellm:openai-gpt-5-2

OpenAI

gpt-5-2

2025-12-11

No earlier than December 11, 2026

2025.52

🟒 Use Azure deployment

🟑

litellm:openai-gpt-5-4

OpenAI

gpt-5-4

2026-03-05

No earlier than March 5, 2027

2026.12

🟒 Use Azure deployment

🟑

litellm:openai-gpt-5-4-pro

OpenAI

gpt-5-4-pro

2026-03-05

No earlier than March 5, 2027

2026.12

πŸ”΄

🟑

litellm:openai-gpt-5-5

OpenAI

gpt-5-5

2026-04-24

No earlier than April 24, 2027

2026.18

πŸ”΄Use Azure deployment

🟑

litellm:openai-o1

OpenAI

o1

2024-12-17

No earlier than July 15, 2026

2025.20

🟑

🟑

litellm:openai-o3

OpenAI

o3

2025-04-16

No earlier than April 11, 2026

2025.20

🟑

🟑

litellm:openai-o3-pro

OpenAI

o3-pro

2025-06-10

No earlier than June 18, 2026

2025.26

πŸ”΄: Chat Completion API is not supported for o3-pro. Only response API.

πŸ”΄

litellm:openai-o4-mini

OpenAI

o4-mini

2025-04-16

No earlier than April 11, 2026

2025.20

🟑

🟑

litellm:deepseek-r1

TogetherAI

deepseek-r1

2025.20

πŸ”΄

  • no tool calls possible

🟑

litellm:deepseek-v3-1

TogetherAI

deepseek-v3-1

2025.36

πŸ”΄

  • no tool calls possible

🟑

litellm:llama-3-3-70b-instruct-turbo

TogetherAI

llama-3-3-70b-instruct-turbo

2025.20

πŸ”΄

  • no forced tool calls possible

  • web search creates a string instead of a tool call

🟑

litellm:qwen-3-235B-A22B

TogetherAI

qwen-3-235B-A22B

2025.36

🟑

🟑

litellm:qwen-3-235B-A22B-thinking

TogetherAI

qwen-3-235B-A22B-thinking

2025.36

πŸ”΄

  • no tool calls possible

🟑

litellm:grok-4-1-fast-non-reasoning

xAI

grok-4-1-fast-non-reasoning

2025-11-19

2026.08

🟑

🟑

litellm:grok-4-1-fast-reasoning

xAI

grok-4-1-fast-reasoning

2025-11-19

2026.08

🟑

🟑

litellm:vertex-claude-opus-4-6

Vertex AI

claude-opus-4-6

Not sooner than February 5, 2027

2026.20

🟒

🟑

litellm:vertex-claude-opus-4-7

Vertex AI

claude-opus-4-7

Not sooner than April 16, 2027

2026.20

🟒

🟑

litellm:vertex-claude-sonnet-4-6

Vertex AI

claude-sonnet-4-6

Not sooner than February 17, 2026

2026.20

🟒

🟑

Retired Models

Unique Enum

LanguageModelName

(to be used in configurations)

Model

Version

Retirement

Replacement

AZURE_GPT_4_0613

gpt-4

0613

June 6, 2025

AZURE_GPT_4o_2024_1120

AZURE_GPT_4_32K_0613

gpt-4-32k

0613

June 6, 2025

AZURE_GPT_4o_2024_1120

AZURE_GPT_4_TURBO_2024_0409

gpt-4-turbo

2024-04-09

June 6, 2025

AZURE_GPT_4o_2024_1120

litellm:gemini-2-5-flash-preview-04-17

gemini-2-5-flash-preview-04-17

2025-04-17

July, 2025

litellm:gemini-2-5-flash

litellm:gemini-2.5-pro-preview-06-05

gemini-2.5-pro-preview-06-05

2025-06-05

December, 2 2025

litellm:gemini-2.5-pro

AZURE_GPT_35_TURBO_0125

gpt-35-turbo

0125

November 11, 2025

AZURE_GPT_4o_2024_1120

AZURE_o1_MINI_2024_0912

o1-mini-2024-09-12

2024-09-12

November 17, 2025

AZURE_o1_MINI_2024_0912

litellm:anthropic-claude-3-7-sonnet

claude-3-7-sonnet

20250219

February 19, 2026

litellm:anthropic-claude-sonnet-4-5

litellm:anthropic-claude-3-7-sonnet-thinking

claude-3-7-sonnet-thinking

20250219

February 19, 2026

litellm:anthropic-claude-sonnet-4-5

litellm:gemini-3-pro-preview

gemini-3-pro-preview

March 9, 2026

gemini-3.1-pro-preview

litellm:anthropic-claude-opus-4

claude-opus-4

20250514

June 15, 2026

litellm:anthropic-claude-opus-4-6

litellm:anthropic-claude-sonnet-4

claude-sonnet-4

20250514

June 15, 2026

litellm:anthropic-claude-sonnet-4-6

Notes

  • Global Models: These models perform inference (data processing) across all regions provided by Microsoft, including the US. This is the default option for US tenants.

  • Standard SWE Models: Available exclusively in Sweden Central. This option applies when customers opt for data processing solely within Europe.

  • Data zone Standard Models: Data is processed somewhere in Europe (and not anymore only in Sweden). This option applies when customers opt for data processing solely within Europe.

  • Standard CH Models: Designed specifically for Switzerland. This option applies when customers choose to have data processing only within Switzerland.

  • Request-Based Models: Certain models require a request and approval from Microsoft. (Refer to the Request Table below for details.)

  • PTU (Prepaid Through Unit): Some customers have purchased PTUs and maintain direct communication with Microsoft. However, they likely also follow a pay-as-you-go strategy, making this page relevant to them as well.

  • Escalations to Microsoft: If a model is not available, customers should escalate the request directly to Microsoft via the email contact they have from MS. The customer should submit a request specifying the model needed.

Details about model retirement

Change Logs

Date

Change Summary

Retirement of litellm:anthropic-claude-opus-4 and litellm:anthropic-claude-sonnet-4

Claude models provided via Vertex AI added

The following models are available with DataZoneStandard via Azure:

  • AZURE_GPT_54_2026_0305

  • AZURE_GPT_55_2026_0424

New models:

  • AZURE_GPT_55_2026_0424

  • litellm:openai-gpt-5-5

New models:

  • litellm:anthropic-claude-opus-4-7

New models added:

  • AZURE_GPT_54_2026_0305

  • AZURE_GPT_54_PRO_2026_0305

  • litellm:openai-gpt-5-4

  • litellm:openai-gpt-5-4-thinking

New Gemini model added with release 2026.10

  • litellm:gemini-3-1-pro-preview

Retirement of litellm:anthropic-claude-3-7-sonnet and litellm:anthropic-claude-3-7-sonnet-thinking

New xAI model added with release 2026.08

  • litellm:grok-4-1-fast-non-reasoning

  • litellm:grok-4-1-fast-reasoning

New Anthropic model added with release 2026.08

  • litellm:anthropic-claude-opus-4-6

Update retirement dates OpenAI & Gemini models

Updating retirement date and replacement model for litellm:anthropic-claude-3-7-sonnet and litellm:anthropic-claude-3-7-sonnet-thinking

New models added:

  • AZURE_GPT_52_2025_1211

  • litellm:gemini-3-flash-preview

Both of them are supported by UniqueAI with release 2025.52

New OpenAI model added

  • litellm:openai-gpt-5-2

Gemini 3 Pro works with UniqueAI using the updated version of LiteLLM

New Anthropic model added with release 2025.50

  • litellm:anthropic-claude-opus-4-5

AZURE_GPT_51_2025_1113 is quality tested and green-lighted for Unique AI chat.

New Gemini model added with release 2025.48

  • litellm:gemini-3-pro-preview

Retired model:

  • litellm:gemini-2.5-pro-preview-06-05

New Gemini model added with release 2025.46

  • litellm:gemini-2-5-flash-lite

Retirement date of anthropic-claude-3-7-sonnet and anthropic-claude-3-7-sonnet-thinkingchanged to February 19, 2026

Update retirement date of gemini-2-5-flash-lite-preview-06-17 and gemini-2-5-flash-preview-05-20 to November 18, 2025

claude-haiku-4-5-20251001 added

No changes

claude-sonnet-4-5 and claude-opus-4-1 added

Several OpenAI models retire by

Last updated