Availability & Health Check Endpoints
5 min read
Unique AI provides a set of standardized service-availability and health-check endpoints designed to give clients real-time insights into the operational status of their Software-as-a-Service (SaaS) environment. These endpoints allow technical teams to confirm that core components are responding correctly.
By integrating these health-check endpoints into existing monitoring systems, clients can improve incident detection, reduce diagnostic time, and increase the reliability of integrations built on top of the Unique AI platform.
Purpose of This Document
This document describes the available endpoints, authentication requirements, expected responses, and recommended best practices for safely incorporating these checks into client-side monitoring workflows.
Who Can Use These Endpoints
Access to these endpoints for clients leveraging Unique SaaS is only available to Single Tenant clients.
For Self-Hosting clients the endpoints are naturally also available as described below but on their respective infrastructure.
Multi-Tenant SaaS clients cannot leverage the endpoints and they are protected by authentication.
Overview of Service Availability Endpoints
What Are Availability Endpoints?
Availability endpoints are lightweight URLs provided by Unique AI that allow clients to quickly verify whether their dedicated service instance is currently operational.
When to Use Them
Availability endpoints should be used whenever a client needs a quick and reliable way to confirm that their Unique AI service instance is reachable. They are ideal for basic uptime checks, automated monitoring systems, and alerts that notify teams when the service becomes unavailable.
Rate Limits & Fair-Use Guidelines
Availability endpoints are protected by web application firewalls configured with bot-protection rules, OWASP security standards, and automated traffic controls.
To ensure stability and security, clients should query these endpoints responsibly and avoid high-frequency or automated probing that resembles abusive traffic.
Excessive or suspicious request patterns may trigger temporary isolation by the firewall, which can also impact access to the service’s workloads. For this reason, clients are encouraged to follow fair-use practices and integrate the endpoints into monitoring systems with conservative polling intervals.
Authentication & Access Requirements
The availability endpoints do not require authentication and can be accessed directly. Adding authentication would defeat their purpose, as these endpoints are intended to provide a simple and fast way to confirm that a service instance is alive. They are instead protected by the previously mentioned web application firewalls, which enforce bot protection, OWASP rules, and potential IP-based blocking.
Clients who require authenticated or more advanced health-check mechanisms can consider upgrading to one of Unique AI’s self-hosting models, where such customization is fully supported.
Network Requirements (IP allowlisting, private endpoints, etc.)
Clients may optionally enable IP blocking or IP allowlisting to further restrict access to the availability endpoints. While this can enhance control, it may also affect business users and day-to-day operations if legitimate traffic is inadvertently blocked.
For this reason, IP-based restrictions should be applied carefully and only when there is a clear operational need.
Health Check Endpoints
Unique publishes a simple, best-effort CLI example to probe the endpoints.
The https://www.npmjs.com/package/@unique-ag/cli util:probe command can demonstratively assess the health of a given Unique tenant.
Endpoints
The endpoints always compose of:
Domain
{{domain}}: The domain your tenant is deployed to (always<tenant>.unique.appon Single Tenant environments) - Self hosting clients have their own domains.API Host
{{apiHost}}: The host where your API Gateway is deployed, eithergatewayorapi.
URL | Display Name | Component | Expected Status |
|---|---|---|---|
| Chat or Recording App | all | 200 |
| Recording API | recording | 200 |
| Microsoft Authentication API and Collector | recording | 200 |
| API Gateway | chat | 404 |
| Chat API | chat | 200 |
| Configuration API | chat | 200 |
| Event Socket API | chat | 200 |
| Ingestion / Knowledge Base API | chat | 200 |
| Scope Management API | chat | 200 |
| IDP Service | chat | 200 |
| Admin App | chat | 200 |
| Chat App | chat | 200 |
| Knowledge Upload App | chat | 200 |
| Theme App | chat | 200 |
Component
Note that not each component is needed/available for every client. Ensure you only monitor those that are available/deployed to your solution.
CLI Utility
While the utility is is helpful, ultimately the endpoint list defines what shall be monitored. You can see the latest endpoints here: https://github.com/Unique-AG/cli/blob/92642008a1b75f80315622fa49827962673db59b/examples/util-probe.schema.yaml
Monitoring Best Practices for Clients
Recommended Polling Frequency
For most use cases, a polling frequency of once every 30 to 60 seconds is sufficient to reliably detect availability issues without generating unnecessary traffic. Clients with less time-sensitive monitoring needs may choose longer intervals, while more aggressive polling is discouraged, as it may trigger firewall protections and provide little additional operational value.
Interpreting Response Data
Refer to Health Check Endpoints above.
Integrating with Monitoring Tools
When integrating availability endpoints with monitoring tools, clients should focus solely on the HTTP response code to determine service status. A successful response (typically 200 OK, can vary) indicates that the service is alive, while any other response code signals a potential issue. The endpoints do not provide detailed diagnostics or metrics, so monitoring should rely only on this simple status check.
Limits & Considerations
Cost
All requests to the availability endpoints generate ingress traffic on the application gateway, which may incur a minimal, marginal cost. While this cost is generally negligible, clients are advised to follow the recommended polling frequency of 30–60 seconds to avoid unnecessary traffic and ensure efficient use of resources.
What Endpoints Do Not Guarantee
These endpoints do not perform deep system diagnostics; instead, they return a simple response indicating whether the service is alive and reachable. This makes them ideal for basic uptime monitoring and integration with external monitoring tools, ensuring clients can immediately detect when their environment is online and available.
Security Considerations
No Sensitive Data Returned
The availability endpoints do not return any sensitive or client-specific data. Besides the HTTP response code indicating service status, the only additional information provided is the currently served application version. This allows clients to verify which version of the service is running without exposing any internal or confidential details.
Change Notifications for Clients
Additions to the endpoints can be identified through the Release Notes and Calendar, namely the Infrastructure Changesof an release.
Troubleshooting
Contacting Support
Refer to Enterprise Support