Higher latencies for a subset of models in one cluster. Root cause identified and a fix is in progress

Incident Report for Baseten

Resolved

The incident is resolved.
Posted May 15, 2026 - 18:39 PDT

Monitoring

A fix is in place and we are monitoring the results.
Posted May 15, 2026 - 18:10 PDT

Identified

The issue has been identified and a fix is being implemented.
Posted May 15, 2026 - 18:02 PDT
This incident affected: Dedicated Inference.