All Systems Operational
API ? Operational
90 days ago
100.0 % uptime
Today
Model Serving Operational
90 days ago
100.0 % uptime
Today
Web Application Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Past Incidents
Dec 11, 2023

No incidents reported today.

Dec 10, 2023

No incidents reported.

Dec 9, 2023

No incidents reported.

Dec 8, 2023
Resolved - This incident has been resolved. Start time for replicas using A100 GPUs is back to normal
Dec 8, 08:24 PST
Monitoring - A fix has been implemented and we are monitoring the results.
Dec 8, 08:01 PST
Update - We are rolling out a fix and are seeing improvement in A100 start times.
Dec 8, 07:08 PST
Identified - The issue has been identified and a fix is being implemented.
Dec 8, 06:46 PST
Dec 7, 2023

No incidents reported.

Dec 6, 2023

No incidents reported.

Dec 5, 2023

No incidents reported.

Dec 4, 2023

No incidents reported.

Dec 3, 2023

No incidents reported.

Dec 2, 2023

No incidents reported.

Dec 1, 2023
Resolved - For a period of 2 minutes, we experienced an elevated level of API errors due to a failure in an internal load balancer. Inference requests are retried but may have resulted in errors under high model load.
Dec 1, 12:04 PST
Nov 30, 2023

No incidents reported.

Nov 29, 2023

No incidents reported.

Nov 28, 2023

No incidents reported.

Nov 27, 2023

No incidents reported.