Previous incidents

October 2024
No incidents reported
September 2024
Sep 26, 2024
2 incidents

Prediction serving degraded

Degraded

Resolved Sep 27 at 12:25am UTC

Upon further investigation we were unable to work through the backlog of predictions. Backlogged predictions have been cancelled. It will take some time before the prediction IDs report failed in the replicate web console.

Users may resubmit any of these cancelled predictions.

All predictions that have been submitted since the last update at Sep 26 2024 at 11:52pm UTC are unaffected by this cancellation of predictions.

2 previous updates

Website availability problems

Downtime

Resolved Sep 26 at 05:15pm UTC

Queues for predictions remain fairly high for black-forest-labs/flux-schnell and meta/meta-llama-3.1-405b-instruct. All other models should be behaving normally.

2 previous updates

Sep 13, 2024
1 incident

Website unavailable

Downtime

Resolved Sep 13 at 03:18pm UTC

Things have been running normally for at least the last 10 minutes. This incident was -- ironically -- triggered by work we're doing to improve the overall performance and reliability of our primary database. We apologise for the disruption.

3 previous updates

Sep 01, 2024
1 incident

Prediction Service Normal

Resolved Sep 01 at 07:41pm UTC

We were alerted to a potential issue with prediction serving. Upon investigation, one of our providers used to monitor is seeing an outage impacting some automated monitoring. We've taken steps to isolate the problematic monitors while our provider works to resolve the issue.

August 2024
Aug 28, 2024
1 incident

Predictions not running on A40s

Downtime

Resolved Aug 28 at 06:11am UTC

A40 workloads are running again. We're continuing to monitor and investigate the underlying cause.

1 previous update

Aug 21, 2024
1 incident

Streaming service degraded for A100s

Degraded

Resolved Aug 21 at 11:04am UTC

We believe these problems have now been resolved. Please contact us if you are still seeing issues with streaming from Europe.

2 previous updates

Aug 09, 2024
1 incident

A40s degraded

Degraded

Resolved Aug 09 at 03:58pm UTC

A40 behavior has been stable for some time now. All systems are green.

1 previous update