Back to overview

Prediction creation unavailable for L40S and H100 hardware

Feb 03 at 06:30am UTC
Affected services
API

Resolved
Feb 03 at 06:30am UTC

The cache used by the API for predictions was misconfigured for a period of ~20 minutes beginning at 20:34 UTC until a rollback completed at 20:56 UTC. Models using the L40S and H100 hardware types were affected. During the period of misconfiguration, prediction creation was severely limited, resulting in many API responses with status 503.