Cloud Load Balancer, the secret to uptime for AI inference