☁️ SkyServe: Spot Instance AI Model Serving Across Clouds
About this audio content
Serving demanding AI models cost-effectively and reliably is challenging due to GPU expenses and service requirements. This paper introduces SpotHedge, a policy that intelligently uses discounted spot instances across different cloud regions to lower costs while maintaining high availability. The system built upon this policy, SkyServe, dynamically manages a mix of spot and on-demand replicas, proactively hedging against spot instance preemptions and unavailability. Evaluations show SkyServe significantly reduces costs and improves latency compared to existing solutions by diversifying resources and adapting to market conditions. This work demonstrates the feasibility of using spot instances for AI model serving without compromising service quality.
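
The hedging idea described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the SpotHedge policy as published or the SkyServe API: the `plan_replicas` function, the `Plan` type, the `extra` over-provisioning parameter, and the region names are all hypothetical. It assumes the policy does three things: spreads spot replicas across regions, skips regions that recently preempted instances, and covers any shortfall in ready spot replicas with on-demand replicas.

```python
from dataclasses import dataclass


@dataclass
class Plan:
    spot_by_region: dict[str, int]  # spot replicas to request in each region
    on_demand: int                  # on-demand replicas covering the shortfall


def plan_replicas(target, ready_spot, regions, preempted=frozenset(), extra=1):
    """Hypothetical planner: spread spot replicas, hedge with on-demand.

    target     -- number of replicas the service needs to be ready
    ready_spot -- number of spot replicas currently ready to serve
    regions    -- candidate regions, in order of preference
    preempted  -- regions that recently preempted spot instances
    extra      -- spot replicas requested beyond the target as a buffer
    """
    if not regions:
        raise ValueError("at least one region is required")

    # Prefer regions with no recent preemptions; if every region has
    # preempted recently, fall back to using all of them.
    candidates = [r for r in regions if r not in preempted] or list(regions)

    # Over-provision spot replicas and spread them round-robin so that a
    # single region's preemptions cannot take out the whole service.
    spot_by_region = {r: 0 for r in candidates}
    for i in range(target + extra):
        spot_by_region[candidates[i % len(candidates)]] += 1

    # Cover the gap between the target and the ready spot replicas with
    # on-demand instances; this drops to zero once enough spot is ready.
    on_demand = max(0, target - ready_spot)
    return Plan(spot_by_region, on_demand)


plan = plan_replicas(
    target=4,
    ready_spot=2,
    regions=["us-east", "us-west", "eu-west"],
    preempted={"us-west"},
    extra=2,
)
print(plan)
```

In this sketch the on-demand count shrinks as spot replicas become ready, which is one way to keep the more expensive instances as a temporary safety net rather than a permanent cost.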