Broadcast Date: On-Demand

Level: 300

In most deep learning applications, making predictions using a trained model with a process known as inference, can drive as much as 90% of the compute costs of the application. While Amazon Elastic Inference solves this problem by allowing you to attach just the right amount of GPU-powered inference acceleration to any EC2 and reduce costs on Inferencing, using EC2 Spot with Elastic Inference can further reduce your compute costs up to 90%. In this tech talk, we'll discuss cost optimization of Amazon Elastic Inference running with Amazon EC2 Spot instances and walk through the best practices by using Cloudformation and launch templates for build automation.

Learning Objectives

  • Optimize compute cost for Inference workloads
  • Demo and walk through the deployment of Elastic Inference with EC2 Spot instances
  • Best practices and recommendations on using Elastic Inference with EC2 Spot instances

Who Should Attend?

Developers, Software Engineers, DevOps Engineers, IT, Cloud Engineers, and users of Amazon API Gateway

Speakers

  • Chakra Nagarajan, EC2 Spot Solutions Architect, AWS


Learn More

To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/ec2/spot/

Download the Slide Deck