Reducing Machine Learning Inference Cost for PyTorch Models

Broadcast Date: 21 April, 2020

Level: 200

In deep learning applications, inference accounts for up to 90% of compute cost. To reduce this cost, you can use Amazon Elastic Inference, which lets you attach just the right amount of GPU-powered inference acceleration to any Amazon EC2 or Amazon SageMaker instance type, or to any Amazon ECS task. In this tech talk, you will learn how to use Elastic Inference to deploy models built with PyTorch, a popular machine learning framework.
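
To make the cost argument concrete, the arithmetic can be sketched as below. This is only an illustration: the function name `total_cost_after_savings`, the 1,000-unit bill, and the 50% reduction in inference cost are hypothetical placeholders, not figures from the talk or from AWS pricing. Only the 90% inference share comes from the description above.

```python
def total_cost_after_savings(total_cost, inference_share, inference_reduction):
    """Return total compute cost after cutting inference cost by a fraction.

    total_cost          -- current total compute cost (any currency unit)
    inference_share     -- fraction of total_cost spent on inference (0 to 1)
    inference_reduction -- fraction by which inference cost is reduced (0 to 1)
    """
    if not 0.0 <= inference_share <= 1.0:
        raise ValueError("inference_share must be between 0 and 1")
    if not 0.0 <= inference_reduction <= 1.0:
        raise ValueError("inference_reduction must be between 0 and 1")

    inference_cost = total_cost * inference_share
    other_cost = total_cost - inference_cost
    # Only the inference portion of the bill shrinks.
    return other_cost + inference_cost * (1.0 - inference_reduction)


if __name__ == "__main__":
    # Hypothetical inputs: a 1,000-unit bill, 90% spent on inference,
    # and an assumed 50% reduction in inference cost.
    new_total = total_cost_after_savings(1000.0, 0.90, 0.50)
    print(f"New total compute cost: {new_total:.2f}")
```

Because inference dominates the bill in this scenario, any reduction in inference cost carries through almost directly to the total.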

Learning Objectives

Get an overview of Amazon Elastic Inference
Learn how to use Elastic Inference to reduce costs and improve latency for your PyTorch models on Amazon SageMaker
See a demo of TorchScript with the Elastic Inference API

Who Should Attend?

Data Scientists, ML Developers, Researchers, and Data Engineers

Speakers

David Thomas, Software Development Engineer, AWS

Learn More

To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/machine-learning/elastic-inference/

Download the Slide Deck
