Broadcast Date: July 31, 2019

Level: 300

Kubernetes provides isolation, auto-scaling, load balancing, flexibility and GPU support. These features are critical to run computationally, data-intensive and hard to parallelize machine learning models. Declarative syntax of Kubernetes deployment descriptors make it easy for non-operationally focused engineers to easily train machine learning models on Kubernetes. In this tech talk, we will explain why and how Amazon EKS is well-suited for single and multi-node distributed training, training your models, and deploying your models in production. Specifically, we will show how to use KubeFlow and TensorFlow on Amazon EKS for your machine learning needs. We will also demonstrate how to setup machine learning pipelines, and visualization tools like TensorBoard for monitoring. We will also discuss distributed training using Horovod.

Learning Objectives

  • Learn why and how Amazon EKS is well-suited for single and multi-node distributed training
  • Learn how to train and deploy your models in production
  • See how to use KubeFlow and TensorFlow on Amazon EKS for your machine learning needs

Who Should Attend?

Developers, Architects, Infrastructure Engineers

Speakers

  • Arun Gupta, Principal OS Technologist, AWS


Learn More

To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/eks

Intro body copy here about 2018 re:Invent launches.

Download the Slide Deck

Compute

Service How To

December 19th, 2018 | 1:00 PM PT

Developing Deep Learning Models for Computer Vision with
Amazon EC2 P3 Instances.

Register Now>

Containers

What's New / Cloud Innovation

December 11th, 2018 | 1:00 PM PT

EMBARGOED

Register Now>

Data Lakes & Analytics

Webinar 1:

What's New / Cloud Innovation

December 10th, 2018 | 11:00 AM PT

EMBARGOED

Register Now>

Webinar 2:

What's New / Cloud Innovation

December 12th, 2018 | 11:00 AM PT

EMBARGOED

Register Now>