
Broadcast Date: April 27, 2022
Level: 300
Amazon SageMaker Studio is the first fully integrated development environment (IDE) for machine learning (ML). It provides a single, web-based visual interface where you can perform all ML development steps required to prepare data, as well as to build, train, and deploy models. Analyzing, transforming, and preparing large amounts of data is a foundational step of any data science and ML workflow. Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto. In this talk, we will demonstrate recent integrations between the services making it really simple for Data Scientists and Machine Learning Engineers to use distributed big data frameworks such as Spark in their machine learning workflow

Learning Objectives
- How to use a unified notebook-centric experience to create and manage EMR clusters, run analytics on those clusters, and train and deploy SageMaker models
- How to use a one-click interface for debugging and monitoring Amazon EMR jobs through the Spark UI.
- How data workers can discover, connect, create, and stop clusters in a multi-account setup.

Who Should Attend?
TBD
Speakers
- Damon Cortesi, Principal developer advocate, AWS
- Sumedha Swamy, Principal Product Manager, AWS

Learn More
To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/emr/features/studio/
Intro body copy here about 2018 re:Invent launches.
Download the Slide Deck
Compute
Service How To
December 19th, 2018 | 1:00 PM PT
Developing Deep Learning Models for Computer Vision with
Amazon EC2 P3 Instances.
Data Lakes & Analytics
Webinar 1:
What's New / Cloud Innovation
December 10th, 2018 | 11:00 AM PT
EMBARGOED
Register Now>