Build ETL Processes for Data Lakes with AWS Glue

Broadcast Date: June 17, 2019

Level: 200

Every data lake initiative begins with setting up extract, transform, and load (ETL) processes that move data from various sources into a central data repository. In this tech talk, we will show how you can use AWS Glue to build, automate, and manage ETL jobs on a scalable, serverless Apache Spark platform. We will also show how to run Python shell jobs in addition to Spark jobs.
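
To make the two job types concrete, here is a minimal sketch of how a Spark job and a Python shell job might be described before being registered with AWS Glue. It is an illustration, not material from the talk: the helper name `glue_job_definition`, the job names, the role ARN, and the S3 script paths are all hypothetical. The only Glue-specific values it relies on are the command names `glueetl` (Spark) and `pythonshell` (Python shell), which are the values accepted by the `create_job` API.

```python
def glue_job_definition(name, role_arn, script_s3_path, job_type="spark"):
    """Build keyword arguments shaped for the Glue create_job API call.

    This helper is hypothetical; it only assembles a dictionary and
    does not contact AWS.
    """
    if job_type == "spark":
        command_name = "glueetl"      # serverless Apache Spark ETL job
    elif job_type == "python_shell":
        command_name = "pythonshell"  # lightweight Python shell job
    else:
        raise ValueError(f"unknown job_type: {job_type!r}")

    return {
        "Name": name,
        "Role": role_arn,
        "Command": {
            "Name": command_name,
            "ScriptLocation": script_s3_path,
        },
    }


# Placeholder values, assumed for illustration only.
EXAMPLE_ROLE = "arn:aws:iam::123456789012:role/ExampleGlueRole"

spark_job = glue_job_definition(
    "example-spark-etl",
    EXAMPLE_ROLE,
    "s3://example-bucket/scripts/transform.py",
)
shell_job = glue_job_definition(
    "example-python-shell",
    EXAMPLE_ROLE,
    "s3://example-bucket/scripts/small_task.py",
    job_type="python_shell",
)

print(spark_job["Command"]["Name"])
print(shell_job["Command"]["Name"])
```

With AWS credentials configured and the boto3 library installed, a dictionary like this could be passed along as `boto3.client("glue").create_job(**spark_job)`. Keeping the definition as plain data makes it easy to review the difference between the two job types before anything is created.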

Learning Objectives

Learn about building a data lake on AWS
Discover how to create ETL processes using AWS Glue
Understand how serverless Spark and Python jobs reduce costs

Who Should Attend?

Analysts, Developers, Data Scientists, Data Engineers, DBAs

Speakers

Raghu Prabhu, Sr. Business Development Manager, AWS

Learn More

To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/glue

Download the Slide Deck
