Best Practices for Transforming and Analyzing Data in Your Data Lake

Broadcast Date: August 21, 2019

Level: 200

Companies are increasingly building data lakes to apply modern analytic techniques to what were previously siloed data sources. To make a data lake useful, however, the data needs to be transformed, partitioned, and cataloged so that different data consumers can draw insights from data that has been optimized for analytics. In this tech talk, learn best practices for transforming your data so that it is optimized for analytics, including partitioning strategies and the use of columnar file formats. We'll also cover the AWS services that can be used to catalog, transform, and analyze data in the data lake.
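To make the partitioning idea concrete, here is a minimal Python sketch of a Hive-style `key=value` layout partitioned by date. It is an illustration under stated assumptions, not material from the talk: the `sales` table name, the `event_date` field, the sample records, and the helper functions are all hypothetical. It shows how a date-based prefix lets a query that filters on a single month skip every other partition.

```python
from collections import defaultdict
from datetime import date


def partition_prefix(table, event_date):
    """Build a Hive-style key=value prefix for one day of data."""
    return (
        f"{table}/year={event_date:%Y}/"
        f"month={event_date:%m}/day={event_date:%d}/"
    )


def group_by_partition(table, records):
    """Group records under the partition prefix their event date maps to."""
    partitions = defaultdict(list)
    for record in records:
        prefix = partition_prefix(table, record["event_date"])
        partitions[prefix].append(record)
    return dict(partitions)


def prune(partitions, prefix):
    """Keep only the partitions whose path starts with the given prefix."""
    return {k: v for k, v in partitions.items() if k.startswith(prefix)}


# Hypothetical sample data, for illustration only.
records = [
    {"order_id": 1, "event_date": date(2019, 8, 20)},
    {"order_id": 2, "event_date": date(2019, 8, 21)},
    {"order_id": 3, "event_date": date(2019, 7, 2)},
]

partitions = group_by_partition("sales", records)

# A filter on August 2019 only needs to touch two of the three partitions.
august = prune(partitions, "sales/year=2019/month=08/")
for path in sorted(august):
    print(path, len(august[path]))
```

Which columns to partition on depends on how the data is queried. Date parts are a common choice when most queries filter on a time range, but that is an assumption about the workload rather than a universal rule.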

Learning Objectives

Learn about best practices for partitioning your data to optimize for analytics
Understand the benefits of using columnar file formats for your analytics
Learn about AWS services that can be used to catalog, transform, and analyze data in your data lake

Who Should Attend?

Data Engineers, Data Scientists, Business Analysts, and anyone working with data lakes

Speakers

Gareth Eagar, Solutions Architect, AWS

Learn More

To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/lake-formation/


Download the Slide Deck
