Broadcast Date: August 21, 2019

Level: 200

Companies are increasingly building data lakes in order to apply modern analytic techniques to what was previously siloed data sources. However, to create a useful data lake the data needs to be transformed, partitioned, and cataloged so that different data consumers are able to draw useful insights from data that has been optimized for analytics. In this tech talk, learn about best practices for transforming your data so that it is optimized for analytics, including partitioning strategies and the use of columnar based file formats. We'll also talk about the different AWS services that can be used to catalog, transform, and analyze data in the data lake.

Learning Objectives

  • Learn about best practices for partitioning your data to optimize for analytics
  • Understand the benefits of using columnar based file formats for your analytics
  • Learn about AWS services that can be used to catalog, transform, and analyze data in your data lake

Who Should Attend?

Data Engineers, Data Scientists, Business Analytics, and anyone working with Data Lakes

Speakers

  • Gareth Eagar, Solutions Architect, AWS


Learn More

To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/lake-formation/

Intro body copy here about 2018 re:Invent launches.

Download the Slide Deck

Compute

Service How To

December 19th, 2018 | 1:00 PM PT

Developing Deep Learning Models for Computer Vision with
Amazon EC2 P3 Instances.

Register Now>

Containers

What's New / Cloud Innovation

December 11th, 2018 | 1:00 PM PT

EMBARGOED

Register Now>

Data Lakes & Analytics

Webinar 1:

What's New / Cloud Innovation

December 10th, 2018 | 11:00 AM PT

EMBARGOED

Register Now>

Webinar 2:

What's New / Cloud Innovation

December 12th, 2018 | 11:00 AM PT

EMBARGOED

Register Now>