Broadcast Date: August 21, 2019
Level: 200
Companies are increasingly building data lakes in order to apply modern analytic techniques to what was previously siloed data sources. However, to create a useful data lake the data needs to be transformed, partitioned, and cataloged so that different data consumers are able to draw useful insights from data that has been optimized for analytics. In this tech talk, learn about best practices for transforming your data so that it is optimized for analytics, including partitioning strategies and the use of columnar based file formats. We'll also talk about the different AWS services that can be used to catalog, transform, and analyze data in the data lake.
Learning Objectives
- Learn about best practices for partitioning your data to optimize for analytics
- Understand the benefits of using columnar based file formats for your analytics
- Learn about AWS services that can be used to catalog, transform, and analyze data in your data lake
Who Should Attend?
Data Engineers, Data Scientists, Business Analytics, and anyone working with Data Lakes
Speakers
- Gareth Eagar, Solutions Architect, AWS
Learn More
To learn more about the services featured in this talk, please visit:
https://aws.amazon.com/lake-formation/
Intro body copy here about 2018 re:Invent launches.
Download the Slide Deck
Compute
Service How To
December 19th, 2018 | 1:00 PM PT
Developing Deep Learning Models for Computer Vision with
Amazon EC2 P3 Instances.
Data Lakes & Analytics
Webinar 1:
What's New / Cloud Innovation
December 10th, 2018 | 11:00 AM PT
EMBARGOED
Register Now>