ON-DEMAND

How to Build a Data Lake in Amazon S3 & Amazon Glacier

Broadcast Date: February 1, 2018

Level 200 | Service How To

In this session, we discuss best practices for data ingestion, storage, cataloging, and analysis on Amazon object storage services. We examine ways to reduce or eliminate costly extract, transform, and load (ETL) processes using query-in-place technologies such as Amazon S3 Select, Amazon Glacier Select, Amazon Athena, and Amazon Redshift Spectrum. We also review custom analytics integration using Apache Spark, Apache Hive, Presto, and other technologies in Amazon EMR.

Learning Objectives:
• Understand the options for building an analytics platform that leverages Amazon S3 & Amazon Glacier
• Learn about the key considerations for ETL and other core analytics functions
• Determine whether query-in-place capabilities like Amazon S3 Select, Amazon Glacier Select, Amazon Athena, and Amazon Redshift Spectrum are a good fit for your use case

Suited For: Storage Administrators, Data Scientists, Analytics Professionals

Speaker(s): PD Dutta, Sr. Product Manager, Amazon S3, AWS

