Transforming Data Lakes with Amazon S3 Select & Amazon Glacier Select


Broadcast Date: March 29, 2018

Level 300 | Service Deep Dive
Data lakes contain massive amounts of data that companies want to store more cost-effectively and query faster and more efficiently. Amazon S3 Select can increase analytics query performance by up to 400%, and Amazon Glacier Select makes it practical to extend queries to archive storage, significantly reducing data lake storage costs. In this webinar, we will demonstrate ways to accelerate analytics applications and extend your data lake to cost-effective archive storage by filtering and retrieving only a subset of the data in an S3 or Glacier object instead of retrieving the entire object. We'll discuss how to use these features with Amazon Athena, Amazon Redshift Spectrum, and third-party software, and we'll demonstrate a query on an S3-based data lake using a Presto connector.
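
To give a rough sense of what "retrieving only a subset of data" can look like in practice, here is a minimal Python sketch that uses the boto3 `select_object_content` call. It is not taken from the webinar: the bucket name, object key, column names, and the helper functions `build_select_request` and `run_select` are all hypothetical, and the object is assumed to be a CSV file whose first row is a header.

```python
def build_select_request(bucket, key, expression):
    """Build keyword arguments for the S3 client's select_object_content call.

    Hypothetical helper for illustration; assumes a CSV object with a header row.
    """
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": expression,
        # FileHeaderInfo=USE lets the SQL expression refer to columns by name.
        "InputSerialization": {"CSV": {"FileHeaderInfo": "USE"}},
        "OutputSerialization": {"CSV": {}},
    }


def run_select(request):
    """Send the request to S3 and return only the matching rows as text."""
    import boto3  # imported here so the request builder works without the SDK

    s3 = boto3.client("s3")
    response = s3.select_object_content(**request)
    chunks = []
    # The response payload is an event stream; Records events carry the data.
    for event in response["Payload"]:
        if "Records" in event:
            chunks.append(event["Records"]["Payload"])
    return b"".join(chunks).decode("utf-8")


# Hypothetical bucket, key, and column names, chosen only for this example.
request = build_select_request(
    "example-bucket",
    "logs/2018/03/requests.csv",
    "SELECT s.status, s.latency FROM S3Object s WHERE s.status = '500'",
)
```

Calling `run_select(request)` with valid AWS credentials would return only the rows and columns that match the expression, so the filtering happens in S3 rather than in the client after a full download.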

Learning Objectives:
• Define Amazon S3 Select and Amazon Glacier Select
• Understand the scenarios in which these features can help you increase performance and extend your data lake
• See a before & after scenario of a query with and without Amazon S3 Select

Suited For: Developers, IT Administrators, IT Leaders

Speaker(s): Rahul Bhartia, Principal Product Manager, Amazon S3; Rashim Gupta, Principal Product Manager, Amazon Glacier

