Broadcast Date: October 30, 2020

Level: 200

Many organizations like healthcare, financial services, public sector, and manufacturing generate and store large amounts of paper heavy documents to gather information from their patients, clients, and vendors. Those documents, whether scanned images, PDF’s, or scanned documents, contain critical business information your organization needs to perform essential tasks like extracting data from a mortgage application. Using advanced machine learning, Amazon Textract uses OCR technology identifying each character, word, and letter but also the contents of fields in forms and information stored in tables for scanned images, documents, and PDF’s. In this tech talk, you will learn how to use Amazon Textract to extract text and data from documents and use Amazon A2I to provide your workflow with human oversight.

Learning Objectives

  • Learn how Amazon Textract can help you with your toughest data extractions using Optical Character Recognition (OCR)
  • Learn how to extract text in forms and tables from all types of documents like W-2’s, insurance claims, and financial forms using Textract
  • Overcome the manual process of data entry with a quick setup of how to get started with Amazon Textract

Who Should Attend?



  • Sonali Sahu, Solutions Architect, AWS

Learn More

To learn more about the services featured in this talk, please visit:

Intro body copy here about 2018 re:Invent launches.

Download the Slide Deck


Service How To

December 19th, 2018 | 1:00 PM PT

Developing Deep Learning Models for Computer Vision with
Amazon EC2 P3 Instances.

Register Now>


What's New / Cloud Innovation

December 11th, 2018 | 1:00 PM PT


Register Now>

Data Lakes & Analytics

Webinar 1:

What's New / Cloud Innovation

December 10th, 2018 | 11:00 AM PT


Register Now>

Webinar 2:

What's New / Cloud Innovation

December 12th, 2018 | 11:00 AM PT


Register Now>