sarika-subram/serverless-datalake-on-aws

 
 


Introduction to EC2, S3, and Datalake and Analytics on AWS

Learning outcomes from this workshop

  • Launch an EC2 instance
  • Create S3 buckets and work with their properties
  • Design a serverless data lake architecture
  • Build a data processing pipeline and data lake using Amazon S3 for storage
  • Use Amazon Kinesis for real-time streaming data
  • Use AWS Glue to automatically catalog datasets
  • Query data using Amazon Athena and visualize it using Amazon QuickSight
  • Use TensorFlow on SageMaker

Pre-requisites:

  • You need access to an AWS account with AdministratorAccess
  • This lab should be executed in the us-east-1 region
  • It is best to follow the links from this guide and open them in a new tab
  • Run this lab in a modern browser
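
The account and region prerequisites above can be checked from a notebook or terminal before starting. The following is a minimal sketch, assuming boto3 is installed and AWS credentials are already configured; the helper names `region_ok` and `describe_session` are hypothetical and not part of the workshop material.

```python
REQUIRED_REGION = "us-east-1"  # region named in the prerequisites


def region_ok(region):
    """Return True when the given region matches the one the lab expects."""
    return region == REQUIRED_REGION


def describe_session():
    """Return (account_id, region) for the default boto3 session.

    Assumption: boto3 is installed and AWS credentials are configured.
    """
    import boto3  # imported lazily so region_ok works without boto3

    session = boto3.session.Session()
    # Fall back to the lab region if the session has no default region set.
    sts = session.client("sts", region_name=session.region_name or REQUIRED_REGION)
    identity = sts.get_caller_identity()
    return identity["Account"], session.region_name
```

Calling `describe_session()` returns the account ID and the session's default region; passing that region to `region_ok` shows whether the session needs to be switched to us-east-1. The sketch does not verify that the caller has AdministratorAccess.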

Syllabus

Content | Link
Module 1: EC2 & S3 | Open Lab ▶️
Module 2: Data Analytics | Open Lab ▶️
Module 3: TensorFlow on SageMaker | Open Lab ▶️

Clean Up

Make sure you bring down or delete all resources created as part of this lab. Failing to do so will incur AWS usage charges.

Resources to delete
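
Part of the clean-up can be scripted. The following is a minimal sketch covering only S3 buckets, assuming boto3 is installed, credentials are configured, and the lab's buckets share a name prefix. The prefix and the helper names `buckets_to_delete` and `delete_buckets` are hypothetical; other resources created in the lab (EC2 instances, Kinesis streams, Glue and SageMaker resources) still need to be removed separately.

```python
def buckets_to_delete(bucket_names, prefix):
    """Pick the bucket names that start with the given prefix."""
    return sorted(name for name in bucket_names if name.startswith(prefix))


def delete_buckets(prefix, dry_run=True):
    """Empty and delete every S3 bucket whose name starts with `prefix`.

    Assumption: boto3 is installed and AWS credentials are configured.
    With dry_run=True nothing is deleted; the matching names are only returned.
    """
    import boto3  # lazy import so the name filter works without boto3

    s3 = boto3.resource("s3")
    names = buckets_to_delete((b.name for b in s3.buckets.all()), prefix)
    if dry_run:
        return names
    for name in names:
        bucket = s3.Bucket(name)
        bucket.object_versions.delete()  # remove object versions and delete markers
        bucket.objects.all().delete()    # remove any objects still listed
        bucket.delete()                  # S3 only deletes empty buckets
    return names
```

Running `delete_buckets("my-lab-prefix-")` first with the default `dry_run=True` lists what would be removed; the prefix shown here is a placeholder. Deletion is irreversible, so review the list before calling it with `dry_run=False`.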

About

Forked from 2019 AWS summit workshop content
