SlideShare a Scribd company logo
Data HubsData Hubs
Architectural Patterns & Security BestArchitectural Patterns & Security Best
Practices for Data-Intensive AWSPractices for Data-Intensive AWS
PlatformsPlatforms
Martijn van DongenMartijn van Dongen
AWS Cloud Evangelist | Founder AWSug.nl | AWS APN AmbassadorAWS Cloud Evangelist | Founder AWSug.nl | AWS APN Ambassador
1
Hands-on AWSHands-on AWS
WorkshopsWorkshops
instruqt.cominstruqt.com
2
Doing "Consultancy"Doing "Consultancy"
Currently @ PathéCurrently @ Pathé
3
Organizing MeetupsOrganizing Meetups
AWSug.nlAWSug.nl
4
SpeakingSpeaking
5
Data HubsData Hubs
Architectural Patterns & Security BestArchitectural Patterns & Security Best
Practices for Data-Intensive AWSPractices for Data-Intensive AWS
PlatformsPlatforms
Martijn van DongenMartijn van Dongen
AWS Cloud Evangelist | Founder AWSug.nl | AWS APN AmbassadorAWS Cloud Evangelist | Founder AWSug.nl | AWS APN Ambassador
6
Well Known Analytics & ML ServicesWell Known Analytics & ML Services
Athena EMR QuickSight GlueLake Formation
Sagemaker Forecast Personalize DeepRacer Rekognition
S3
IAM
7
But, how about these...?But, how about these...?
EFS
KMS API Gateway
RDSDMS
VPC VPN Transit Gateway
ECS Lambda
8
High Level OverviewHigh Level Overview
AWS Cloud
CRM
ERP
Database
Migration
Service
S3 Data Lake
Sagemaker
Athena
API Gateway
Quicksight Analysts
Apps
9
So, AWS Lake Formation?So, AWS Lake Formation?
10
PermissionsPermissions
AWS Account
Key Management Service
Users
S3
Organizations
Service Control Policies
Key Policies
Identity-Based Policies
Bucket Policies
Resource-
Based
Policies
11
Service Control Policies (SCP)Service Control Policies (SCP)
Statement:
- Sid: "DenyAllOutsideEU"
Effect: Deny
NotAction:
- "s3:*"
- "rds:*"
- "ec2:*"
# ...
Resource: "*"
Condition:
StringNotEquals:
"aws:RequestedRegion":
- "eu-west-1"
- "eu-central-1"
12
Identity-Based Policies (IAM)Identity-Based Policies (IAM)
Statement:
- Sid: "AllowUserSeeBuckets"
Effect: Allow
Action:
- "s3:ListAllMyBuckets"
Resource: "arn:aws:s3:::*"
- Sid: "AllowUserReadDataSet"
Effect: Allow
Action:
- "s3:Get*"
Resource:
- "arn:aws:s3:::examplebucket/dataset/*"
13
Identity-Based Policies (IAM)Identity-Based Policies (IAM)
Statement:
- Sid: "AllowUserReadDataSet"
Effect: Allow
Action:
- "s3:Get*"
Resource:
- "arn:aws:s3:::examplebucket/dataset/*"
Condition:
StringEquals: {
"s3:ResourceTag/PCI":
"False"
}
14
Resource-Based PoliciesResource-Based Policies
Statement:
- Sid: "DenyWhenNotEncrypted"
Effect: Deny
Principal: *
Action: "s3:PutObject"
Resource: "arn:aws:s3:::examplebucket/*"
Condition:
StringNotLikeIfExists: {
"s3:x-amz-server-side-encryption-aws-kms-key-id":
"arn:aws:kms:eu-west-1:111122223333:key/*"
}
15
NetworkingNetworking
AWS Cloud
Datacenter A
Datacenter B
CRM
ERP
Data Transit VPC
Dev VPC
Transit Gateway
DMS
Sagemaker
S3 Data Lake
16
Data IngestionData Ingestion
Source Database
Migration
Service
RDS
S3 Staging
Glue S3 Datalake
Internet
17
OverviewOverview
AWS Cloud
CRM
ERP
Database
Migration
Service
S3 Data Lake
Sagemaker
Athena
API Gateway
Quicksight Analysts
Apps
18
AnalyticsAnalytics
Crawl
S3 Datalake
Schema
Glue
Athena Quicksight Analysts
19
OverviewOverview
AWS Cloud
CRM
ERP
Database
Migration
Service
S3 Data Lake
Sagemaker
Athena
API Gateway
Quicksight Analysts
Apps
20
Sagemaker VPCSagemaker VPC
VPC
Public subnet Private subnet Data subnet
Elastic File SystemNAT Gateway
Internet
Sagemaker
VPC EndpointS3 Data Lake
21
Now publish that APINow publish that API
S3 Datalake
Sagemaker
Elastic
Container
Repository
Internet
Elastic
Container
Service
(ECS)
ECS for 
Kubernetes
API Gateway
Sagemaker
Inference
Lambda
22
Pathé Data HubPathé Data Hub
Diederik MeijerinkDiederik Meijerink
23

More Related Content

PDF
Move fast, build things with AWS (June 2016)
PDF
Busting the Myths to AWS Cloud Adoption_Liam Caskie
PDF
IoT at the Edge_Greengrass and More_ Craig Lawton_AWS
PDF
Building Dynamic Pipelines in Azure Data Factory (SQLSaturday Oslo)
PDF
Modern Data Platforms - Thinking Data Flywheel on the Cloud
PDF
A Public Sector Guide to AWS_ Avi Lewin
PDF
Cloudreach Voices - Azure Active Directory
PDF
The Economics of Innovation_Andrew Phillips_AWS
Move fast, build things with AWS (June 2016)
Busting the Myths to AWS Cloud Adoption_Liam Caskie
IoT at the Edge_Greengrass and More_ Craig Lawton_AWS
Building Dynamic Pipelines in Azure Data Factory (SQLSaturday Oslo)
Modern Data Platforms - Thinking Data Flywheel on the Cloud
A Public Sector Guide to AWS_ Avi Lewin
Cloudreach Voices - Azure Active Directory
The Economics of Innovation_Andrew Phillips_AWS

Similar to GoDataFest 2019 Data Hubs (12)

PPTX
AWS Startup Day Bangalore: Being Well-Architected in the Cloud
PPTX
2016 Utah Cloud Summit: Architecting on AWS - Best Practices
PDF
Denver AWS Meetup - March 2019 slides
PDF
Well Architected Framework Presentation @ TU Delft
PPTX
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned
PDF
AWS Meetup - Well-architected Framework (31012017)
PPTX
Getting to 1.5M Ads/sec: How DataXu manages Big Data
PDF
Denver AWS Meetup - February 2019
PPTX
Cloud platforms - Cloud Computing
PPTX
AWS Skills for the Modern IT Professional.
PDF
Making it easy to work with data
PPTX
Being Well Architected in the Cloud (Updated)
AWS Startup Day Bangalore: Being Well-Architected in the Cloud
2016 Utah Cloud Summit: Architecting on AWS - Best Practices
Denver AWS Meetup - March 2019 slides
Well Architected Framework Presentation @ TU Delft
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned
AWS Meetup - Well-architected Framework (31012017)
Getting to 1.5M Ads/sec: How DataXu manages Big Data
Denver AWS Meetup - February 2019
Cloud platforms - Cloud Computing
AWS Skills for the Modern IT Professional.
Making it easy to work with data
Being Well Architected in the Cloud (Updated)
Ad

Recently uploaded (20)

PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Big Data Technologies - Introduction.pptx
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Unlocking AI with Model Context Protocol (MCP)
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Spectral efficient network and resource selection model in 5G networks
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Review of recent advances in non-invasive hemoglobin estimation
Reach Out and Touch Someone: Haptics and Empathic Computing
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Empathic Computing: Creating Shared Understanding
Chapter 3 Spatial Domain Image Processing.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Big Data Technologies - Introduction.pptx
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Encapsulation_ Review paper, used for researhc scholars
Dropbox Q2 2025 Financial Results & Investor Presentation
Agricultural_Statistics_at_a_Glance_2022_0.pdf
MYSQL Presentation for SQL database connectivity
Network Security Unit 5.pdf for BCA BBA.
Unlocking AI with Model Context Protocol (MCP)
The AUB Centre for AI in Media Proposal.docx
Electronic commerce courselecture one. Pdf
NewMind AI Monthly Chronicles - July 2025
Spectral efficient network and resource selection model in 5G networks
“AI and Expert System Decision Support & Business Intelligence Systems”
Review of recent advances in non-invasive hemoglobin estimation
Ad

GoDataFest 2019 Data Hubs