SlideShare a Scribd company logo
Amazon Web Services: Lessons for Architecting Data in the Cloud
Ask us anything!
Don Murray Stewart Harper
Connect
Transform
Automate
FME: By Safe Software
AWS Data Storage
AWS
Simple
Storage
(S3)
AWS RDS
/ Aurora
Easy to operate,
fault tolerant,
scalable Relational
Database Service
in the cloud.
Secure, durable,
highly-scalable
object storage
AWS S3 - Overview
Secure,
durable, highly-
scalable object
storage
99.999999999% reliability
3+ million requests per second
S3 is highly performant and
applications can get 100’s of
requests a second
$0.03 per GB of storage (5TB of
data $150 a month)
Can store single objects up to 5TB
AWS S3 - How it works
A simple
key/value
persistent
object store.
It is not a disk
and doesn’t have
a directory or
folder structure.
It can be
presented as a
folder structure
though.
Every object
is identified
with a key.
You have a large
amount of data on
premise.
When might you use S3?
You want to take
advantage of features
that are not available
on premises.
You are serving
large amounts of
data and are
having problems
scaling.
You want
programmatic
access to your data
You wish to build
an event driven
architecture.
AWS S3 - Tips & Tricks
Host a static website on S3 for next to nothing.
Every object in S3 is web addressable.
For even higher performance content delivery
use in combination with CloudFront.
Version your data automatically or transition to
Glacier using lifecycle management.
Provide time-limited access to objects via query
string authentication.
Easy to operate,
fault tolerant,
scalable Relational
Database Service
in the cloud.
AWS RDS - Overview
AWS Aurora (MySQL compatible)
Oracle
MariaDB
PostgreSQL (PostGIS)
Microsoft SQL Server
MySQLSupportedDataTypes
The code,
applications, and
tools you use today
with your existing
databases can be
used with Amazon
RDS.
The AWS RDS service lets you...
Deploy in minutes
Automate backups
Apply software patches automatically
Scale storage and performance with one click
Replicate to enhance availability and reliability
When might you use RDS?
If you don’t want to
worry about
infrastructure.
If you are looking
to run a production
database with minimal
administration,
perfectly suited for
a small team.
If you don’t want to
worry about scaling,
fault tolerance and
enhanced availability
as you grow.
On-Premises vs RDS PostGIS
Specification: 16 Cores, 122GB of RAM, Linux, 50TB Storage
RDS - Tips and Tricks
● You can’t always install all the libraries you want, so check this before you
migrate. For example urllib Python lib is not included.
● You don’t have SUPER privileges, which can make it impossible to perform
advanced operations against the DB.
Native Spatial Support Enterprise Geodatabase Support
AWS Aurora
Database engine runs on EC2 and is compatible with the Open Source MySQL,
logging and storage runs on Amazon’s storage layer. This is where most of the
gains have come from.
RDS vs Aurora
Aurora only supports the
InnoDB storage engine. That
rules out spatial index
support.
The gains are mainly
around fault tolerance
and performance. If you
aren’t running a high throughput
transactional database it might
not be worth migrating.
Aurora is only available on
larger instances (r3 family),
so it is more expensive if
you only need a small
database.
Cross region replication is not
supported by Aurora,
it is with RDS.
Other Key AWS Services
Simple Queueing Service
Lambda
Simple Notification Service
API Gateway
Event Driven Data
Processing
The event driven design pattern
Deploying an event-driven
architecture is
difficult and requires
custom development.
Lambda, Amazon’s new
compute service, coupled
with FME gives you an
extremely powerful set of
tools for the elastic
processing of hundreds of
datasets.
Amazon S3,
Amazon
Kinesis and
Amazon Dynamodb
trigger events.
Automate data processing pattern with AWS
Lambda, S3, SNS and SQS
Event Driven Processing Demo
Automatically process data when it is dropped into an S3
bucket, send an email once the data has finished
processing and place output in another S3 bucket.
Incoming
Data
S3 Bucket S3 Bucket
Pelmorex Example
Pelmorex Example
Data Delivery
AWS Architecture for Data Delivery
AWS perfectly suited for data delivery solutions:
Multiple
services for
storing large
volumes of data
cheaply and
securely.
Costs grows
with the
popularity of
your data
delivery service.
You can build
highly
performant
workflows with
the click of a
button.
Caching a snapshot of your data on S3
Real time Data Delivery
Architecture - State Arkansas Data Portal
Our rack space costs (real estate on data center
floor) are $3200 a month. Add to that the hardware
costs, etc and you can see why moving to the cloud
was a no brainer for us.
Projected Costs - State Arkansas Data Portal
On Premises AWS
Anthony Davis, State Arkansas
APIs - Next Generation of Data Delivery
● APIs allow businesses to build platforms that partners
and customers can use to access core business systems,
whenever they want, in a stable and secure way.
● We now have the tools to create a fully functioning
scalable API in less than a day, this is going to
revolutionize businesses and bring the next wave of API
value—creating internal disposable APIs.
Building an API with no infrastructure
or code
API Gateway - Demo
Building an API on top of a database in less than 5 minutes.
Free Training
Hands-on and instructor led.
FME Desktop Basic
● Nov 17-18
FME Server Authoring
● Dec 1-2
www.safe.com/training
AWS Doctor's Office
Coming soon - Safe’s AWS
experts on hand for 1H.
Q&A
Thank you!
Download a free trial of
FME at
www.safe.com

More Related Content

PDF
AWS RDS
PDF
Amazon Kinesis Data Streams
PPTX
Aws Atlanta meetup Amazon Athena
PPTX
AWS SSA Webinar 21 - Getting Started with Data lakes on AWS
PDF
AWS tutorial-Part5 to 10(Combined):Overview of various AWS services and offer...
PDF
AWS tutorial-Part58:AWS Cloud Database Products-1st Intro Session
PDF
AWS tutorial-Part59:AWS Cloud Database Products-2nd Intro Session
PDF
Redshift Spectrum & AWS Athena Deep Dive
AWS RDS
Amazon Kinesis Data Streams
Aws Atlanta meetup Amazon Athena
AWS SSA Webinar 21 - Getting Started with Data lakes on AWS
AWS tutorial-Part5 to 10(Combined):Overview of various AWS services and offer...
AWS tutorial-Part58:AWS Cloud Database Products-1st Intro Session
AWS tutorial-Part59:AWS Cloud Database Products-2nd Intro Session
Redshift Spectrum & AWS Athena Deep Dive

Similar to Amazon Web Services: Lessons for Architecting Data in the Cloud (20)

PDF
2017 AWS DB Day | AWS 데이터베이스 개요 - 나의 업무에 적합한 데이터베이스는?
PPTX
re:Invent re:Cap - Big Data & IoT at Any Scale
PDF
AWS Study Group - Chapter 09 - Storage Option [Solution Architect Associate G...
PDF
Transformation Track AWS Cloud Experience Argentina - Bases de Datos en AWS
PDF
Choosing the Right Database Service (김상필, 유타카 호시노) - AWS DB Day
PDF
Amazon Aurora (MySQL, Postgres)
PDF
Introduction to AWS Cloud Databases [Apr 2020]
PDF
Builders Day' - Databases on AWS: The Right Tool for The Right Job
PDF
Moving Quickly with Data Services in the Cloud
PDF
AWS Summit - Atlanta
PDF
Databases in the Cloud em Amazon Web Services
PPTX
amazon database
PDF
Module 1 - CP Datalake on AWS
PDF
Data Analytics on AWS
PDF
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
PDF
AWSomeDayOnline Q322_2. Introduction to AWS Services Compute, Storage, Databa...
PDF
AWS Cloud Experience CA: Bases de Datos en AWS: distintas necesidades, distin...
PPTX
AWS Cloud SAA Relational Database presentation
PDF
AWS Innovate: Build a Data Lake on AWS- Johnathon Meichtry
PDF
Building a Bigdata Architecture on AWS
2017 AWS DB Day | AWS 데이터베이스 개요 - 나의 업무에 적합한 데이터베이스는?
re:Invent re:Cap - Big Data & IoT at Any Scale
AWS Study Group - Chapter 09 - Storage Option [Solution Architect Associate G...
Transformation Track AWS Cloud Experience Argentina - Bases de Datos en AWS
Choosing the Right Database Service (김상필, 유타카 호시노) - AWS DB Day
Amazon Aurora (MySQL, Postgres)
Introduction to AWS Cloud Databases [Apr 2020]
Builders Day' - Databases on AWS: The Right Tool for The Right Job
Moving Quickly with Data Services in the Cloud
AWS Summit - Atlanta
Databases in the Cloud em Amazon Web Services
amazon database
Module 1 - CP Datalake on AWS
Data Analytics on AWS
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
AWSomeDayOnline Q322_2. Introduction to AWS Services Compute, Storage, Databa...
AWS Cloud Experience CA: Bases de Datos en AWS: distintas necesidades, distin...
AWS Cloud SAA Relational Database presentation
AWS Innovate: Build a Data Lake on AWS- Johnathon Meichtry
Building a Bigdata Architecture on AWS
Ad

More from Safe Software (20)

PDF
Getting Started with Data Integration: FME Form 101
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
PDF
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
PDF
Notification System for Construction Logistics Application
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
PDF
FME in Overdrive - Peak of Data & AI 2025
PDF
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
PDF
Pipeline Industry IoT - Real Time Data Monitoring
PDF
FME in Overdrive: Unleashing the Power of Parallel Processing
PDF
Fiber to the People! By Deutsche Telekom
PDF
Governing Geospatial Data at Scale: Optimizing ArcGIS Online with FME in Envi...
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
PDF
Introducing and Operating FME Flow for Kubernetes in a Large Enterprise: Expe...
PDF
5 Things to Consider When Deploying AI in Your Enterprise
Getting Started with Data Integration: FME Form 101
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
Notification System for Construction Logistics Application
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Transforming Utility Networks: Large-scale Data Migrations with FME
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
FME in Overdrive - Peak of Data & AI 2025
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Pipeline Industry IoT - Real Time Data Monitoring
FME in Overdrive: Unleashing the Power of Parallel Processing
Fiber to the People! By Deutsche Telekom
Governing Geospatial Data at Scale: Optimizing ArcGIS Online with FME in Envi...
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Introducing and Operating FME Flow for Kubernetes in a Large Enterprise: Expe...
5 Things to Consider When Deploying AI in Your Enterprise
Ad

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Approach and Philosophy of On baking technology
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPT
Teaching material agriculture food technology
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Empathic Computing: Creating Shared Understanding
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Spectroscopy.pptx food analysis technology
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Electronic commerce courselecture one. Pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
MYSQL Presentation for SQL database connectivity
Advanced methodologies resolving dimensionality complications for autism neur...
Network Security Unit 5.pdf for BCA BBA.
A comparative analysis of optical character recognition models for extracting...
Dropbox Q2 2025 Financial Results & Investor Presentation
20250228 LYD VKU AI Blended-Learning.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Approach and Philosophy of On baking technology
gpt5_lecture_notes_comprehensive_20250812015547.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Teaching material agriculture food technology
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Assigned Numbers - 2025 - Bluetooth® Document
Empathic Computing: Creating Shared Understanding
Chapter 3 Spatial Domain Image Processing.pdf
Spectroscopy.pptx food analysis technology
Encapsulation_ Review paper, used for researhc scholars
Electronic commerce courselecture one. Pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
MYSQL Presentation for SQL database connectivity

Amazon Web Services: Lessons for Architecting Data in the Cloud

  • 2. Ask us anything! Don Murray Stewart Harper
  • 4. AWS Data Storage AWS Simple Storage (S3) AWS RDS / Aurora Easy to operate, fault tolerant, scalable Relational Database Service in the cloud. Secure, durable, highly-scalable object storage
  • 5. AWS S3 - Overview Secure, durable, highly- scalable object storage 99.999999999% reliability 3+ million requests per second S3 is highly performant and applications can get 100’s of requests a second $0.03 per GB of storage (5TB of data $150 a month) Can store single objects up to 5TB
  • 6. AWS S3 - How it works A simple key/value persistent object store. It is not a disk and doesn’t have a directory or folder structure. It can be presented as a folder structure though. Every object is identified with a key.
  • 7. You have a large amount of data on premise. When might you use S3? You want to take advantage of features that are not available on premises. You are serving large amounts of data and are having problems scaling. You want programmatic access to your data You wish to build an event driven architecture.
  • 8. AWS S3 - Tips & Tricks Host a static website on S3 for next to nothing. Every object in S3 is web addressable. For even higher performance content delivery use in combination with CloudFront. Version your data automatically or transition to Glacier using lifecycle management. Provide time-limited access to objects via query string authentication.
  • 9. Easy to operate, fault tolerant, scalable Relational Database Service in the cloud. AWS RDS - Overview AWS Aurora (MySQL compatible) Oracle MariaDB PostgreSQL (PostGIS) Microsoft SQL Server MySQLSupportedDataTypes The code, applications, and tools you use today with your existing databases can be used with Amazon RDS.
  • 10. The AWS RDS service lets you... Deploy in minutes Automate backups Apply software patches automatically Scale storage and performance with one click Replicate to enhance availability and reliability
  • 11. When might you use RDS? If you don’t want to worry about infrastructure. If you are looking to run a production database with minimal administration, perfectly suited for a small team. If you don’t want to worry about scaling, fault tolerance and enhanced availability as you grow.
  • 12. On-Premises vs RDS PostGIS Specification: 16 Cores, 122GB of RAM, Linux, 50TB Storage
  • 13. RDS - Tips and Tricks ● You can’t always install all the libraries you want, so check this before you migrate. For example urllib Python lib is not included. ● You don’t have SUPER privileges, which can make it impossible to perform advanced operations against the DB. Native Spatial Support Enterprise Geodatabase Support
  • 14. AWS Aurora Database engine runs on EC2 and is compatible with the Open Source MySQL, logging and storage runs on Amazon’s storage layer. This is where most of the gains have come from.
  • 15. RDS vs Aurora Aurora only supports the InnoDB storage engine. That rules out spatial index support. The gains are mainly around fault tolerance and performance. If you aren’t running a high throughput transactional database it might not be worth migrating. Aurora is only available on larger instances (r3 family), so it is more expensive if you only need a small database. Cross region replication is not supported by Aurora, it is with RDS.
  • 16. Other Key AWS Services Simple Queueing Service Lambda Simple Notification Service API Gateway
  • 18. The event driven design pattern Deploying an event-driven architecture is difficult and requires custom development. Lambda, Amazon’s new compute service, coupled with FME gives you an extremely powerful set of tools for the elastic processing of hundreds of datasets. Amazon S3, Amazon Kinesis and Amazon Dynamodb trigger events.
  • 19. Automate data processing pattern with AWS Lambda, S3, SNS and SQS
  • 20. Event Driven Processing Demo Automatically process data when it is dropped into an S3 bucket, send an email once the data has finished processing and place output in another S3 bucket. Incoming Data S3 Bucket S3 Bucket
  • 24. AWS Architecture for Data Delivery AWS perfectly suited for data delivery solutions: Multiple services for storing large volumes of data cheaply and securely. Costs grows with the popularity of your data delivery service. You can build highly performant workflows with the click of a button.
  • 25. Caching a snapshot of your data on S3
  • 26. Real time Data Delivery
  • 27. Architecture - State Arkansas Data Portal
  • 28. Our rack space costs (real estate on data center floor) are $3200 a month. Add to that the hardware costs, etc and you can see why moving to the cloud was a no brainer for us. Projected Costs - State Arkansas Data Portal On Premises AWS Anthony Davis, State Arkansas
  • 29. APIs - Next Generation of Data Delivery ● APIs allow businesses to build platforms that partners and customers can use to access core business systems, whenever they want, in a stable and secure way. ● We now have the tools to create a fully functioning scalable API in less than a day, this is going to revolutionize businesses and bring the next wave of API value—creating internal disposable APIs.
  • 30. Building an API with no infrastructure or code
  • 31. API Gateway - Demo Building an API on top of a database in less than 5 minutes.
  • 32. Free Training Hands-on and instructor led. FME Desktop Basic ● Nov 17-18 FME Server Authoring ● Dec 1-2 www.safe.com/training
  • 33. AWS Doctor's Office Coming soon - Safe’s AWS experts on hand for 1H.
  • 34. Q&A
  • 35. Thank you! Download a free trial of FME at www.safe.com