SlideShare a Scribd company logo
Introduction to
Amazon Web Services
 Dayanand Shanmugham
    http://www.linkedin.com/in/dayanandshanmugham




               14 September 2012
Agenda
1. What is Amazon Web Services?
2. IT & Architecture Perspective:
   a.   Amazon Product Stack
   b.   S3 (Simple Storage Service) Basic Concepts
   c.   Demo of how to use S3
   d.   Potential Use Cases
3. Cloud IT Eco-System / Business Perspective :
   a. Cloud Storage Costs
   b. Cloud Service Providers (CSPs)
   c. Comparison Study Report
Amazon Web Services
                                   http://aws.amazon.com/


1.   Amazon Web Services offers a complete set of infrastructure and
     application services that enable us to run virtually everything in the cloud
     – from enterprise applications and big data projects to social games and
     mobile apps.

2.   In 2006, Amazon Web Services (AWS) began offering IT infrastructure
     services to businesses in the form of web services – now commonly
     known as cloud computing.

3.   Key benefits is opportunity to replace up-front capital infrastructure
     expenses with low variable costs that scale with our business:
     a.   Low Cost - Pay-as-you-go pricing
     b.   Agility and Instant Elasticity - Massive global cloud infrastructure
     c.   Open and Flexible - Language and operating system agnostic platform
     d.   Secure - PCI DSS Level 1, ISO 27001, FISMA Moderate, HIPAA & SAS 70 Type
          II
Amazon Product Stack
      http://aws.amazon.com/products/

        Our Applications
Amazon Global Infrastructure
      Where are Amazon Web Services available?
Amazon Global Infrastructure




   7 Regions – Each region has multiple Availability Zones
Amazon S3 (Simple Storage Service)
                         What is S3? Features? Concepts? Controls? Demo




       Quote “…Subscriptions to Cloud Storage Services to Reach Half-Billion Level This Year…
               jump to 625 million next year… projected to hit 1.3 billion in 2017…”
                       http://www.bloomberg.com/article/2012-09-06/aUXSunqkHUP0.html


Large organizations such as Amazon, NASA, Netflix, Google, NBC, Zynga, etc depend on Cloud Storage
Amazon S3 (Simple Storage Service)               http://aws.amazon.com/s3/
                                     http://[BucketName].s3.amazonaws.com/[Filename]

Amazon Simple Storage Service (Amazon S3) is a web service that enables us to
store data in the cloud. We can then download data or use the data with other
AWS services, such as Amazon Elastic Cloud Compute (EC2).

Features:
• Storage as a Service: through Web Service API
• Highly Scalable (Unlimited Storage Space)
• High Data Durability (99.999999999%)
• Highly Available (99.99%)
• Server Side Encryption (Data at Rest)
• Versioning

S3 Basic Concepts:
                                   Object                          Buckets With        Region - Geographical location where
      Buckets                                                                            Amazon S3 service is available
                                                                     Objects
(Storage Container)




                      1. Fundamental Entity in Buckets
                      2. Each Object has a unique Key
Access Controls
1.   Amazon S3 enables us to manage access to objects and buckets using – we can
     use them independently or together:
     –   Access control lists (ACLs)
     –   Bucket policies
     –   IAM policies


2.   ACLs only grant permissions; they do not deny them. ACLs can contain the
     following grantee types:
     –   Specific AWS accounts
     –   All AWS accounts
     –   Any anonymous request


3.   Bucket policies provide access control management at the bucket level for –
     these are collection of JSON statements:
     –   Bucket
     –   Objects


4.   AWS Identity and Access Management (IAM) enables us to create multiple users
     within our AWS account and manage their permissions via IAM policies:
     –   Bucket policies are attached to a bucket
     –   IAM policies are attached to individual users in your account
Access Controls
                                        http://docs.amazonwebservices.com/AmazonS3/latest/dev/UsingIAMPolicies.html

Example#1: IAM policy and Bucket Policy gives Bob & Susan permission to             Example#2: IAM policy gives Bob permission to use PutObject on
                   access PutObject on bucket_xyz                                    bucket_xyz & bucket policy gives permission to use ListBucket




         Example#3: Explicit deny always overrides an allow                   Example#4: Bucket policy denies upload object (s3:PutObject) permission
                                                                                 to everyone if the request does not include the x-amz-server-side-
                                                                                        encryption header requesting server-side encryption
Basic Operations
                         https://console.aws.amazon.com
                http://[BucketName].s3.amazonaws.com/[Filename]




           Store /Write
                                                                  DC2
               Read
                                                          DC1
Our App
             Delete
                                                                  DC3

               List




  Demo of S3 using Amazon Web Service Console
Distributed Storage
 Systems Strategy
If we have to design S3? Design Challenges? How?
Distributed Storage
            Systems Strategy
Option#1



            Store /Write
                                  DC2
              Allow
           Replication To   DC1
 Our App    Complete
                                  DC3
           Response (OK)

                List
Distributed Storage
            Systems Strategy
Option#2



            Store /Write
                                   DC2
           Response (OK)
                             DC1
 Our App
           Replicate after
           Data is stored          DC3
              in DC1

                 List
Behaviour of S3 in
Concurrent Read(s) & Write(s)
        http://[BucketName].s3.amazonaws.com/[Filename]
Potential Use Cases /
      Case Studies
S3 in Overall IT Application Architecture – When to use? Need?
Potential Use Cases / Case Studies
Need:
1. This enterprise manages two disparate sets of
   information. Table-oriented data is maintained in
   an on-premise Oracle database, while a SAN is
   used as a repository for file-based information.
2. For further safeguarding of these vital assets,
   tapes are used for backup and disaster recovery
   purposes.
3. Approximately 20 GB of new information is
   generated each day.
4. Unfortunately,   the backup          and    archive
   management processes are cumbersome and
   expensive,      while        restoring    archived
   information can take days to complete

Solution:
1. Architects decide to use AWS cloud
2. For file-based artifacts, access to Amazon S3
   will be via the Amazon AWS SDK for Java and
   AWS Toolkit for Eclipse
3. Storage architects will create one or more uniquely-
   identified buckets in Amazon S3, each of which can
   hold an unlimited amount of backup data
4. For relational data, Oracle’s Secure Backup Cloud
   Module will be able to take advantage of existing
   RMAN scripts to back up information directly
   from the Oracle database into Amazon S3
5. Third-party storage management solution can be
   used to manage the entire process, including
   encryption and other security details
Potential Use Cases / Case Studies
Need:
1. This company provides a tremendous number
   of multimedia files (audio files, videos, and
   images), which are stored on internally hosted
   servers and made available over the Internet.
   In addition to the multimedia files themselves,
   each file requires a significant amount of
   metadata (such as title, author, keywords, size,
   and so on).
2. On the plus side, their web site is a hit
3. Unfortunately, internal servers are failing to
   keep up with this demand — the sheer
   amount of data will soon overwhelm available
   disk storage, and the amount of necessary
   metadata indexing is outstripping processing
   capacity



Solution:
1. Architects decide to publish all
   content to the AWS cloud-based
   storage, thereby eliminating the
   need to purchase and maintain
   internal servers.
2. Users will directly fetch content
   from AWS-based storage
Potential Use Cases / Case Studies
Need:
1. A large corporation is maintaining a data
   warehouse on a High-CPU Extra Large
   Amazon EC2 instance, with ten 800 GB
   EBS volumes holding the information itself
2. While this architecture successfully meets
   business needs, some new requirements
   will mandate an extension to their storage
   composition. Specifically, a change to
   organizational policy now requires
   frequent data snapshots; these images
   need to archive supplementary data
   approximately every one to two hours.



Solution:
1. Architects decide to add Amazon S3 to
   the mixture
2. Amazon EC2 instance and supporting
   EBS volumes will continue in their current
   roles
3. Developers will write a script or small
   application that uses Amazon EBS API
   to create incremental snapshots stored
   in Amazon S3
4. The new snapshot application will be run
   every 120 minutes using a Linux cron job
   or a Windows scheduled task
Potential Use Cases / Case Studies
Need:
1. A New SaaS application is in the planning stages
2. Solution will need to manage a variety of information, including
   traditional relational data, frequently changing status feeds, and
   large amounts of multimedia.
3. The logic behind the application will be complex and processing-
   intensive, needing numerous joins of relational data to produce the
   required results.
4. Finally, as is the case with virtually all of these types of solutions,
   users will demand high availability and fault tolerance.
5. Rather than being forced to hand-code all of these availability-
   oriented capabilities, the designers hope to leverage the cloud
   for features such as replication, scalability, and automated
   backups, freeing their time for developing the core application
   logic.



Solution:
1. The Architects of the new SaaS package opt for a collection of AWS
   storage offerings, each of which will handle a specific responsibility.
2. RDS will serve as the repository for all information that requires full
   relational database infrastructure
3. To process complex cross-table joins, RDS will deliver high availability
   via its well-proven data replication architecture
4. Read operations will be served by the slaves, with writes being
   processed on the master—all of which will be automatically
   managed by RDS
5. RDS will provide database administration as a service, with one-click
   vertical scaling, elastic storage, and automated backups
6. Multimedia objects will reside in Amazon S3, while SimpleDB will
   be tasked with maintaining the continually updated status feeds
Potential Use Cases / Case Studies
Need:
1. This web-based social application has seen
   a massive spike in traffic - likely cause for
   this momentum is the release of a new client
   application designed for Smartphones
2. Server-side    information    repository was
   originally deployed using an on-premise
   MySQL database
3. No longer possible to support the number
   of clients or the amount of managed
   information with that database alone




Solution:
1. Architects want to avoid forcing a costly and
   time-consuming rewrite of their primary
   application, so most of the database-focused
   logic must remain unchanged
Cloud IT Eco-System
   Costs? Is Cloud Storage economical?
“File Storage Costs Less In The Cloud
     Than In-House” By Forrestor
       http://media.amazonwebservices.com/Forrester_File_Storage_Costs_Less_In_The_Cloud.pdf
                               http://aws.amazon.com/s3/#pricing
“File Storage Costs Less In The Cloud
     Than In-House” By Forrestor
       http://media.amazonwebservices.com/Forrester_File_Storage_Costs_Less_In_The_Cloud.pdf
                               http://aws.amazon.com/s3/#pricing
“File Storage Costs Less In The Cloud
     Than In-House” By Forrestor
       http://media.amazonwebservices.com/Forrester_File_Storage_Costs_Less_In_The_Cloud.pdf
                               http://aws.amazon.com/s3/#pricing
Cloud IT Eco-System
    Other Cloud Service Providers?
Cloud IT Eco-System
          Cloud Storage Providers
   1. Amazon Simple Storage Service (S3)
   2. AT&T Synaptic Storage as a Service
   3. Box.net
   4. Choopa Cloud Object Storage
   5. Cloud Drive
   6. CloudOne Storage
   7. Egnyte HybridCloud
   8. Enzu Cloud Storage
   9. GoGrid Cloud Storage
   10. Iron Mountain Storage-as-a-Service
   11. Layered Tech Depot
   12. Mozzy
   13. Nasuni
   14. Netcetera vStorage
   15. NetDepot Cloud Storage
   16. OpSource Cloud Files
   17. Rackspace Cloud Files
   18. Softlayer Cloudlayer Storage
   19. Storage Qloud
   20. SugarSync
   21. Syncplicity Virtual Private Cloud
   22. The Planet Storage Cloud
   23. Windows Azure Storage Service
   24. Zetta Storage Service
   25. …
Cloud IT Eco-System
           Content Delivery Networks
 1. Akamai Media Delivery      21. Level 3 CDN
 2. Amazon Cloudfront          22. Limelight Networks
 3. BitGravity                 23. LocalMirror
 4. Cachefly                   24. MaxCDN
 5. CDN Video                  25. Mirror Image
 6. CDN77                      26. NetDNA
 7. CDNetworks                 27. Ngenix
 8. ChinaCache                 28. Prime Networks
 9. ChinaNetCenter CDN         29. Rapidedge CDN
 10. CloudCache                30. SimpleCDN
 11. CloudFlare CDN            31. Softlayer Cloudlayer CDN
 12. Cotendo                   32. Tata Communications CDN
 13. EdgeCast Networks         33. UPX CloudCache
 14. Fastly                    34. Value CDN
 15. Fastweb CDN               35. VoxCAST
 16. GoGrid CDN                36. VPS.NET CDN
 17. Highwinds                 37. …
 18. Hostway CDN
 19. Internap CDN
 20. Internode CDN
Cloud IT Eco-System
                                   Cloud Hosting Providers
1. 1&1 Dynamic Cloud Server    31. iland                             61. The Planet Server Cloud
2. Atlantic Cloud Servers      32. Instance Cloud Computing          62. VI Cloud Hosting
3. BlueLock                    33. iomart Cloud Hosting              63. VoxCLOUD
4. CariNet                     34. iWeb Smart Servers                64. VPS.NET
5. Cartika Cloud               35. Joyent Cloud                      65. VPSLand Bare Metal Cloud Servers
6. Cloud Central               36. Layered Tech Virtual Machines     66. WestHost Cloud Hosting
7. Cloud.bg                    37. Logicworks Cloud Computing        67. Windows Azure Compute Service
8. CloudSigma                  38. Mediatemple                       68. Zerigo Cloud Servers
9. Combell Cloud Servers       39. NaviSite Managed Cloud Services   69. Amazon Elastic Compute Cloud (EC2)
10. Connectria Cloud Hosting   40. NetDepot Cloud Servers            70. AT&T Synaptic Compute as a Service
11. Crackerhost                41. NewServers                        71. Go Daddy Cloud Server (Mac OS X)
12. Datapipe Managed Cloud     42. Openhosting vServer               72. AirVM
13. Dediserve Cloud Machines   43. OpSource Cloud Servers            73. …
14. DigitalOcean               44. OrcsWeb Windows Cloud Server
15. DotBlock Cloud VPS         45. OVH Cloud
16. ElasticHosts               46. Peak 10 Cloud Services
17. Engine Yard AppCloud       47. Rackforce ddsCloud
18. Enzu Cloud Hosting         48. Rackspace Cloud Servers
19. FireHost                   49. Razor Servers Cloud
20. FlexiScale                 50. ReliaCloud
21. Gandi Cloud VPS            51. Server4You vServer Cloud
22. Gigenetcloud               52. ServerCentral Cloud Services
23. GoGrid Cloud Servers       53. SingleHop Cascade
24. Google App Engine          54. SiteCloud
25. Hivelocity PDC             55. Slicehost
26. Hostbasket Dynamic Cloud   56. Softlayer Cloudlayer Computing
27. HostIcan VPS               57. Storm On Demand
28. Hosting.com                58. SunGard Cloud Platform
29. Hostway Cloud Servers      59. Tektonic Cloud VPS
30. Hpchost Cloud Hosting      60. Terremark vCloud Express
Cloud IT Eco-System
   Which Cloud Provider(s) to use?
Comparative Study (Done by Nasuni)
                          http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf


Background:
1.   In April 2009, Nasuni began an ongoing and unprecedented evaluation process of
     “Cloud Service Providers (CSPs) based on a wide array of factors

2.   Technical test looked at how CSPs perform in a specific use case, namely, how
     they perform for organizations that want to take advantage of the cloud for
     primary storage, data protection and disaster recovery. Nasuni did not test for
     every possible use case.

3.   Test focus on three key areas of importance for organizations that want to take
     advantage of cloud storage:
      – Performance: The cloud needs to respond quickly to queries and not slow when
        stressed. If performance is poor, organizations will spend too much time waiting on the
        cloud and productivity will suffer.
      – Stability/Availability: If organizations are to trust critical data to the cloud, it must be
        available at all times.
      – Scalability: One of the primary advantages of cloud storage is unlimited capacity.
        Without this property, the cloud is much less valuable to organizations.
Comparative Study Results
                       http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf


Results:
1.   Ultimately, only 6 of 16 providers passed Nasuni’s testing:
     –   Amazon S3
     –   AT&T Synaptic Storage as a Service
     –   Microsoft Azure
     –   Nirvanix
     –   Peer1 Hosting
     –   Rackspace Cloud

2.   Only Two Cloud Service Providers emerged as top performers in the
     Nasuni study:
     – Amazon S3
     – Microsoft Azure
     (Note: Amazon S3 standout across all evaluation areas)
Comparative Study Methodology
                    http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf


Methodology:
1. API Integration – To ensure that it is possible to test the service at all

2. Unit Testing – Larger software components are broken down into
   their building blocks (units) and then tested for inputs, outputs and
   error cases

3. Performance Testing – To measure response time (how quickly one
   can interact with the cloud), throughput (how fast data can move
   back and forth to and from the cloud), and the impact of a higher
   level of stress.

4. Stability Testing – To assess the long-term reliability of each CSP

5. Scalability Testing – To understand how well each CSP handles high
   object counts
Comparative Study Reports
   http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf
Comparative Study Reports
   http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf
Comparative Study Reports
   http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf

More Related Content

PPT
Amazon Web Services
PPTX
Amazon web services,
PPT
Amazon product stack and infrastructure
PPTX
Aws overview
PPTX
Basic introduction of Amazon Web Services (AWS)
PDF
Artem Zhurbila - 1 aws overview
PPTX
Aws overview (Amazon Web Services)
PPTX
Aws platform overview
Amazon Web Services
Amazon web services,
Amazon product stack and infrastructure
Aws overview
Basic introduction of Amazon Web Services (AWS)
Artem Zhurbila - 1 aws overview
Aws overview (Amazon Web Services)
Aws platform overview

What's hot (14)

PPTX
Aws tutorial for beginners- tibacademy.in
PPTX
Aws ppt
ODP
Introduction to Amazon Web Services
PPTX
Introduction to Amazon Web Services by i2k2 Networks
PPT
Cloud Computing With AWS
PPT
Cloud Computing and Amazon Web Services
PDF
AWS TEchnical Essentials Workshop
PPTX
AWS Overview in a Single Diagram
PPS
Amazon web service
PDF
Aws seminar report
PPTX
Aws overview
PPT
Amazon Web Services (AWS) Case study
PPTX
What is AWS?
PDF
AWS 101, London - September 2014
Aws tutorial for beginners- tibacademy.in
Aws ppt
Introduction to Amazon Web Services
Introduction to Amazon Web Services by i2k2 Networks
Cloud Computing With AWS
Cloud Computing and Amazon Web Services
AWS TEchnical Essentials Workshop
AWS Overview in a Single Diagram
Amazon web service
Aws seminar report
Aws overview
Amazon Web Services (AWS) Case study
What is AWS?
AWS 101, London - September 2014
Ad

Viewers also liked (16)

PPTX
App Development Evolution: What has changed?
PPTX
Cloud Security (AWS)
PPTX
Cloud service models 101
PPTX
Microsoft Azure Platform-as-a-Service (PaaS)
PDF
Understanding The Benefits Of Amazon EC2
PPSX
Google In China - Case Study
PPTX
Nanotechnology in dentistry
PDF
Amazon.com - Company Analysis (OD & HRM)
PPTX
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014
PPTX
Amazon: Strategic Management
PDF
Latest Seminar Topics for Engineering,MCA,MSc Students
KEY
NoSQL Databases: Why, what and when
PPTX
Amazon ppt
PDF
Amazon.com Strategic Analysis
PPTX
Azure Cloud PPT
PPTX
Microsoft Cloud Computing - Windows Azure Platform
App Development Evolution: What has changed?
Cloud Security (AWS)
Cloud service models 101
Microsoft Azure Platform-as-a-Service (PaaS)
Understanding The Benefits Of Amazon EC2
Google In China - Case Study
Nanotechnology in dentistry
Amazon.com - Company Analysis (OD & HRM)
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014
Amazon: Strategic Management
Latest Seminar Topics for Engineering,MCA,MSc Students
NoSQL Databases: Why, what and when
Amazon ppt
Amazon.com Strategic Analysis
Azure Cloud PPT
Microsoft Cloud Computing - Windows Azure Platform
Ad

Similar to Introduction to Amazon Web Services (20)

PDF
Cloud Computing With Amazon Web Services, Part 2: Storage in the Cloud With A...
PPT
PPTX
amazon web servics in the cloud aws and its categories compute cloud and stor...
PDF
PowerPoint Presentation.pdf
PPTX
Aws Solution Architecture Associate - summary
PPTX
cse40822-amazon.pptx
PPTX
Amazon Web Services OverView
PPTX
AWS Storage - S3 Fundamentals
PPTX
Amazon_S3 (Simple storage service)_Presentation.pptx
PDF
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
PPTX
Aws introduction tocloudstorage
PPSX
Amazon ec2 s3 dynamo db
PDF
Aws storage services whitepaper v9
PPTX
Jump Start to Amazon Web Services
PDF
Deep Dive on EC2 and S3
PDF
Aws storage services whitepaper v9
PPTX
Case Study Amazon AWS
PDF
My cool new Slideshow!
PDF
cdac@amitkumar@test123
Cloud Computing With Amazon Web Services, Part 2: Storage in the Cloud With A...
amazon web servics in the cloud aws and its categories compute cloud and stor...
PowerPoint Presentation.pdf
Aws Solution Architecture Associate - summary
cse40822-amazon.pptx
Amazon Web Services OverView
AWS Storage - S3 Fundamentals
Amazon_S3 (Simple storage service)_Presentation.pptx
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
Aws introduction tocloudstorage
Amazon ec2 s3 dynamo db
Aws storage services whitepaper v9
Jump Start to Amazon Web Services
Deep Dive on EC2 and S3
Aws storage services whitepaper v9
Case Study Amazon AWS
My cool new Slideshow!
cdac@amitkumar@test123

More from Dayanand Shanmugham (7)

PPTX
Pugh matrix analysis_to_purchase_new_mobile_phone
PPTX
Criteria used to_purchase_mobile_phone
PPT
Docker benefits v0.1
PPT
Cloud it eco system
PPT
Amazon simple db
PPT
Amazon simple queue service
PPT
Cloud storage providers comparative study
Pugh matrix analysis_to_purchase_new_mobile_phone
Criteria used to_purchase_mobile_phone
Docker benefits v0.1
Cloud it eco system
Amazon simple db
Amazon simple queue service
Cloud storage providers comparative study

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Encapsulation theory and applications.pdf
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Big Data Technologies - Introduction.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Digital-Transformation-Roadmap-for-Companies.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Encapsulation theory and applications.pdf
Spectroscopy.pptx food analysis technology
Big Data Technologies - Introduction.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Unlocking AI with Model Context Protocol (MCP)
Reach Out and Touch Someone: Haptics and Empathic Computing
Dropbox Q2 2025 Financial Results & Investor Presentation
NewMind AI Weekly Chronicles - August'25 Week I
Mobile App Security Testing_ A Comprehensive Guide.pdf
Empathic Computing: Creating Shared Understanding
Encapsulation_ Review paper, used for researhc scholars
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Machine learning based COVID-19 study performance prediction
Diabetes mellitus diagnosis method based random forest with bat algorithm
Building Integrated photovoltaic BIPV_UPV.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025

Introduction to Amazon Web Services

  • 1. Introduction to Amazon Web Services Dayanand Shanmugham http://www.linkedin.com/in/dayanandshanmugham 14 September 2012
  • 2. Agenda 1. What is Amazon Web Services? 2. IT & Architecture Perspective: a. Amazon Product Stack b. S3 (Simple Storage Service) Basic Concepts c. Demo of how to use S3 d. Potential Use Cases 3. Cloud IT Eco-System / Business Perspective : a. Cloud Storage Costs b. Cloud Service Providers (CSPs) c. Comparison Study Report
  • 3. Amazon Web Services http://aws.amazon.com/ 1. Amazon Web Services offers a complete set of infrastructure and application services that enable us to run virtually everything in the cloud – from enterprise applications and big data projects to social games and mobile apps. 2. In 2006, Amazon Web Services (AWS) began offering IT infrastructure services to businesses in the form of web services – now commonly known as cloud computing. 3. Key benefits is opportunity to replace up-front capital infrastructure expenses with low variable costs that scale with our business: a. Low Cost - Pay-as-you-go pricing b. Agility and Instant Elasticity - Massive global cloud infrastructure c. Open and Flexible - Language and operating system agnostic platform d. Secure - PCI DSS Level 1, ISO 27001, FISMA Moderate, HIPAA & SAS 70 Type II
  • 4. Amazon Product Stack http://aws.amazon.com/products/ Our Applications
  • 5. Amazon Global Infrastructure Where are Amazon Web Services available?
  • 6. Amazon Global Infrastructure 7 Regions – Each region has multiple Availability Zones
  • 7. Amazon S3 (Simple Storage Service) What is S3? Features? Concepts? Controls? Demo Quote “…Subscriptions to Cloud Storage Services to Reach Half-Billion Level This Year… jump to 625 million next year… projected to hit 1.3 billion in 2017…” http://www.bloomberg.com/article/2012-09-06/aUXSunqkHUP0.html Large organizations such as Amazon, NASA, Netflix, Google, NBC, Zynga, etc depend on Cloud Storage
  • 8. Amazon S3 (Simple Storage Service) http://aws.amazon.com/s3/ http://[BucketName].s3.amazonaws.com/[Filename] Amazon Simple Storage Service (Amazon S3) is a web service that enables us to store data in the cloud. We can then download data or use the data with other AWS services, such as Amazon Elastic Cloud Compute (EC2). Features: • Storage as a Service: through Web Service API • Highly Scalable (Unlimited Storage Space) • High Data Durability (99.999999999%) • Highly Available (99.99%) • Server Side Encryption (Data at Rest) • Versioning S3 Basic Concepts: Object Buckets With Region - Geographical location where Buckets Amazon S3 service is available Objects (Storage Container) 1. Fundamental Entity in Buckets 2. Each Object has a unique Key
  • 9. Access Controls 1. Amazon S3 enables us to manage access to objects and buckets using – we can use them independently or together: – Access control lists (ACLs) – Bucket policies – IAM policies 2. ACLs only grant permissions; they do not deny them. ACLs can contain the following grantee types: – Specific AWS accounts – All AWS accounts – Any anonymous request 3. Bucket policies provide access control management at the bucket level for – these are collection of JSON statements: – Bucket – Objects 4. AWS Identity and Access Management (IAM) enables us to create multiple users within our AWS account and manage their permissions via IAM policies: – Bucket policies are attached to a bucket – IAM policies are attached to individual users in your account
  • 10. Access Controls http://docs.amazonwebservices.com/AmazonS3/latest/dev/UsingIAMPolicies.html Example#1: IAM policy and Bucket Policy gives Bob & Susan permission to Example#2: IAM policy gives Bob permission to use PutObject on access PutObject on bucket_xyz bucket_xyz & bucket policy gives permission to use ListBucket Example#3: Explicit deny always overrides an allow Example#4: Bucket policy denies upload object (s3:PutObject) permission to everyone if the request does not include the x-amz-server-side- encryption header requesting server-side encryption
  • 11. Basic Operations https://console.aws.amazon.com http://[BucketName].s3.amazonaws.com/[Filename] Store /Write DC2 Read DC1 Our App Delete DC3 List Demo of S3 using Amazon Web Service Console
  • 12. Distributed Storage Systems Strategy If we have to design S3? Design Challenges? How?
  • 13. Distributed Storage Systems Strategy Option#1 Store /Write DC2 Allow Replication To DC1 Our App Complete DC3 Response (OK) List
  • 14. Distributed Storage Systems Strategy Option#2 Store /Write DC2 Response (OK) DC1 Our App Replicate after Data is stored DC3 in DC1 List
  • 15. Behaviour of S3 in Concurrent Read(s) & Write(s) http://[BucketName].s3.amazonaws.com/[Filename]
  • 16. Potential Use Cases / Case Studies S3 in Overall IT Application Architecture – When to use? Need?
  • 17. Potential Use Cases / Case Studies Need: 1. This enterprise manages two disparate sets of information. Table-oriented data is maintained in an on-premise Oracle database, while a SAN is used as a repository for file-based information. 2. For further safeguarding of these vital assets, tapes are used for backup and disaster recovery purposes. 3. Approximately 20 GB of new information is generated each day. 4. Unfortunately, the backup and archive management processes are cumbersome and expensive, while restoring archived information can take days to complete Solution: 1. Architects decide to use AWS cloud 2. For file-based artifacts, access to Amazon S3 will be via the Amazon AWS SDK for Java and AWS Toolkit for Eclipse 3. Storage architects will create one or more uniquely- identified buckets in Amazon S3, each of which can hold an unlimited amount of backup data 4. For relational data, Oracle’s Secure Backup Cloud Module will be able to take advantage of existing RMAN scripts to back up information directly from the Oracle database into Amazon S3 5. Third-party storage management solution can be used to manage the entire process, including encryption and other security details
  • 18. Potential Use Cases / Case Studies Need: 1. This company provides a tremendous number of multimedia files (audio files, videos, and images), which are stored on internally hosted servers and made available over the Internet. In addition to the multimedia files themselves, each file requires a significant amount of metadata (such as title, author, keywords, size, and so on). 2. On the plus side, their web site is a hit 3. Unfortunately, internal servers are failing to keep up with this demand — the sheer amount of data will soon overwhelm available disk storage, and the amount of necessary metadata indexing is outstripping processing capacity Solution: 1. Architects decide to publish all content to the AWS cloud-based storage, thereby eliminating the need to purchase and maintain internal servers. 2. Users will directly fetch content from AWS-based storage
  • 19. Potential Use Cases / Case Studies Need: 1. A large corporation is maintaining a data warehouse on a High-CPU Extra Large Amazon EC2 instance, with ten 800 GB EBS volumes holding the information itself 2. While this architecture successfully meets business needs, some new requirements will mandate an extension to their storage composition. Specifically, a change to organizational policy now requires frequent data snapshots; these images need to archive supplementary data approximately every one to two hours. Solution: 1. Architects decide to add Amazon S3 to the mixture 2. Amazon EC2 instance and supporting EBS volumes will continue in their current roles 3. Developers will write a script or small application that uses Amazon EBS API to create incremental snapshots stored in Amazon S3 4. The new snapshot application will be run every 120 minutes using a Linux cron job or a Windows scheduled task
  • 20. Potential Use Cases / Case Studies Need: 1. A New SaaS application is in the planning stages 2. Solution will need to manage a variety of information, including traditional relational data, frequently changing status feeds, and large amounts of multimedia. 3. The logic behind the application will be complex and processing- intensive, needing numerous joins of relational data to produce the required results. 4. Finally, as is the case with virtually all of these types of solutions, users will demand high availability and fault tolerance. 5. Rather than being forced to hand-code all of these availability- oriented capabilities, the designers hope to leverage the cloud for features such as replication, scalability, and automated backups, freeing their time for developing the core application logic. Solution: 1. The Architects of the new SaaS package opt for a collection of AWS storage offerings, each of which will handle a specific responsibility. 2. RDS will serve as the repository for all information that requires full relational database infrastructure 3. To process complex cross-table joins, RDS will deliver high availability via its well-proven data replication architecture 4. Read operations will be served by the slaves, with writes being processed on the master—all of which will be automatically managed by RDS 5. RDS will provide database administration as a service, with one-click vertical scaling, elastic storage, and automated backups 6. Multimedia objects will reside in Amazon S3, while SimpleDB will be tasked with maintaining the continually updated status feeds
  • 21. Potential Use Cases / Case Studies Need: 1. This web-based social application has seen a massive spike in traffic - likely cause for this momentum is the release of a new client application designed for Smartphones 2. Server-side information repository was originally deployed using an on-premise MySQL database 3. No longer possible to support the number of clients or the amount of managed information with that database alone Solution: 1. Architects want to avoid forcing a costly and time-consuming rewrite of their primary application, so most of the database-focused logic must remain unchanged
  • 22. Cloud IT Eco-System Costs? Is Cloud Storage economical?
  • 23. “File Storage Costs Less In The Cloud Than In-House” By Forrestor http://media.amazonwebservices.com/Forrester_File_Storage_Costs_Less_In_The_Cloud.pdf http://aws.amazon.com/s3/#pricing
  • 24. “File Storage Costs Less In The Cloud Than In-House” By Forrestor http://media.amazonwebservices.com/Forrester_File_Storage_Costs_Less_In_The_Cloud.pdf http://aws.amazon.com/s3/#pricing
  • 25. “File Storage Costs Less In The Cloud Than In-House” By Forrestor http://media.amazonwebservices.com/Forrester_File_Storage_Costs_Less_In_The_Cloud.pdf http://aws.amazon.com/s3/#pricing
  • 26. Cloud IT Eco-System Other Cloud Service Providers?
  • 27. Cloud IT Eco-System Cloud Storage Providers 1. Amazon Simple Storage Service (S3) 2. AT&T Synaptic Storage as a Service 3. Box.net 4. Choopa Cloud Object Storage 5. Cloud Drive 6. CloudOne Storage 7. Egnyte HybridCloud 8. Enzu Cloud Storage 9. GoGrid Cloud Storage 10. Iron Mountain Storage-as-a-Service 11. Layered Tech Depot 12. Mozzy 13. Nasuni 14. Netcetera vStorage 15. NetDepot Cloud Storage 16. OpSource Cloud Files 17. Rackspace Cloud Files 18. Softlayer Cloudlayer Storage 19. Storage Qloud 20. SugarSync 21. Syncplicity Virtual Private Cloud 22. The Planet Storage Cloud 23. Windows Azure Storage Service 24. Zetta Storage Service 25. …
  • 28. Cloud IT Eco-System Content Delivery Networks 1. Akamai Media Delivery 21. Level 3 CDN 2. Amazon Cloudfront 22. Limelight Networks 3. BitGravity 23. LocalMirror 4. Cachefly 24. MaxCDN 5. CDN Video 25. Mirror Image 6. CDN77 26. NetDNA 7. CDNetworks 27. Ngenix 8. ChinaCache 28. Prime Networks 9. ChinaNetCenter CDN 29. Rapidedge CDN 10. CloudCache 30. SimpleCDN 11. CloudFlare CDN 31. Softlayer Cloudlayer CDN 12. Cotendo 32. Tata Communications CDN 13. EdgeCast Networks 33. UPX CloudCache 14. Fastly 34. Value CDN 15. Fastweb CDN 35. VoxCAST 16. GoGrid CDN 36. VPS.NET CDN 17. Highwinds 37. … 18. Hostway CDN 19. Internap CDN 20. Internode CDN
  • 29. Cloud IT Eco-System Cloud Hosting Providers 1. 1&1 Dynamic Cloud Server 31. iland 61. The Planet Server Cloud 2. Atlantic Cloud Servers 32. Instance Cloud Computing 62. VI Cloud Hosting 3. BlueLock 33. iomart Cloud Hosting 63. VoxCLOUD 4. CariNet 34. iWeb Smart Servers 64. VPS.NET 5. Cartika Cloud 35. Joyent Cloud 65. VPSLand Bare Metal Cloud Servers 6. Cloud Central 36. Layered Tech Virtual Machines 66. WestHost Cloud Hosting 7. Cloud.bg 37. Logicworks Cloud Computing 67. Windows Azure Compute Service 8. CloudSigma 38. Mediatemple 68. Zerigo Cloud Servers 9. Combell Cloud Servers 39. NaviSite Managed Cloud Services 69. Amazon Elastic Compute Cloud (EC2) 10. Connectria Cloud Hosting 40. NetDepot Cloud Servers 70. AT&T Synaptic Compute as a Service 11. Crackerhost 41. NewServers 71. Go Daddy Cloud Server (Mac OS X) 12. Datapipe Managed Cloud 42. Openhosting vServer 72. AirVM 13. Dediserve Cloud Machines 43. OpSource Cloud Servers 73. … 14. DigitalOcean 44. OrcsWeb Windows Cloud Server 15. DotBlock Cloud VPS 45. OVH Cloud 16. ElasticHosts 46. Peak 10 Cloud Services 17. Engine Yard AppCloud 47. Rackforce ddsCloud 18. Enzu Cloud Hosting 48. Rackspace Cloud Servers 19. FireHost 49. Razor Servers Cloud 20. FlexiScale 50. ReliaCloud 21. Gandi Cloud VPS 51. Server4You vServer Cloud 22. Gigenetcloud 52. ServerCentral Cloud Services 23. GoGrid Cloud Servers 53. SingleHop Cascade 24. Google App Engine 54. SiteCloud 25. Hivelocity PDC 55. Slicehost 26. Hostbasket Dynamic Cloud 56. Softlayer Cloudlayer Computing 27. HostIcan VPS 57. Storm On Demand 28. Hosting.com 58. SunGard Cloud Platform 29. Hostway Cloud Servers 59. Tektonic Cloud VPS 30. Hpchost Cloud Hosting 60. Terremark vCloud Express
  • 30. Cloud IT Eco-System Which Cloud Provider(s) to use?
  • 31. Comparative Study (Done by Nasuni) http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf Background: 1. In April 2009, Nasuni began an ongoing and unprecedented evaluation process of “Cloud Service Providers (CSPs) based on a wide array of factors 2. Technical test looked at how CSPs perform in a specific use case, namely, how they perform for organizations that want to take advantage of the cloud for primary storage, data protection and disaster recovery. Nasuni did not test for every possible use case. 3. Test focus on three key areas of importance for organizations that want to take advantage of cloud storage: – Performance: The cloud needs to respond quickly to queries and not slow when stressed. If performance is poor, organizations will spend too much time waiting on the cloud and productivity will suffer. – Stability/Availability: If organizations are to trust critical data to the cloud, it must be available at all times. – Scalability: One of the primary advantages of cloud storage is unlimited capacity. Without this property, the cloud is much less valuable to organizations.
  • 32. Comparative Study Results http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf Results: 1. Ultimately, only 6 of 16 providers passed Nasuni’s testing: – Amazon S3 – AT&T Synaptic Storage as a Service – Microsoft Azure – Nirvanix – Peer1 Hosting – Rackspace Cloud 2. Only Two Cloud Service Providers emerged as top performers in the Nasuni study: – Amazon S3 – Microsoft Azure (Note: Amazon S3 standout across all evaluation areas)
  • 33. Comparative Study Methodology http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf Methodology: 1. API Integration – To ensure that it is possible to test the service at all 2. Unit Testing – Larger software components are broken down into their building blocks (units) and then tested for inputs, outputs and error cases 3. Performance Testing – To measure response time (how quickly one can interact with the cloud), throughput (how fast data can move back and forth to and from the cloud), and the impact of a higher level of stress. 4. Stability Testing – To assess the long-term reliability of each CSP 5. Scalability Testing – To understand how well each CSP handles high object counts
  • 34. Comparative Study Reports http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf
  • 35. Comparative Study Reports http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf
  • 36. Comparative Study Reports http://www.nasuni.com/downloads/resources/87/the_state_of_cloud_storage.pdf