SlideShare a Scribd company logo
AUTOATIC SCALING OF INTER
NET APPLICATIONS FOR CLOUD
COMPUTING SERVICES
P.CHANDU
118T1A0503
ponnagantichandu@gmail.com
COMPONENTS
INTRODUCTION
SYSTEM
ARCHITECTURE
PROBLEM
DEFINITION
CCBP FOR
AUTOMATIC
SCALING OF
RESOURSCES
EXPERIMENTS
SIMULATIONS
CONCLUSION
INTRODUCTION
• Benefits of cloud computing service is the
resource elasticity
• A business customer can scale up and down its
resource usage as needed without upfront
capital investment or long term commitment.
The amazon EC2 service, for example, allows
users to buy as many virtual machine (VM)
instances as they want and operate them much
like physical hardware.
• Many internet applications can benefit from
an auto scaling property where their resource
usage can be scaled up and down
automatically by the cloud service provider.
• “Pay as you go” model.
• Fault tolerance.
• It consists of a load balancing switch,
A set of application servers, and a set of
backend storage servers.
INTRODUCTION[1]
• The applications store
their state
information in the
backend storage
servers. It is
important that the
applications
themselves are
stateless so that they
can be replicated
safely.
• The storage servers
may also become
overloaded, but the
focus of this work is
on the application
tier.
Example: the Google
app engine service.
Fig .two tiered
architecture for internet
applications
SYSTEM ARCHITECTURE
• We encapsulate each
application instance inside a
virtual machine the use virtual
machine to provide isolation
among untrusted users.
• Each server run the XEN
hypervisor.
• The system described as some
decisions:
Application placement Load
distribution.
Effectiveness techniques in
ACFIAFCS:
• Ballooning technique
• Mirroring technology
DEFINITIONPROBLEM
• Multi dimensional resource demands
• Vector bin packing.
Automatic scaling problem we want to slove is
defined as follows: Our goals are to maximize
the demand satisfaction ratio, minimize the
placement change frequency and minimize
energy consumption. Our optimization
objectives can be expressed as follows
• To simplify the problem described above, we make
the assumption that the servers are homogeneous
with uniform capacity.
• Then the auto scaling problem is similar to the
Class Constrained Bin Packing (CCBP) problem.
• The only difference is that the CCBP problem does
not have the "Minimize the placement change
frequency "goal.
• Therefore, in order to solve our problem, we
modified the CCBP model to support the
"Minimize the placement change frequency "goal
CCBPFORAUTOMATICSCALING
OFRESOURCES
• Bin packing problem: a series of items of
different sizes need to be packed into a minimum
number of bins.
• The goal is to pack the items in to a minimum
number of bins.
• We can model our resource allocation as the Class
Constrained Bin Packing (CCBP) problem where each
server is a bin and each class represents an
application. Items from a specific class represent the
resource demands of the corresponding application.
The class constraint reflects the practical limit on the
number of applications a server can run
simultaneously. For J2EE applications
• Load unit
• Most online CCBP algorithms do not support item
departure.
• minimize application placement changes by adjusting
the existing load distribution.
• Fig. 3 shows an example where
we have two servers and three
applications. Each server can
run at most two applications
simultaneously . We set the load
unit to 20% Each server can
thus satisfy 5 units of load (i.e.,
). Let the resource demands of
the applications be 60%, 20%,
and 120% of the server
capacity, respectively.
• This translates into 3, 1, and 6
items to be packed for their
corresponding classes.
• The CCBP problem is NP-hard.
 Detail of our algorithm: Our algorithm
belongs to the family of color set
algorithms
 Application load increase: The load increase
of an application is modeled as the arrival
of items with the corresponding color .
 Application load decrease: The algorithm
has the freedom to choose which item of
that color to remove.
 Application joins and leaves: When the last
item of a specific color leaves, that color can
be removed from its color set.
If there are, we allocate the
new colors to the unfilled sets
first using the following
add_new_colorsprocedure.in
that use a three procedures that
are a
Procedure add_new_colors
Procedure
consolidate_unfilled_sets
Procedure fill(s1 ,s2 )
ANALYSIS OF THE
APPROXIMATION RATIO
The quality of a polynomial time algorithm A is measured by
its approximation ratio R(A) to the optimal algorithm OPT:
Where is the list of the input sequence and and
are the number of bins used under the algorithm and
the optimal algorithm, respectively
 the problem is NP-hard, no polynomial time optimal
algorithm exists. Hence, the OPT algorithm here serves as
an unrealistic upper bound.
PRACTICAL CONSIDERATIONS
Service
equivalence
Class
constraint
Load change
Optimization
EXPERIMENTS
• Load shifting: We first evaluate the
effectiveness of load shifting in our algorithm
to avoid application placement change.
• Auto scaling :we reduce the request rate of the
application gradually to emulate that the flash
crowd is over. The algorithm scales down the
server resources accordingly to conserve energy.
In experiment demonstrate the
effectiveness of our system in real experiments
SIMULATIONS
• This section evaluates the performance of our
application scheduling algorithm in large scale
simulation.
• We assume that all servers are homogeneous. We
define the“demand ratio”as the ratio between the
aggregate demand of all applications and the
aggregate capacity of all servers.
• The simulator invokes our scheduling algorithm
according to the prepared input and computes
performance metrics such as the number of apms
in the system, the decision time, the application
demand satisfaction ratio, and the number of
placement changes.
 Application demand ratio
 Scalability : We evaluate the scalability of the algorithm by
increasing both the number of servers and the number of
applications from 1000 to 10,000.
Fig. Left shows how the decision time increases with the system
size. The middle figure shows that the demand satisfaction ratio
is independent of the system size .The right figure shows that the
number of placement changes increases linearly with the system
size.
Application number: Next we vary the ratio
between the applications and the servers by increasing the
number of applications from 200 to 2000. The number of
servers is fixed at 1000. The results are shown in Fig The
left figure shows that the decision time is longer (but
remains under 0.5 second) when there are more
applications and when the class constraint is larger. The
middle figure shows that the satisfaction ratio is little
affected by the number of applications. The right figure
shows that when the demand is high, more applications
lead to more placement changes .
CONCLUSION
We presented the design and implementation of a system that
can scale up and down the number of application instances
automatically based on demand. We developed a color set
algorithm to decide the application placement and the load
distribution. Our system achieves high satisfaction ratio of
application demand even when the load is very high. It saves
energy by reducing the number of running instances when the
load is low. There are several directions for future work. Some
cloud service providers may provide multiple levels of services
totheir customers. When the resources become tight, they may
want to give their premium customers a higher demand
satisfaction ratio than other customers. In the future, we plan to
extend our system to support differentiated services but also
consider fairness when allocating the resources across the
applications.
Autoatic scaling of inter net applications for cloud

More Related Content

PDF
Designing Resource-Aware Applications for the Cloud with ABS
DOCX
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Automatic scaling of internet applica...
PPTX
ENMAX Supporting Processes and Improving GIS Data Workflows with FME
PPTX
A Few of My Favorite Things: The Magic that FME Brings to My Life
PPT
Managing Amazon AWS Costs
PPTX
GPX Data ingestion & DX Presentataion at AWS Mumbai Summit 2017
PPTX
2016 Utah Cloud Summit: TCO & Cost Optimization
PDF
IRJET- Commercial Web Application Load Balancing based on Hybrid Cloud
Designing Resource-Aware Applications for the Cloud with ABS
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Automatic scaling of internet applica...
ENMAX Supporting Processes and Improving GIS Data Workflows with FME
A Few of My Favorite Things: The Magic that FME Brings to My Life
Managing Amazon AWS Costs
GPX Data ingestion & DX Presentataion at AWS Mumbai Summit 2017
2016 Utah Cloud Summit: TCO & Cost Optimization
IRJET- Commercial Web Application Load Balancing based on Hybrid Cloud

Viewers also liked (9)

PPTX
Senior resarch project.social entrpreneurship pp
PDF
Fitness academy-training-pl
PDF
Fueling for endurance
PDF
EFMA SME Conference
PPTX
How to Have a Student-Centered Approach to Social Media: Perspectives From Li...
PPTX
PPTX
Woolhouspp
PDF
PPTX
Entrpreneurship development institute of india(edii)
Senior resarch project.social entrpreneurship pp
Fitness academy-training-pl
Fueling for endurance
EFMA SME Conference
How to Have a Student-Centered Approach to Social Media: Perspectives From Li...
Woolhouspp
Entrpreneurship development institute of india(edii)
Ad

Similar to Autoatic scaling of inter net applications for cloud (20)

DOCX
2014 IEEE DOTNET CLOUD COMPUTING PROJECT Automatic scaling of internet applic...
DOCX
IEEE 2014 DOTNET CLOUD COMPUTING PROJECTS Automatic scaling of internet appli...
PDF
Automatic scaling of web applications for cloud computing services
PDF
Learning Software Performance Models for Dynamic and Uncertain Environments
PDF
A Novel Dynamic Priority Based Job Scheduling Approach for Cloud Environment
PDF
Autonomic Resource Provisioning for Cloud-Based Software
PPTX
Licentiate Defense Slide
PDF
Resource scheduling algorithm
PDF
Optimization of FCFS Based Resource Provisioning Algorithm for Cloud Computing
PDF
C017531925
PPTX
Slides ch-5-the definitive guide to cloud computing -by- dan sullivan
PDF
Architecting for the cloud scability-availability
PDF
Fuzzy Control meets Software Engineering
PPTX
GoodFit: Multi-Resource Packing of Tasks with Dependencies
PPTX
GoodFit: Multi-Resource Packing of Tasks with Dependencies
PDF
Scheduling Algorithm Based Simulator for Resource Allocation Task in Cloud Co...
PDF
Resourceconstrained Project Scheduling Models Algorithms Extensions And Appli...
PPTX
Data stream processing and micro service architecture
PDF
Cost-Based Task Scheduling in Cloud Computing
PDF
Psdot 15 performance analysis of cloud computing
2014 IEEE DOTNET CLOUD COMPUTING PROJECT Automatic scaling of internet applic...
IEEE 2014 DOTNET CLOUD COMPUTING PROJECTS Automatic scaling of internet appli...
Automatic scaling of web applications for cloud computing services
Learning Software Performance Models for Dynamic and Uncertain Environments
A Novel Dynamic Priority Based Job Scheduling Approach for Cloud Environment
Autonomic Resource Provisioning for Cloud-Based Software
Licentiate Defense Slide
Resource scheduling algorithm
Optimization of FCFS Based Resource Provisioning Algorithm for Cloud Computing
C017531925
Slides ch-5-the definitive guide to cloud computing -by- dan sullivan
Architecting for the cloud scability-availability
Fuzzy Control meets Software Engineering
GoodFit: Multi-Resource Packing of Tasks with Dependencies
GoodFit: Multi-Resource Packing of Tasks with Dependencies
Scheduling Algorithm Based Simulator for Resource Allocation Task in Cloud Co...
Resourceconstrained Project Scheduling Models Algorithms Extensions And Appli...
Data stream processing and micro service architecture
Cost-Based Task Scheduling in Cloud Computing
Psdot 15 performance analysis of cloud computing
Ad

Recently uploaded (20)

PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Cell Structure & Organelles in detailed.
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Classroom Observation Tools for Teachers
PPTX
Lesson notes of climatology university.
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
GDM (1) (1).pptx small presentation for students
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Computing-Curriculum for Schools in Ghana
PPTX
master seminar digital applications in india
Abdominal Access Techniques with Prof. Dr. R K Mishra
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Cell Structure & Organelles in detailed.
O5-L3 Freight Transport Ops (International) V1.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Module 4: Burden of Disease Tutorial Slides S2 2025
Classroom Observation Tools for Teachers
Lesson notes of climatology university.
Complications of Minimal Access Surgery at WLH
Microbial diseases, their pathogenesis and prophylaxis
STATICS OF THE RIGID BODIES Hibbelers.pdf
Final Presentation General Medicine 03-08-2024.pptx
GDM (1) (1).pptx small presentation for students
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Final Presentation General Medicine 03-08-2024.pptx
Computing-Curriculum for Schools in Ghana
master seminar digital applications in india

Autoatic scaling of inter net applications for cloud

  • 1. AUTOATIC SCALING OF INTER NET APPLICATIONS FOR CLOUD COMPUTING SERVICES P.CHANDU 118T1A0503 ponnagantichandu@gmail.com
  • 3. INTRODUCTION • Benefits of cloud computing service is the resource elasticity • A business customer can scale up and down its resource usage as needed without upfront capital investment or long term commitment. The amazon EC2 service, for example, allows users to buy as many virtual machine (VM) instances as they want and operate them much like physical hardware.
  • 4. • Many internet applications can benefit from an auto scaling property where their resource usage can be scaled up and down automatically by the cloud service provider. • “Pay as you go” model. • Fault tolerance. • It consists of a load balancing switch, A set of application servers, and a set of backend storage servers. INTRODUCTION[1]
  • 5. • The applications store their state information in the backend storage servers. It is important that the applications themselves are stateless so that they can be replicated safely. • The storage servers may also become overloaded, but the focus of this work is on the application tier. Example: the Google app engine service. Fig .two tiered architecture for internet applications
  • 6. SYSTEM ARCHITECTURE • We encapsulate each application instance inside a virtual machine the use virtual machine to provide isolation among untrusted users. • Each server run the XEN hypervisor. • The system described as some decisions: Application placement Load distribution. Effectiveness techniques in ACFIAFCS: • Ballooning technique • Mirroring technology
  • 7. DEFINITIONPROBLEM • Multi dimensional resource demands • Vector bin packing. Automatic scaling problem we want to slove is defined as follows: Our goals are to maximize the demand satisfaction ratio, minimize the placement change frequency and minimize energy consumption. Our optimization objectives can be expressed as follows
  • 8. • To simplify the problem described above, we make the assumption that the servers are homogeneous with uniform capacity. • Then the auto scaling problem is similar to the Class Constrained Bin Packing (CCBP) problem. • The only difference is that the CCBP problem does not have the "Minimize the placement change frequency "goal. • Therefore, in order to solve our problem, we modified the CCBP model to support the "Minimize the placement change frequency "goal
  • 9. CCBPFORAUTOMATICSCALING OFRESOURCES • Bin packing problem: a series of items of different sizes need to be packed into a minimum number of bins. • The goal is to pack the items in to a minimum number of bins. • We can model our resource allocation as the Class Constrained Bin Packing (CCBP) problem where each server is a bin and each class represents an application. Items from a specific class represent the resource demands of the corresponding application. The class constraint reflects the practical limit on the number of applications a server can run simultaneously. For J2EE applications • Load unit • Most online CCBP algorithms do not support item departure. • minimize application placement changes by adjusting the existing load distribution.
  • 10. • Fig. 3 shows an example where we have two servers and three applications. Each server can run at most two applications simultaneously . We set the load unit to 20% Each server can thus satisfy 5 units of load (i.e., ). Let the resource demands of the applications be 60%, 20%, and 120% of the server capacity, respectively. • This translates into 3, 1, and 6 items to be packed for their corresponding classes. • The CCBP problem is NP-hard.
  • 11.  Detail of our algorithm: Our algorithm belongs to the family of color set algorithms  Application load increase: The load increase of an application is modeled as the arrival of items with the corresponding color .  Application load decrease: The algorithm has the freedom to choose which item of that color to remove.  Application joins and leaves: When the last item of a specific color leaves, that color can be removed from its color set.
  • 12. If there are, we allocate the new colors to the unfilled sets first using the following add_new_colorsprocedure.in that use a three procedures that are a Procedure add_new_colors Procedure consolidate_unfilled_sets Procedure fill(s1 ,s2 )
  • 13. ANALYSIS OF THE APPROXIMATION RATIO The quality of a polynomial time algorithm A is measured by its approximation ratio R(A) to the optimal algorithm OPT: Where is the list of the input sequence and and are the number of bins used under the algorithm and the optimal algorithm, respectively  the problem is NP-hard, no polynomial time optimal algorithm exists. Hence, the OPT algorithm here serves as an unrealistic upper bound.
  • 15. EXPERIMENTS • Load shifting: We first evaluate the effectiveness of load shifting in our algorithm to avoid application placement change. • Auto scaling :we reduce the request rate of the application gradually to emulate that the flash crowd is over. The algorithm scales down the server resources accordingly to conserve energy. In experiment demonstrate the effectiveness of our system in real experiments
  • 16. SIMULATIONS • This section evaluates the performance of our application scheduling algorithm in large scale simulation. • We assume that all servers are homogeneous. We define the“demand ratio”as the ratio between the aggregate demand of all applications and the aggregate capacity of all servers. • The simulator invokes our scheduling algorithm according to the prepared input and computes performance metrics such as the number of apms in the system, the decision time, the application demand satisfaction ratio, and the number of placement changes.
  • 17.  Application demand ratio  Scalability : We evaluate the scalability of the algorithm by increasing both the number of servers and the number of applications from 1000 to 10,000. Fig. Left shows how the decision time increases with the system size. The middle figure shows that the demand satisfaction ratio is independent of the system size .The right figure shows that the number of placement changes increases linearly with the system size.
  • 18. Application number: Next we vary the ratio between the applications and the servers by increasing the number of applications from 200 to 2000. The number of servers is fixed at 1000. The results are shown in Fig The left figure shows that the decision time is longer (but remains under 0.5 second) when there are more applications and when the class constraint is larger. The middle figure shows that the satisfaction ratio is little affected by the number of applications. The right figure shows that when the demand is high, more applications lead to more placement changes .
  • 19. CONCLUSION We presented the design and implementation of a system that can scale up and down the number of application instances automatically based on demand. We developed a color set algorithm to decide the application placement and the load distribution. Our system achieves high satisfaction ratio of application demand even when the load is very high. It saves energy by reducing the number of running instances when the load is low. There are several directions for future work. Some cloud service providers may provide multiple levels of services totheir customers. When the resources become tight, they may want to give their premium customers a higher demand satisfaction ratio than other customers. In the future, we plan to extend our system to support differentiated services but also consider fairness when allocating the resources across the applications.