SlideShare a Scribd company logo
A Paradigm Shift: The
Increasing Dominance of
Memory-Oriented Solutions for
High Performance Data Access!

Ben Stopford : RBS
How fast is a HashMap lookup?




~20 ns
That’s how long it takes light to
         travel a room
How fast is a database lookup?




~20 ms
That’s how long it takes light to go
       to Australia and back
3 times
Computers really are very fast!
The problem is we’re quite good at
writing software that slows them down
Desktop Virtualization
We love
abstraction
There are many reasons
why abstraction is a
good idea… 

…performance just isn’t
one of them
Question: is it fair to compare a
Database with a HashMap?
Not really…
Key Point



 On one end of
                     ..on the other sits
the scale sits the
                       the database… 
  HashMap…


 …but it’s a very very long scale that
         sits between them.
Times are changing
Database Architecture is
Aging
A Paradigm Shift: The Increasing Dominance of Memory-Oriented Solutions for High Performance Data Access
The Traditional Architecture
Traditional


       Shared                  Shared
                In Memory
        Disk
                  Nothing


Distributed                        Simpler
In Memory
                         Contract
Simplifying the
   Contract
How big is the internet?



     5 exabytes
              
(which is 5,000 petabytes or
   5,000,000 terabytes)
How big is an average enterprise
           database


   80% < 1TB
           (in 2009)
Simplifying the Contract
Databases have huge operational
          overheads




                             Taken from “OLTP Through
                             the Looking Glass, and What
                             We Found There”
                             Harizopoulos et al
Avoid that overhead with a simpler
contract and avoiding IO
Improving Database Performance !
Shared Disk Architecture




                            Shared
                             Disk
Improving Database Performance !
Shared Nothing Architecture
Each machine is responsible for a subset of the
   records. Each record exists on only one
                  machine.!
                     

                     1, 2, 3…
   97, 98, 99…




             765, 769…
                  169, 170…
  Client
       

                    333, 334…
   244, 245…
Improving Database Performance (3)!
 In Memory Databases!
(single address-space)
Databases must cache subsets of
      the data in memory




             Cache
Not knowing what you don’t know




        90% in Cache


           Data on Disk
If you can fit it ALL in memory you
know everything!!
The architecture of an in memory
            database
Memory is at least 100x faster than disk
               ms
   μs
       ns
           ps

1MB Disk/Network
        1MB Main Memory


          0.000,000,000,000
Cross Continental    Main Memory
                  L1 Cache Ref
Round Trip
          Ref
         Cross Network             L2 Cache Ref
         Round Trip
       * L1 ref is about 2 clock cycles or 0.7ns. This is
                           the time it takes light to travel 20cm
Memory allows random access.
Disk only works well for sequential
reads
This makes them very fast!!
The proof is in the stats. TPC-H
Benchmarks on a 1TB data set
So why haven’t in memory
  databases taken off?
Address-Spaces are relatively small
     and of a finite, fixed size
Durability
One solution is
 distribution
Distributed In Memory (Shared
           Nothing)
Again we spread our data but this time only
               using RAM.




                   1, 2, 3…
   97, 98, 99…




           765, 769…
                  169, 170…
Client
     

                  333, 334…
   244, 245…
Distribution solves our two
          problems
We get massive amounts of parallel
          processing
But at the cost of
loosing the single
  address space
Traditional


       Shared                  Shared
                In Memory
        Disk
                  Nothing


Distributed                        Simpler
In Memory
                         Contract
There are three key themes here:


                  Simplify the
Distribution
                        No Disk
                   contract

                    Improve
  Gain
                    scalability by
  scalability
                    picking          All data is
  through a
                    appropriate      held in RAM
  distributed
                    ACID
  architecture
                    properties.
ODC
ODC – Distributed, Shared Nothing, In
Memory, Semi-Normalised, Graph DB

      450 processes
      2TB of RAM




  Messaging (Topic Based) as a system of record
                  (persistence)
ODC represents a
balance between
 throughput and
     latency
What is Latency?
What is Throughput
Which is best for latency?


                 Shared
                Nothing 
              (Distributed)
Traditional                    In-Memory
Database
                       Database



               Latency?
Which is best for throughput?


                   Shared
                  Nothing 
                (Distributed)
  Traditional                    In-Memory
  Database
                       Database



                 Latency?
So why do we use distributed in
          memory?
                     Plentiful
       In Memory
                    hardware




        Latency
    Throughput
This is the technology of
the now. 
So what is the technology
of the future?
Terabyte Memory Architectures
Fast Persistent Storage
New Innovations on the Horizon
These factors are remolding the
hardware landscape to one where
 memory both vast and durable
This is changing the way we write
             software
Huge servers in the
commodity space are
driving us towards single
process architectures that
utilise many cores and
large address spaces
We can attain hundreds of
thousands of executions
per second from a single
process if it is well
optimised.
“All computers wait at the
same speed” 
!
We need to optimise for our CPU architecture

               ms
   μs
       ns
           ps

1MB Disk/Network
        1MB Main Memory


          0.000,000,000,000
Cross Continental    Main Memory
                  L1 Cache Ref
Round Trip
          Ref
         Cross Network             L2 Cache Ref
         Round Trip
       * L1 ref is about 2 clock cycles or 0.7ns. This is
                           the time it takes light to travel 20cm
Tools like Vtune allow us to
optimise software to truly leverage
           our hardware
So what does this all mean?
Further Reading

More Related Content

PDF
Big Data and the Future of Storage
PDF
Great Article, Thanks Paul Feresten, Sr. Product Marketing Manager, and Rajes...
PDF
Digital Forensics
PPTX
Memory Management in Windows 7
PDF
R1Soft CDP 3.0 Key Features
PPTX
Quantum NDX - NAS Based Data Protection
PDF
How Nyherji Manages High Availability TSM Environments using FlashCopy Manager
PPTX
JetStor NAS 724UXD Dual Controller Active-Active ZFS Based
Big Data and the Future of Storage
Great Article, Thanks Paul Feresten, Sr. Product Marketing Manager, and Rajes...
Digital Forensics
Memory Management in Windows 7
R1Soft CDP 3.0 Key Features
Quantum NDX - NAS Based Data Protection
How Nyherji Manages High Availability TSM Environments using FlashCopy Manager
JetStor NAS 724UXD Dual Controller Active-Active ZFS Based

What's hot (19)

PDF
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
PPTX
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
PPTX
The future of tape april 16
PPTX
How swift is your Swift - SD.pptx
PPSX
Free Presentation ... I-Safe
PPSX
Free Presentation I Safe
ODP
ZFS by PWR 2013
PDF
An IBM Storage Solution for Small and Mid-size Businesses -- The IBM Storwize...
PDF
8 considerations for evaluating disk based backup solutions
PDF
Openstorage Openstack
PPTX
Quantum Enterprise Expansion Product Suite
PPT
39 virtual memory
PPT
Data Storage Devices Holography
PDF
PPTX
Storage class memory
PDF
Automated Storage Tiering
PPTX
Vancouver bug enterprise storage and zfs
PPTX
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
PDF
Virtual Private Server Documentation
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
The future of tape april 16
How swift is your Swift - SD.pptx
Free Presentation ... I-Safe
Free Presentation I Safe
ZFS by PWR 2013
An IBM Storage Solution for Small and Mid-size Businesses -- The IBM Storwize...
8 considerations for evaluating disk based backup solutions
Openstorage Openstack
Quantum Enterprise Expansion Product Suite
39 virtual memory
Data Storage Devices Holography
Storage class memory
Automated Storage Tiering
Vancouver bug enterprise storage and zfs
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
Virtual Private Server Documentation
Ad

Similar to A Paradigm Shift: The Increasing Dominance of Memory-Oriented Solutions for High Performance Data Access (20)

PPTX
Advanced databases ben stopford
PDF
Beyond The Data Grid: Coherence, Normalisation, Joins and Linear Scalability
PDF
IAP09 CUDA@MIT 6.963 - Guest Lecture: Out-of-Core Programming with NVIDIA's C...
PPTX
002-Storage Basics and Application Environments V1.0.pptx
PDF
In-Memory Computing: Myths and Facts
PPTX
in-memory database system and low latency
PPTX
Storing data in windows server 2012 ss
PPTX
State of the Art Thin Provisioning
PPTX
PPTX
PPTX
CS 542 Putting it all together -- Storage Management
PDF
SOUG_SDM_OracleDB_V3
PDF
Memory-Based Cloud Architectures
PPTX
SAN BASICS..Why we will go for SAN?
PDF
Linux and H/W optimizations for MySQL
PPTX
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
PDF
Designs, Lessons and Advice from Building Large Distributed Systems
ODP
Low level java programming
PPT
Cache memory presentation
PPTX
Computer Memory Architecture and Evolution
Advanced databases ben stopford
Beyond The Data Grid: Coherence, Normalisation, Joins and Linear Scalability
IAP09 CUDA@MIT 6.963 - Guest Lecture: Out-of-Core Programming with NVIDIA's C...
002-Storage Basics and Application Environments V1.0.pptx
In-Memory Computing: Myths and Facts
in-memory database system and low latency
Storing data in windows server 2012 ss
State of the Art Thin Provisioning
CS 542 Putting it all together -- Storage Management
SOUG_SDM_OracleDB_V3
Memory-Based Cloud Architectures
SAN BASICS..Why we will go for SAN?
Linux and H/W optimizations for MySQL
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
Designs, Lessons and Advice from Building Large Distributed Systems
Low level java programming
Cache memory presentation
Computer Memory Architecture and Evolution
Ad

More from Ben Stopford (20)

PPTX
10 Principals for Effective Event-Driven Microservices with Apache Kafka
PPTX
10 Principals for Effective Event Driven Microservices
PDF
The Future of Streaming: Global Apps, Event Stores and Serverless
PDF
A Global Source of Truth for the Microservices Generation
PDF
Building Event Driven Services with Kafka Streams
PDF
NDC London 2017 - The Data Dichotomy- Rethinking Data and Services with Streams
PDF
Building Event Driven Services with Apache Kafka and Kafka Streams - Devoxx B...
PDF
Building Event Driven Services with Stateful Streams
PDF
Devoxx London 2017 - Rethinking Services With Stateful Streams
PDF
Event Driven Services Part 2: Building Event-Driven Services with Apache Kafka
PDF
Event Driven Services Part 1: The Data Dichotomy
PDF
Event Driven Services Part 3: Putting the Micro into Microservices with State...
PDF
Strata Software Architecture NY: The Data Dichotomy
PDF
The Power of the Log
PDF
Streaming, Database & Distributed Systems Bridging the Divide
PDF
Data Pipelines with Apache Kafka
PDF
JAX London Slides
PDF
Microservices for a Streaming World
PDF
A little bit of clojure
PPTX
Big iron 2 (published)
10 Principals for Effective Event-Driven Microservices with Apache Kafka
10 Principals for Effective Event Driven Microservices
The Future of Streaming: Global Apps, Event Stores and Serverless
A Global Source of Truth for the Microservices Generation
Building Event Driven Services with Kafka Streams
NDC London 2017 - The Data Dichotomy- Rethinking Data and Services with Streams
Building Event Driven Services with Apache Kafka and Kafka Streams - Devoxx B...
Building Event Driven Services with Stateful Streams
Devoxx London 2017 - Rethinking Services With Stateful Streams
Event Driven Services Part 2: Building Event-Driven Services with Apache Kafka
Event Driven Services Part 1: The Data Dichotomy
Event Driven Services Part 3: Putting the Micro into Microservices with State...
Strata Software Architecture NY: The Data Dichotomy
The Power of the Log
Streaming, Database & Distributed Systems Bridging the Divide
Data Pipelines with Apache Kafka
JAX London Slides
Microservices for a Streaming World
A little bit of clojure
Big iron 2 (published)

Recently uploaded (20)

PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Big Data Technologies - Introduction.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPT
Teaching material agriculture food technology
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Encapsulation_ Review paper, used for researhc scholars
Understanding_Digital_Forensics_Presentation.pptx
Review of recent advances in non-invasive hemoglobin estimation
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Big Data Technologies - Introduction.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Teaching material agriculture food technology
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Programs and apps: productivity, graphics, security and other tools
Electronic commerce courselecture one. Pdf
NewMind AI Weekly Chronicles - August'25 Week I
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
MIND Revenue Release Quarter 2 2025 Press Release
The Rise and Fall of 3GPP – Time for a Sabbatical?
The AUB Centre for AI in Media Proposal.docx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx

A Paradigm Shift: The Increasing Dominance of Memory-Oriented Solutions for High Performance Data Access