SlideShare a Scribd company logo
1©2017 Check Point Software Technologies Ltd.
BIG DATA FORUM
ALEXANDER FOK, BIG DATA
ARCHITECT
MARCH 2017 MEETING
2©2017 Check Point Software Technologies Ltd.
• Big Data Forum Scope
• Big Data Projects in CP
• No SQL Overview
Agenda
3©2017 Check Point Software Technologies Ltd.
Big Data Forum Scope
• Engineers talk about Engineering
• Bring ideas, dilemmas, problems, technologies, reference architectures
• Ask QUESTIONS
• What is Big Data Problem?
4©2017 Check Point Software Technologies Ltd.
Simple Exercise
• What is more expensive
• Network IO
• Disk IO
• RAM Memory IO
• CPU Cache IO
• CPU Context switch
• Google interview question
5©2017 Check Point Software Technologies Ltd.
How long would it take to copy 1TB file?
• USB 2 ~480 Mbps
• Network 1 Gbps
• SATA drive 6 Gbps
• SAS 12 Gbps
• SSD PCIe 10 Gbps
• Disk to disk
1TB/500Mbps = ~5h
• Over LAN 1TB/250Mbps
= ~10h
• Over WAN
1TB/100Mbps = ~25h
6©2017 Check Point Software Technologies Ltd.
Some World Numbers
• NYSE – 1TB/day
• Airbus A380 640TB/flight (10GB/s)
• Verizon – 1M Events/s, 20 B/day
7©2017 Check Point Software Technologies Ltd.
Big Data Forum for You
• What is Big Data Forum for you?
• What is Big Data for you?
• What do you expect from this forum?
• What are you willing to contribute to this forum?
8©2017 Check Point Software Technologies Ltd.
Some CP Projects
• Threat Cloud infrastructure – distributed threats DBs handles hundreds
of millions of threat indicators (IPs, files, domains, etc)
• Threat Cloud DataLake
• MTP infrastructure – AWS based applications store
• GWs TI Access Logs (AV, URL filtering, AB, etc) – 300 GB/day, 20K
events/s
• CPDiag 300GB/day diagnostic data from CPDiag enabled GWs
9©2017 Check Point Software Technologies Ltd.
Rule of Thumb
• If you can solve the problem with more RAM – do it
• If you can solve the problem with more CPUs (threads) – do it
• Else – you have to distribute the solution
10©2017 Check Point Software Technologies Ltd.
Machine Learning vs Big Data Problems
Big Data
Machine
Learning
11©2017 Check Point Software Technologies Ltd.
• Volume – a lot of data to collect and make accessible
• Velocity – processed quickly at a rapid pace (correlations, enrichments,
etc)
• Variety – no predefined data schema (veracity)
Big Data Three V’s Challenges
12©2017 Check Point Software Technologies Ltd.
• Sizing
• Duplication prevention
• Correlation
• Data Integrity, Consistency
• Visualization
• Integration with other systems
• Retention Policy, Distribution, Regulations, Security
• Storage and Backup
Big Data Three V’s Challenges in Real Life
Main Challenge - How can we
handle x10, 100, 1000 times more
load?
13©2017 Check Point Software Technologies Ltd.
• Scale out architecture
• Cheap storage
• Reasonable computation times for various data exploration scenarios:
̶ Key-value lookup
̶ Documents search
̶ Generic filtering and aggregation analysis - batch processing
̶ Interactive queries
• Stream Processing
̶ (Near) Real Time Complex Events Processing
NoSQL Selected Problems
14©2017 Check Point Software Technologies Ltd.
NoSQL Trend Overview
• Database providing mechanism for storage and retrieval of data other than
tabular relations used by RDBMS
• Data collection, visualization, access management – not always are part of
the solution ecosystem
• What now – everybody is looking for SQL over noSQL, or NewSQL
• Compromised Consistency (CAP Theorem)
• How many NoSQL DBs are there?
15©2017 Check Point Software Technologies Ltd.
CAP (Brewer's)
Theorem
16©2017 Check Point Software Technologies Ltd.
NoSQL – Where To?
• We have a lot of data
• What next?
• Emerging Computational Models
17©2017 Check Point Software Technologies Ltd.
NoSQL Types
• Key-Value stores
• Document stores
• Search Engines
• Column stores
• Graph DBs
• Time Series
• RDF, Object Oriented, Multivalue
• Cloud Provided Solutions
18©2017 Check Point Software Technologies Ltd.
Key-Value Stores
• map concept – key->value
̶ Value can be any object (blob)
• Cache – in memory
• Store – have solid persistency model
• Additional query mechanisms (secondary indexes, time series)
• Abstract Data Types (data structure and algorithms on them) – sets,
hashes, lists, queues, etc
• Distribution
• Aerospike, Couchbase, Dynamo, Redis, Riak, Hbase, Cassandra
19©2017 Check Point Software Technologies Ltd.
Document stores
• Semi structured data store
• key-value retrieval and APIs based on document properties
• Collections, tags, metadata, etc
• Search engines
• Couchbase (CouchDB + Memcached), MongoDB, , Riak, ElasticSearch
20©2017 Check Point Software Technologies Ltd.
Search Engines
• Limited Document Stores with poor raw data storage capabilities
• Strong indexing mechanisms at ingestion time
• Complex query capabilities
• ElasticSearch, SolR, Splunk
21©2017 Check Point Software Technologies Ltd.
Column stores
• Good for append only scenarios
• Good for batch data insertion
• Good for analytics queries requiring massive data partial reads
• Bad for updates
• Bad for specific objects searches
• Bad for real time analytics
• Vertica, GreenPlum, Cassandra, File systems based solutions
• OLAP usage, except Cassandra
22©2017 Check Point Software Technologies Ltd.
Column stores
23©2017 Check Point Software Technologies Ltd.
Graph DBs
• Graph traversal queries
̶ Social Networks Recommendation engines
• Data Modeling - not about implementation mechanism
• Scale problems
• Query Standardization issues – Cypher, SPARQL, XQuery, Gremlin
• Neo4j, TITAN
• Modern approach – computational engine oriented solutions – Spark
Graph, ELK Graph
24©2017 Check Point Software Technologies Ltd.
Time Series
• Used for meters performance counters analysis – e.g. monitoring systems
(CPU, RAM, etc)
• Riak TS, TSDB,
• https://prometheus.io/
• http://influxdb.com/
• http://opentsdb.net/
• https://github.com/kairosdb/kairosdb
25©2017 Check Point Software Technologies Ltd.
Cloud Provided Solutions
• Azure Tables
• Amazon DynamoDB
• Google BigTable, Spanner
26©2017 Check Point Software Technologies Ltd.
Tip of The Day
• Apache is cemetery full of dead bodies
̶ No one size fits all solutiontechnologyarchitecture
̶ COOL is not always RIGHT
• http://nosql-database.org/
• http://db-engines.com
27©2017 Check Point Software Technologies Ltd.
• Use of low language query languages (C++, Java, Python) and tools
(Kibana, Jupyter Notebooks)
• Lack of standardization
• Integrative challenge
• No all size fits all solutions – REAL NEED to KNOW and UNDERSTAND the
underlying technology
NoSQL Adaptation Barriers
28©2017 Check Point Software Technologies Ltd.
Yuval Noah Harari on Big Data, Google and the end of
free will
http://www.ft.com/cms/s/2/50bb4830-6a4c-11e6-ae5b-a7cc5dd5a28c.html
29©2017 Check Point Software Technologies Ltd.
“Listen, Google,”
• “Well, I know you from the day you were born. I have read all your emails, recorded all your phone calls, and
know your favourite films, your DNA and the entire biometric history of your heart. I have exact data about each
date you went on, and I can show you second-by-second graphs of your heart rate, blood pressure and sugar
levels whenever you went on a date with John or Paul. And, naturally enough, I know them as well as I know you.
Based on all this information, on my superb algorithms and on decades’ worth of statistics about millions of
relationships — I advise you to go with John, with an 87 per cent probability of being more satisfied with him in
the long run.
• “Indeed, I know you so well that I even know you don’t like this answer. Paul is much more handsome than John
and, because you give external appearances too much weight, you secretly wanted me to say ‘Paul’. Looks
matter, of course, but not as much as you think. Your biochemical algorithms — which evolved tens of thousands
of years ago in the African savannah — give external beauty a weight of 35 per cent in their overall rating of
potential mates. My algorithms — which are based on the most up-to-date studies and statistics — say that looks
have only a 14 per cent impact on the long-term success of romantic relationships. So, even though I took Paul’s
beauty into account, I still tell you that you would be better off with John.”
30©2017 Check Point Software Technologies Ltd.
Alexander Fok, Big Data Architect
THANK YOU

More Related Content

PPTX
Telco analytics at scale
PDF
Spark at Airbnb
PDF
Apache Spark Listeners: A Crash Course in Fast, Easy Monitoring
PDF
Building Robust Production Data Pipelines with Databricks Delta
PDF
Unifying Streaming and Historical Telemetry Data For Real-time Performance Re...
PPTX
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
PDF
03-NOV-1510-Ognjen-Antonic-Telemach-stream-1
PDF
Low-latency data applications with Kafka and Agg indexes | Tino Tereshko, Fir...
Telco analytics at scale
Spark at Airbnb
Apache Spark Listeners: A Crash Course in Fast, Easy Monitoring
Building Robust Production Data Pipelines with Databricks Delta
Unifying Streaming and Historical Telemetry Data For Real-time Performance Re...
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
03-NOV-1510-Ognjen-Antonic-Telemach-stream-1
Low-latency data applications with Kafka and Agg indexes | Tino Tereshko, Fir...

What's hot (20)

PDF
Geosp.AI.tial: Applying Big Data and Machine Learning to Solve the World's To...
PDF
GPU Acceleration for Financial Services
PDF
Detecting Mobile Malware with Apache Spark with David Pryce
PPTX
Indexing 3-dimensional trajectories: Apache Spark and Cassandra integration
PDF
Streamsets and spark
PDF
How To Achieve Real-Time Analytics On A Data Lake Using GPUs
PDF
Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan Volz
PPTX
Realtime streaming architecture in INFINARIO
PPTX
SplunkLive! Hunk Technical Deep Dive
PDF
An Update on Scaling Data Science Applications with SparkR in 2018 with Heiko...
PDF
WSO2Con ASIA 2016: Patterns for Deploying Analytics in the Real World
PDF
Operationalizing Machine Learning Using GPU-accelerated, In-database Analytics
PDF
DataStax & O'Reilly Media: Large Scale Data Analytics with Spark and Cassandr...
PDF
An End-to-End Spark-Based Machine Learning Stack in the Hybrid Cloud with Far...
PPTX
Spline 2 - Vision and Architecture Overview
PDF
Data Warehousing with Spark Streaming at Zalando
PDF
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
PDF
Webinar: Fusion for Data Science
PPTX
The evolution of the big data platform @ Netflix (OSCON 2015)
PDF
Streamsets and spark in Retail
Geosp.AI.tial: Applying Big Data and Machine Learning to Solve the World's To...
GPU Acceleration for Financial Services
Detecting Mobile Malware with Apache Spark with David Pryce
Indexing 3-dimensional trajectories: Apache Spark and Cassandra integration
Streamsets and spark
How To Achieve Real-Time Analytics On A Data Lake Using GPUs
Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan Volz
Realtime streaming architecture in INFINARIO
SplunkLive! Hunk Technical Deep Dive
An Update on Scaling Data Science Applications with SparkR in 2018 with Heiko...
WSO2Con ASIA 2016: Patterns for Deploying Analytics in the Real World
Operationalizing Machine Learning Using GPU-accelerated, In-database Analytics
DataStax & O'Reilly Media: Large Scale Data Analytics with Spark and Cassandr...
An End-to-End Spark-Based Machine Learning Stack in the Hybrid Cloud with Far...
Spline 2 - Vision and Architecture Overview
Data Warehousing with Spark Streaming at Zalando
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
Webinar: Fusion for Data Science
The evolution of the big data platform @ Netflix (OSCON 2015)
Streamsets and spark in Retail
Ad

Similar to Check Point Big Data Forum m3 (20)

PDF
BigData Behind-the-Scenes~20150827
PPTX
Big Data Overview 2013-2014
PDF
Dba to data scientist -Satyendra
PPTX
Big data business case
PPSX
Big data with Hadoop - Introduction
PDF
Big Data for One Big Family
PPTX
Lessons learned from designing a QA Automation for analytics databases (big d...
PDF
Big Data Rampage
PDF
The datascientists workplace of the future, IBM developerDays 2014, Vienna by...
PPTX
Big data explanation with real time use case
PDF
LUISS - Deep Learning and data analyses - 09/01/19
PDF
Big Data - Module 1
PPT
Big data analytics, survey r.nabati
PDF
Big data rmoug
PPTX
Introduction to Big Data
KEY
SLQ vs NOSQL - friends or foes
PPTX
NoSQL A brief look at Apache Cassandra Distributed Database
PPTX
Big data presentation
PPTX
Big Data - An Overview
PPT
Big Data
BigData Behind-the-Scenes~20150827
Big Data Overview 2013-2014
Dba to data scientist -Satyendra
Big data business case
Big data with Hadoop - Introduction
Big Data for One Big Family
Lessons learned from designing a QA Automation for analytics databases (big d...
Big Data Rampage
The datascientists workplace of the future, IBM developerDays 2014, Vienna by...
Big data explanation with real time use case
LUISS - Deep Learning and data analyses - 09/01/19
Big Data - Module 1
Big data analytics, survey r.nabati
Big data rmoug
Introduction to Big Data
SLQ vs NOSQL - friends or foes
NoSQL A brief look at Apache Cassandra Distributed Database
Big data presentation
Big Data - An Overview
Big Data
Ad

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Cloud computing and distributed systems.
PDF
Machine learning based COVID-19 study performance prediction
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Approach and Philosophy of On baking technology
Understanding_Digital_Forensics_Presentation.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Unlocking AI with Model Context Protocol (MCP)
MYSQL Presentation for SQL database connectivity
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Chapter 3 Spatial Domain Image Processing.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
20250228 LYD VKU AI Blended-Learning.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
Spectral efficient network and resource selection model in 5G networks
Encapsulation_ Review paper, used for researhc scholars
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
Per capita expenditure prediction using model stacking based on satellite ima...
Diabetes mellitus diagnosis method based random forest with bat algorithm
Mobile App Security Testing_ A Comprehensive Guide.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf

Check Point Big Data Forum m3

  • 1. 1©2017 Check Point Software Technologies Ltd. BIG DATA FORUM ALEXANDER FOK, BIG DATA ARCHITECT MARCH 2017 MEETING
  • 2. 2©2017 Check Point Software Technologies Ltd. • Big Data Forum Scope • Big Data Projects in CP • No SQL Overview Agenda
  • 3. 3©2017 Check Point Software Technologies Ltd. Big Data Forum Scope • Engineers talk about Engineering • Bring ideas, dilemmas, problems, technologies, reference architectures • Ask QUESTIONS • What is Big Data Problem?
  • 4. 4©2017 Check Point Software Technologies Ltd. Simple Exercise • What is more expensive • Network IO • Disk IO • RAM Memory IO • CPU Cache IO • CPU Context switch • Google interview question
  • 5. 5©2017 Check Point Software Technologies Ltd. How long would it take to copy 1TB file? • USB 2 ~480 Mbps • Network 1 Gbps • SATA drive 6 Gbps • SAS 12 Gbps • SSD PCIe 10 Gbps • Disk to disk 1TB/500Mbps = ~5h • Over LAN 1TB/250Mbps = ~10h • Over WAN 1TB/100Mbps = ~25h
  • 6. 6©2017 Check Point Software Technologies Ltd. Some World Numbers • NYSE – 1TB/day • Airbus A380 640TB/flight (10GB/s) • Verizon – 1M Events/s, 20 B/day
  • 7. 7©2017 Check Point Software Technologies Ltd. Big Data Forum for You • What is Big Data Forum for you? • What is Big Data for you? • What do you expect from this forum? • What are you willing to contribute to this forum?
  • 8. 8©2017 Check Point Software Technologies Ltd. Some CP Projects • Threat Cloud infrastructure – distributed threats DBs handles hundreds of millions of threat indicators (IPs, files, domains, etc) • Threat Cloud DataLake • MTP infrastructure – AWS based applications store • GWs TI Access Logs (AV, URL filtering, AB, etc) – 300 GB/day, 20K events/s • CPDiag 300GB/day diagnostic data from CPDiag enabled GWs
  • 9. 9©2017 Check Point Software Technologies Ltd. Rule of Thumb • If you can solve the problem with more RAM – do it • If you can solve the problem with more CPUs (threads) – do it • Else – you have to distribute the solution
  • 10. 10©2017 Check Point Software Technologies Ltd. Machine Learning vs Big Data Problems Big Data Machine Learning
  • 11. 11©2017 Check Point Software Technologies Ltd. • Volume – a lot of data to collect and make accessible • Velocity – processed quickly at a rapid pace (correlations, enrichments, etc) • Variety – no predefined data schema (veracity) Big Data Three V’s Challenges
  • 12. 12©2017 Check Point Software Technologies Ltd. • Sizing • Duplication prevention • Correlation • Data Integrity, Consistency • Visualization • Integration with other systems • Retention Policy, Distribution, Regulations, Security • Storage and Backup Big Data Three V’s Challenges in Real Life Main Challenge - How can we handle x10, 100, 1000 times more load?
  • 13. 13©2017 Check Point Software Technologies Ltd. • Scale out architecture • Cheap storage • Reasonable computation times for various data exploration scenarios: ̶ Key-value lookup ̶ Documents search ̶ Generic filtering and aggregation analysis - batch processing ̶ Interactive queries • Stream Processing ̶ (Near) Real Time Complex Events Processing NoSQL Selected Problems
  • 14. 14©2017 Check Point Software Technologies Ltd. NoSQL Trend Overview • Database providing mechanism for storage and retrieval of data other than tabular relations used by RDBMS • Data collection, visualization, access management – not always are part of the solution ecosystem • What now – everybody is looking for SQL over noSQL, or NewSQL • Compromised Consistency (CAP Theorem) • How many NoSQL DBs are there?
  • 15. 15©2017 Check Point Software Technologies Ltd. CAP (Brewer's) Theorem
  • 16. 16©2017 Check Point Software Technologies Ltd. NoSQL – Where To? • We have a lot of data • What next? • Emerging Computational Models
  • 17. 17©2017 Check Point Software Technologies Ltd. NoSQL Types • Key-Value stores • Document stores • Search Engines • Column stores • Graph DBs • Time Series • RDF, Object Oriented, Multivalue • Cloud Provided Solutions
  • 18. 18©2017 Check Point Software Technologies Ltd. Key-Value Stores • map concept – key->value ̶ Value can be any object (blob) • Cache – in memory • Store – have solid persistency model • Additional query mechanisms (secondary indexes, time series) • Abstract Data Types (data structure and algorithms on them) – sets, hashes, lists, queues, etc • Distribution • Aerospike, Couchbase, Dynamo, Redis, Riak, Hbase, Cassandra
  • 19. 19©2017 Check Point Software Technologies Ltd. Document stores • Semi structured data store • key-value retrieval and APIs based on document properties • Collections, tags, metadata, etc • Search engines • Couchbase (CouchDB + Memcached), MongoDB, , Riak, ElasticSearch
  • 20. 20©2017 Check Point Software Technologies Ltd. Search Engines • Limited Document Stores with poor raw data storage capabilities • Strong indexing mechanisms at ingestion time • Complex query capabilities • ElasticSearch, SolR, Splunk
  • 21. 21©2017 Check Point Software Technologies Ltd. Column stores • Good for append only scenarios • Good for batch data insertion • Good for analytics queries requiring massive data partial reads • Bad for updates • Bad for specific objects searches • Bad for real time analytics • Vertica, GreenPlum, Cassandra, File systems based solutions • OLAP usage, except Cassandra
  • 22. 22©2017 Check Point Software Technologies Ltd. Column stores
  • 23. 23©2017 Check Point Software Technologies Ltd. Graph DBs • Graph traversal queries ̶ Social Networks Recommendation engines • Data Modeling - not about implementation mechanism • Scale problems • Query Standardization issues – Cypher, SPARQL, XQuery, Gremlin • Neo4j, TITAN • Modern approach – computational engine oriented solutions – Spark Graph, ELK Graph
  • 24. 24©2017 Check Point Software Technologies Ltd. Time Series • Used for meters performance counters analysis – e.g. monitoring systems (CPU, RAM, etc) • Riak TS, TSDB, • https://prometheus.io/ • http://influxdb.com/ • http://opentsdb.net/ • https://github.com/kairosdb/kairosdb
  • 25. 25©2017 Check Point Software Technologies Ltd. Cloud Provided Solutions • Azure Tables • Amazon DynamoDB • Google BigTable, Spanner
  • 26. 26©2017 Check Point Software Technologies Ltd. Tip of The Day • Apache is cemetery full of dead bodies ̶ No one size fits all solutiontechnologyarchitecture ̶ COOL is not always RIGHT • http://nosql-database.org/ • http://db-engines.com
  • 27. 27©2017 Check Point Software Technologies Ltd. • Use of low language query languages (C++, Java, Python) and tools (Kibana, Jupyter Notebooks) • Lack of standardization • Integrative challenge • No all size fits all solutions – REAL NEED to KNOW and UNDERSTAND the underlying technology NoSQL Adaptation Barriers
  • 28. 28©2017 Check Point Software Technologies Ltd. Yuval Noah Harari on Big Data, Google and the end of free will http://www.ft.com/cms/s/2/50bb4830-6a4c-11e6-ae5b-a7cc5dd5a28c.html
  • 29. 29©2017 Check Point Software Technologies Ltd. “Listen, Google,” • “Well, I know you from the day you were born. I have read all your emails, recorded all your phone calls, and know your favourite films, your DNA and the entire biometric history of your heart. I have exact data about each date you went on, and I can show you second-by-second graphs of your heart rate, blood pressure and sugar levels whenever you went on a date with John or Paul. And, naturally enough, I know them as well as I know you. Based on all this information, on my superb algorithms and on decades’ worth of statistics about millions of relationships — I advise you to go with John, with an 87 per cent probability of being more satisfied with him in the long run. • “Indeed, I know you so well that I even know you don’t like this answer. Paul is much more handsome than John and, because you give external appearances too much weight, you secretly wanted me to say ‘Paul’. Looks matter, of course, but not as much as you think. Your biochemical algorithms — which evolved tens of thousands of years ago in the African savannah — give external beauty a weight of 35 per cent in their overall rating of potential mates. My algorithms — which are based on the most up-to-date studies and statistics — say that looks have only a 14 per cent impact on the long-term success of romantic relationships. So, even though I took Paul’s beauty into account, I still tell you that you would be better off with John.”
  • 30. 30©2017 Check Point Software Technologies Ltd. Alexander Fok, Big Data Architect THANK YOU

Editor's Notes