SlideShare a Scribd company logo
© Copyright 2017 NETMONASTERY Inc
Thinking Beyond the
Human Envelope
A Quick Workshop on Data Science using Machine Learning
1
Shomiron DAS GUPTA - Founder, CEO
NETMONASTERY Inc.
© Copyright 2017 NETMONASTERY Inc
Agenda
■ More on Analytics, ML and DL
■ Why Analytics today?
■ Where does Analytics take us
■ Machine Learning, Simply
■ The Royal Family of ML
■ Solving with ML
■ Deep Learning - Where to NEXT !
2
Building Context Grounds Up
© Copyright 2017 NETMONASTERY Inc
More on Analytics, ML and DL
Data Science has many children -
3
Let’s Get it Straight - Data Science is Big
Data Analytics Machine Learning Deep Learning
Data Analytics
4
© Copyright 2017 NETMONASTERY Inc
Why Analytics Today?
We created 90% of the world's data in the last 2 years
Largest customer in 2014 was generating 4TB / Month,
the same customer today is generating 69TB / Month
Need to look through and analyze the recorded past - there is a clear want to
reference and learn from our data, more so in the cyber security
Horizontal scaling is the key - brings speed with consistency
5
Bringing History Back into Cyber Security
© Copyright 2017 NETMONASTERY Inc
Where Does Analytics Take Us?
What Works Well with Big Data Analytics Engines
Thresholding / Multi-Dimensional Thresholding
Time Series Analytics including Heuristics
Baseline / Profilers
Correlators
6
Different Forks Available to Process Big Data
Wheels on Fire
Machine Learning
7
© Copyright 2017 NETMONASTERY Inc
Machine Learning, Simply
8
It’s the ability to make machines think and take experiential decisions,
it’s the ability of the machine to think beyond the human envelope.
Speed / Accuracy of decision making
Ability to perform “logically” in unknown conditions
Let’s Try to Define Machine Learning
© Copyright 2017 NETMONASTERY Inc
Reinforced Learning
9
© Copyright 2017 NETMONASTERY Inc
The Royal Family of ML
First - Classification and Regression
■ Trees - Decision Trees
■ Forests - Random Forests
■ Support Vector Machines
■ K Nearest Neighbours (KNN)
■ Linear / Logistic Regression
■ Neural Networks
10
Describing the Tip of the Iceberg
© Copyright 2017 NETMONASTERY Inc
How it Works - Simple Decision Tree
The Basics
Features
Labels
Dataset
Training
Testing
11
Consistent Decision Making - is something we could all use ;)
The Process
Selecting the right features
Moderating and cleaning the dataset
Splitting the dataset - train / test
Building and training a model
Testing for accuracy - chaining / pipelining etc.
© Copyright 2017 NETMONASTERY Inc
Solving with ML
■ Email SPAM !
■ Detecting / Generating DGN’s
■ Network Anomaly
Stuff we (DNIF) does
■ Detect InBots
■ Bad traffic to Applications
■ UBM - Bad Access Attempt
12
Cyber Security Challenges that have their roots in ML
© Copyright 2017 NETMONASTERY Inc
Deep Learning - Where to NEXT !
Google with Deep Learning
■ Voice API
■ Video API
■ Image API
■ Translation API
■ Speech API
■ Relearning
■ finally TensorFlow
13
What is currently available - Where are we trying to go...
© Copyright 2017 NETMONASTERY Inc
Where to Begin
1. Massive resources on YouTube
2. Best if you learn - Python / R ….. python will take you the farthest
3. Work on practical challenges - seek answers
4. Big data platforms with built-in logic
Splunk.com - Leader in Big Data Analytics
DNIF.it - Complete toolkit. 100G Free Forever
14
NO, YOU ARE NOT LATE !
© Copyright 2017 NETMONASTERY Inc
Thank You
email: shom@dnif.it
15

More Related Content

PPTX
Cloudera Sessions for Big Data & AI Highlights
PPT
SMART PAPER D.U.A.L. book
PPT
My data dual book 4.5.2017
PPTX
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
PDF
Big Data LDN 2017: Machine Learning: What Works And What They Won’t Tell You
PDF
Where is my big data: security, privacy and jurisdictions in the cloud
PDF
Big Data LDN 2017: Real World Impact of a Global Data Fabric
PDF
Making Big Data Work
Cloudera Sessions for Big Data & AI Highlights
SMART PAPER D.U.A.L. book
My data dual book 4.5.2017
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
Big Data LDN 2017: Machine Learning: What Works And What They Won’t Tell You
Where is my big data: security, privacy and jurisdictions in the cloud
Big Data LDN 2017: Real World Impact of a Global Data Fabric
Making Big Data Work

What's hot (11)

PPTX
Battling Skynet: The Role of Humanity in Artificial Intelligence
PPTX
Lets Talk Google BigQuery
PPTX
Stitch labs presentation
PDF
HPE Discover London 2016 Highlights
POT
Andy Lewis of Kovarus
DOCX
Publications
PDF
Cloud Infrastructure: Changing Today's World
PDF
5 important trends in big data cloud & big data services
PDF
About Me - Vinay Pandey
PDF
Event Report Equifax EFXForum 2017 - More International & DaaS
PPTX
ELT is Better. Here's Why.
Battling Skynet: The Role of Humanity in Artificial Intelligence
Lets Talk Google BigQuery
Stitch labs presentation
HPE Discover London 2016 Highlights
Andy Lewis of Kovarus
Publications
Cloud Infrastructure: Changing Today's World
5 important trends in big data cloud & big data services
About Me - Vinay Pandey
Event Report Equifax EFXForum 2017 - More International & DaaS
ELT is Better. Here's Why.
Ad

Similar to Workshop on Data Science at Best Practices Meet 2017, Data Security Council of India (20)

PPTX
Neural networks with python
PPTX
Big Sky Earth 2018 Introduction to machine learning
PPTX
Big data, big opportunities
PPTX
L15.pptx
PDF
Developer's Introduction to Machine Learning
PDF
Nexxworks bootcamp ML6 (27/09/2017)
PPTX
Machine Learning AND Deep Learning for OpenPOWER
PDF
General introduction to AI ML DL DS
PPTX
Machine Learning with Spark
PDF
ML crash course
PDF
Intro to machine learning
PDF
Demystifying Machine Learning - How to give your business superpowers.
PDF
The Data Science Process - Do we need it and how to apply?
PDF
Choosing a Machine Learning technique to solve your need
PDF
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
PPTX
Workshop_Presentation.pptx
PPTX
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
PDF
AIIA - Charting the Path to Intelligent Operations with Machine Learning - At...
PDF
machine learning basic unit1 for third year cse studnets
PDF
ML.pdf
Neural networks with python
Big Sky Earth 2018 Introduction to machine learning
Big data, big opportunities
L15.pptx
Developer's Introduction to Machine Learning
Nexxworks bootcamp ML6 (27/09/2017)
Machine Learning AND Deep Learning for OpenPOWER
General introduction to AI ML DL DS
Machine Learning with Spark
ML crash course
Intro to machine learning
Demystifying Machine Learning - How to give your business superpowers.
The Data Science Process - Do we need it and how to apply?
Choosing a Machine Learning technique to solve your need
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
Workshop_Presentation.pptx
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
AIIA - Charting the Path to Intelligent Operations with Machine Learning - At...
machine learning basic unit1 for third year cse studnets
ML.pdf
Ad

Recently uploaded (20)

PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Lecture1 pattern recognition............
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Database Infoormation System (DBIS).pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Clinical guidelines as a resource for EBP(1).pdf
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
IB Computer Science - Internal Assessment.pptx
Reliability_Chapter_ presentation 1221.5784
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Acceptance and paychological effects of mandatory extra coach I classes.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Business Acumen Training GuidePresentation.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Introduction-to-Cloud-ComputingFinal.pptx
Supervised vs unsupervised machine learning algorithms
oil_refinery_comprehensive_20250804084928 (1).pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Lecture1 pattern recognition............
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Database Infoormation System (DBIS).pptx
.pdf is not working space design for the following data for the following dat...
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx

Workshop on Data Science at Best Practices Meet 2017, Data Security Council of India

  • 1. © Copyright 2017 NETMONASTERY Inc Thinking Beyond the Human Envelope A Quick Workshop on Data Science using Machine Learning 1 Shomiron DAS GUPTA - Founder, CEO NETMONASTERY Inc.
  • 2. © Copyright 2017 NETMONASTERY Inc Agenda ■ More on Analytics, ML and DL ■ Why Analytics today? ■ Where does Analytics take us ■ Machine Learning, Simply ■ The Royal Family of ML ■ Solving with ML ■ Deep Learning - Where to NEXT ! 2 Building Context Grounds Up
  • 3. © Copyright 2017 NETMONASTERY Inc More on Analytics, ML and DL Data Science has many children - 3 Let’s Get it Straight - Data Science is Big Data Analytics Machine Learning Deep Learning
  • 5. © Copyright 2017 NETMONASTERY Inc Why Analytics Today? We created 90% of the world's data in the last 2 years Largest customer in 2014 was generating 4TB / Month, the same customer today is generating 69TB / Month Need to look through and analyze the recorded past - there is a clear want to reference and learn from our data, more so in the cyber security Horizontal scaling is the key - brings speed with consistency 5 Bringing History Back into Cyber Security
  • 6. © Copyright 2017 NETMONASTERY Inc Where Does Analytics Take Us? What Works Well with Big Data Analytics Engines Thresholding / Multi-Dimensional Thresholding Time Series Analytics including Heuristics Baseline / Profilers Correlators 6 Different Forks Available to Process Big Data Wheels on Fire
  • 8. © Copyright 2017 NETMONASTERY Inc Machine Learning, Simply 8 It’s the ability to make machines think and take experiential decisions, it’s the ability of the machine to think beyond the human envelope. Speed / Accuracy of decision making Ability to perform “logically” in unknown conditions Let’s Try to Define Machine Learning
  • 9. © Copyright 2017 NETMONASTERY Inc Reinforced Learning 9
  • 10. © Copyright 2017 NETMONASTERY Inc The Royal Family of ML First - Classification and Regression ■ Trees - Decision Trees ■ Forests - Random Forests ■ Support Vector Machines ■ K Nearest Neighbours (KNN) ■ Linear / Logistic Regression ■ Neural Networks 10 Describing the Tip of the Iceberg
  • 11. © Copyright 2017 NETMONASTERY Inc How it Works - Simple Decision Tree The Basics Features Labels Dataset Training Testing 11 Consistent Decision Making - is something we could all use ;) The Process Selecting the right features Moderating and cleaning the dataset Splitting the dataset - train / test Building and training a model Testing for accuracy - chaining / pipelining etc.
  • 12. © Copyright 2017 NETMONASTERY Inc Solving with ML ■ Email SPAM ! ■ Detecting / Generating DGN’s ■ Network Anomaly Stuff we (DNIF) does ■ Detect InBots ■ Bad traffic to Applications ■ UBM - Bad Access Attempt 12 Cyber Security Challenges that have their roots in ML
  • 13. © Copyright 2017 NETMONASTERY Inc Deep Learning - Where to NEXT ! Google with Deep Learning ■ Voice API ■ Video API ■ Image API ■ Translation API ■ Speech API ■ Relearning ■ finally TensorFlow 13 What is currently available - Where are we trying to go...
  • 14. © Copyright 2017 NETMONASTERY Inc Where to Begin 1. Massive resources on YouTube 2. Best if you learn - Python / R ….. python will take you the farthest 3. Work on practical challenges - seek answers 4. Big data platforms with built-in logic Splunk.com - Leader in Big Data Analytics DNIF.it - Complete toolkit. 100G Free Forever 14 NO, YOU ARE NOT LATE !
  • 15. © Copyright 2017 NETMONASTERY Inc Thank You email: shom@dnif.it 15