SlideShare a Scribd company logo
CarolinaCon 11
One Step Closer to the Matrix: Machine
Learning and Augmented Reality in
Streaming Data
Rob Weiss
John Eberhardt
What’s the Story?
• Rob and John have been working together for years
• Rob is a Network Engineer and Hacker
• John is a Data Scientist and Architect
• Two Great Tastes that Taste Great Together
• Different perspectives bring new answers
• Rob and John are interested in how to create a paradigm
shift in user interaction with data and network security
• We are also probably slightly insane
CarolinaCon 11
The Defender’s Challenge
• The attacker has an inherent advantage – no rules!
• So the defense problem is asymmetric
• Classical methods fail more rapidly as computing power
becomes cheaper and more readily available
• The Fortress or “Big Walls” security model is outdated and,
frankly, ineffective
• Qualified people are in short supply
• Can we crowdsource network defense?
CarolinaCon 11
How We Got Started
• A research project in a galaxy far, far away
• We started modeling zero day attacks
• We combined machine learning and streaming analytics to
detect novel patterns statistically
• It worked well enough, but there were limitations
• Not sensitive enough
• Not specific enough
• Proprietary software limited flexibility
• It still required a pretty sophisticated operator – and
those are in short supply
• So . . .
CarolinaCon 11
Taking a Different Approach
CarolinaCon 11
• Could we do for raw data what GUIs did for computers
and revolutionize human interaction with data?
• Complex streaming analytics are not tractable to the
human
• The “last mile” requires a user interface that creates flow
for the human analyst out of data
• Harness the power of metaphor to explain complex
concepts to the human analyst (e.g. Windows)
• Streaming Analytics + Streaming User Experience = “Data
Looming”
• Can we really make a prosthetic for the brain?
What? Don’t Flip Out . . .
CarolinaCon 11
Data Looming
• Can you point out every individual thread and show me
how it is woven? Probably not.
• Can you tell me what it is? I sure hope so!
CarolinaCon 11
Data Looming
Watch threads on a loom – to the naked eye,
the loom is too complex and moving too
quickly for you to pick out the details, but you
can quickly see when the overall pattern
changes – usually within very few iterations. A
simple, intuitive, scalable visualization of
streaming analytics allows the human analyst
to connect the “last mile” of disconnected
events and is at the heart of what we are doing
– merging complex streaming analytics with the
sparse pattern detection capabilities of the
human brain.
Pattern Recognition is For the Birds
A child can learn to recognize this pattern in 15 seconds, but a
computer still can’t.
#1 - Eagle #2 - Swan #3 - ????
CarolinaCon 11
Getting to The Big Idea
Zero Day Work
William Gibson’s
Neuromancer The Matrix
John Maeda’s Simplicity
by Design
Open Source Network Expertise Data Science
Expertise
Crowdsourcing
Hacktastic Innovation Explosion!!!
CarolinaCon 11
How I Did It by Victor Frankenstein
• Accelerate data analysis by extending streaming analytics to
broader groups of less skilled human analysts
• Combine the speed, precision and recall of a computer,
through an immersive interface, with the inherent sparse
pattern recognition capabilities of the human brain
• Streaming Analytics allow for rapid, real time
adjudication of data and make the user experience
dynamic
• An immersive user experience makes complex analytics
data “real” to the human and enables experiential
learning
• Combining them in a single environment enables sparse
pattern recognition in dynamic systems
CarolinaCon 11
How I Did It Continued (Abby Normal)
• Data: Streaming data from sensors, collectors, files, etc.
• Platform: Streaming analytics process and analyze these
data, including attribution to the real world
• Visual Language Construct: Integrates streaming data,
streaming analytics, and streaming user experience in a
pluggable architecture
• Streaming User Experience: Immersive 3-D user experience
allows analysts to interact directly with streaming data and
analytics
CarolinaCon 11
Architecture (Meet the Architect)
Data Sensor
(N+1)
Data Collector
(N+1)
Kafka
Zookeeper
Kafka
Queue
Nimbus
Worker Node
Storm
Trident-ML
Analytics
Platform
Visual Language
Construct
Streaming User
Experience
Analytics and
Countermeasures
Game Players
CarolinaCon 11
Design Principles
Principle Enables
Open Source Components Supports integration of streaming analytics and immersive user
experience to create a dynamic feedback loop –rapidly adapt
the platform from lessons learned from human experience
Streaming Analytics Accelerating analytics to keep pace with data collection
(facilitating high collection rate)
Immersive Streaming User
Experience
Extending the user interface to allow broader groups of analysts
to use sophisticated analytics (addressing the recruiting
challenge)
Pluggable Architecture “Bring your own” tools and analytics supports crowdsourcing
and allows for aggressive exploitation of new analytics and user
experience paradigms
CarolinaCon 11
Larry Byrd: Network Defender of the Future
A basketball player can watch your network. When an attack occurs, our player can quickly
identify pattern shift using the same brain computation as when the player identifies a
shift in the offensive strategy of the opposing basketball team. Think about this as a data
prosthetic for the human brain.
CarolinaCon 11
Enough of Us Talking at You
• Fight fire with fire – crowdsource all comers and create an
asymmetric defense
• Align economic incentives, human behaviors, and defense
objectives
• Do for data what GUIs did for computers – make it
accessible!
• This isn’t about technology . . . it’s about revolutionizing the
way humans interact with data to enable a game-changing
leap forward
CarolinaCon 11
Innovation Is Often Strange
CarolinaCon 11
But Wait, There’s More!
Altamira Technologies Corporation 2014
CarolinaCon 11
Demo Concept
Concept
• Normal work environment – “normal” patterns give way to aberrations
• This behavior is focused on network data, but could easily be any other
streaming data
Design
• Analytics cluster traffic based on source and destination port patterns
over time using k-means clustering
• Cubes represent nodes on the network; streaming spheres represent
packets
• Colors represent the behavior of nodes / packets based upon traffic –
Green is a client, Blue is a Server, Yellow is “undetermined behavior”
CarolinaCon 11
Green (client) Blue (server) Yellow (??)
Source Centroid 54760 1001 5066
Dest Centroid 791 54518 5511
Questions I Can Ask
• Is a given node on the network behaving as expected?
• Watch the node colors - they should be consistent in a normal network:
some white nodes, a lot of blue (client) nodes, and some green nodes.
What happens over time?
• Does my use of source and destination ports mark me out as a client or server?
Does my role appear consistent or change?
• The node colors indicate what they are – watch the colors of the nodes –
machines should have clear and consistent roles
• Is my pattern of nodes that I am interacting with consistent? Am I interacting
with different partners?
• Watch the stream patterns – machines should interact with consistent
groups
• Do my behaviors adhere to regular time cycles? Can I apply time cycles to any of
the above (e.g., a workday)?
• Watch the patterns change as cyclical time progresses in our “workday”
CarolinaCon 11
DEMO TIME!
Altamira Technologies Corporation 2014
CarolinaCon 11
About Rob and John
• Rob Weiss is a senior systems engineer at G2 (www.g2-
inc.com) with over 24 years of experience in government
and commercial markets. He started with Legos and is now a
tool builder and problem solver. Currently runs the Altamira
Red Team and performs information security research,
looking for hard problems to solve. Twitter: @3XPlo1T2
• John Eberhardt is a Data Scientist at 3E Services
(www.3eservicesllc.com) with 20 years of quantitative
problem solving and a penchant for trying to decipher
symbolism in obscure 16th century literature. John has
experience in analytical problem solving in healthcare, life
sciences, security, financial services, consumer products,
and transportation. Twitter: @JohnSEberhardt3
CarolinaCon 11
Repositories
• Apache Storm: https://github.com/apache/storm
• Trident-ML: https://github.com/pmerienne/trident-ml
• Rob Weiss: https://github.com/j105rob
CarolinaCon 11
Squiggly (probably won’t use this)
• A self organizing system consists of groups A, B, and C
interacting
• Hence, the current state of A is {A|B,C}
• They influence each other {B|A,C}, {C|A,B} which means
the system is described by f{{A|B,C},{B|A,C},{C|A,B}}
• However these groups are neither unitary nor static,
which means at any given time they can have sub-
attributes {Ai...An}, {Bi...Bn}, {Ci...Cn} that are unknown
• So now the system is described by f{{Ai | {Bi...Bn},
{Ci...Cn}},{Bi |{Ai...An}, {Ci...Cn}},{Ci |{Ai...An}, {Bi...Bn}}}
• How do you solve this np-hard problem?

More Related Content

PPTX
Tales from an ip worker in consulting and software
PDF
Secure Because Math: A Deep-Dive on Machine Learning-Based Monitoring (#Secur...
PPTX
2016 FS-ISAC Annual Summit (Miami) - Developing Effective Encryption Strategies
PDF
Biting into the Jawbreaker: Pushing the Boundaries of Threat Hunting Automation
PDF
Sharing is Caring: Understanding and Measuring Threat Intelligence Sharing Ef...
PDF
SANS CTI Summit 2016 - Data-Driven Threat Intelligence: Sharing
PDF
Future is private intel dev fest
PDF
Five Ways To Do Data Analytics "The Wrong Way"
Tales from an ip worker in consulting and software
Secure Because Math: A Deep-Dive on Machine Learning-Based Monitoring (#Secur...
2016 FS-ISAC Annual Summit (Miami) - Developing Effective Encryption Strategies
Biting into the Jawbreaker: Pushing the Boundaries of Threat Hunting Automation
Sharing is Caring: Understanding and Measuring Threat Intelligence Sharing Ef...
SANS CTI Summit 2016 - Data-Driven Threat Intelligence: Sharing
Future is private intel dev fest
Five Ways To Do Data Analytics "The Wrong Way"

What's hot (20)

PDF
Decentralized AI Draper
PDF
Scaling AI in production using PyTorch
PDF
Lessons from building a stream-first metadata platform | Shirshanka Das, Stealth
PPTX
AI in the Enterprise at Scale
PDF
Beyond Matching: Applying Data Science Techniques to IOC-based Detection
PPTX
AI in the Enterprise - Looking Forward
PDF
Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
PPTX
Introduction to NetGuardians' Big Data Software Stack
PDF
Industry of Things World - Berlin 19-09-16
PPTX
BsidesLVPresso2016_JZeditsv6
PDF
Measuring the IQ of your Threat Intelligence Feeds (#tiqtest)
PDF
How to Create 80% of a Big Data Pilot Project
PPTX
Correlation Analysis Modeling Use Case - IBM Power Systems
PDF
Just because you can doesn't mean that you should - thingmonk 2016
PDF
EclipseCon France 2015 - Science Track
PPTX
Real time streaming analytics
PPTX
Era ofdataeconomyv4short
PDF
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
PDF
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
PDF
Adam Coates at AI Frontiers: AI for 100 Million People with Deep Learning
Decentralized AI Draper
Scaling AI in production using PyTorch
Lessons from building a stream-first metadata platform | Shirshanka Das, Stealth
AI in the Enterprise at Scale
Beyond Matching: Applying Data Science Techniques to IOC-based Detection
AI in the Enterprise - Looking Forward
Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
Introduction to NetGuardians' Big Data Software Stack
Industry of Things World - Berlin 19-09-16
BsidesLVPresso2016_JZeditsv6
Measuring the IQ of your Threat Intelligence Feeds (#tiqtest)
How to Create 80% of a Big Data Pilot Project
Correlation Analysis Modeling Use Case - IBM Power Systems
Just because you can doesn't mean that you should - thingmonk 2016
EclipseCon France 2015 - Science Track
Real time streaming analytics
Era ofdataeconomyv4short
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Adam Coates at AI Frontiers: AI for 100 Million People with Deep Learning
Ad

Viewers also liked (14)

PPTX
Passive voice
PDF
Las 5 redes sociales
PDF
Ship construction project
PPTX
Tutorial sobre Eclipse
PPT
Note names part 1 ©
PPTX
Guide cоздание сайта на office 365.
PDF
PM 201: Emotional Intelligence for Project Managers
PDF
When should you get your house painted
PPTX
Practica 2
PPT
Sorrow The Movie
PPT
Note names part 7 ©
DOC
Máy điều hòa tủ lạnh
PPTX
Sales Tax Compliance within Oracle E-Business Suite / JD Edwards / PeopleSoft
PPTX
Pėdų šveitiklių micro pedi ir scholl velvet smooth palyginimas
Passive voice
Las 5 redes sociales
Ship construction project
Tutorial sobre Eclipse
Note names part 1 ©
Guide cоздание сайта на office 365.
PM 201: Emotional Intelligence for Project Managers
When should you get your house painted
Practica 2
Sorrow The Movie
Note names part 7 ©
Máy điều hòa tủ lạnh
Sales Tax Compliance within Oracle E-Business Suite / JD Edwards / PeopleSoft
Pėdų šveitiklių micro pedi ir scholl velvet smooth palyginimas
Ad

Similar to CarolinaCon Presentation on Streaming Analytics (20)

PDF
Unexperienced pasts
PDF
Big Data : Bits of History, Words of Advice
PPT
A First Step Towards Stream Reasoning at FIS 2008
PPT
Evolving Future Information Systems: Challenges, Perspectives and Applications
PDF
Smart Cities, Open Data and SMW - SMWCon Spring 2012 Keynote
PDF
20100423sage
PPTX
Looking into the future with web media analytics marshall sponder - montreal...
PPTX
Message passing & NoSQL (in English)
PDF
The FAIR data movement and 22 Feb 2023.pdf
PDF
GalvanizeU Seattle: Eleven Almost-Truisms About Data
DOCX
COM6905 Research Methods And Professional Issues.docx
PDF
Patterns (and Anti-Patterns) for Developing Machine Learning Systems
PDF
Eventshop 120721
PPTX
Cloud Programming Models: eScience, Big Data, etc.
PDF
Big Data And Analytics: A Summary Of The X 4.0 Era
PDF
Hadoop.mapreduce
PDF
Notes on data-intensive processing with Hadoop Mapreduce
PDF
FAIR data_ Superior data visibility and reuse without warehousing.pdf
PDF
Designing and Implementing Information Systems with Event Modeling, Bobby Cal...
PPTX
FIWARE and Smart Data Models
Unexperienced pasts
Big Data : Bits of History, Words of Advice
A First Step Towards Stream Reasoning at FIS 2008
Evolving Future Information Systems: Challenges, Perspectives and Applications
Smart Cities, Open Data and SMW - SMWCon Spring 2012 Keynote
20100423sage
Looking into the future with web media analytics marshall sponder - montreal...
Message passing & NoSQL (in English)
The FAIR data movement and 22 Feb 2023.pdf
GalvanizeU Seattle: Eleven Almost-Truisms About Data
COM6905 Research Methods And Professional Issues.docx
Patterns (and Anti-Patterns) for Developing Machine Learning Systems
Eventshop 120721
Cloud Programming Models: eScience, Big Data, etc.
Big Data And Analytics: A Summary Of The X 4.0 Era
Hadoop.mapreduce
Notes on data-intensive processing with Hadoop Mapreduce
FAIR data_ Superior data visibility and reuse without warehousing.pdf
Designing and Implementing Information Systems with Event Modeling, Bobby Cal...
FIWARE and Smart Data Models

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Getting Started with Data Integration: FME Form 101
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Encapsulation theory and applications.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Approach and Philosophy of On baking technology
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
cuic standard and advanced reporting.pdf
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
MYSQL Presentation for SQL database connectivity
20250228 LYD VKU AI Blended-Learning.pptx
Getting Started with Data Integration: FME Form 101
The Rise and Fall of 3GPP – Time for a Sabbatical?
Mobile App Security Testing_ A Comprehensive Guide.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Building Integrated photovoltaic BIPV_UPV.pdf
Encapsulation theory and applications.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Per capita expenditure prediction using model stacking based on satellite ima...
Approach and Philosophy of On baking technology
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
cuic standard and advanced reporting.pdf
Spectroscopy.pptx food analysis technology
Programs and apps: productivity, graphics, security and other tools
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Electronic commerce courselecture one. Pdf
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
MYSQL Presentation for SQL database connectivity

CarolinaCon Presentation on Streaming Analytics

  • 1. CarolinaCon 11 One Step Closer to the Matrix: Machine Learning and Augmented Reality in Streaming Data Rob Weiss John Eberhardt
  • 2. What’s the Story? • Rob and John have been working together for years • Rob is a Network Engineer and Hacker • John is a Data Scientist and Architect • Two Great Tastes that Taste Great Together • Different perspectives bring new answers • Rob and John are interested in how to create a paradigm shift in user interaction with data and network security • We are also probably slightly insane CarolinaCon 11
  • 3. The Defender’s Challenge • The attacker has an inherent advantage – no rules! • So the defense problem is asymmetric • Classical methods fail more rapidly as computing power becomes cheaper and more readily available • The Fortress or “Big Walls” security model is outdated and, frankly, ineffective • Qualified people are in short supply • Can we crowdsource network defense? CarolinaCon 11
  • 4. How We Got Started • A research project in a galaxy far, far away • We started modeling zero day attacks • We combined machine learning and streaming analytics to detect novel patterns statistically • It worked well enough, but there were limitations • Not sensitive enough • Not specific enough • Proprietary software limited flexibility • It still required a pretty sophisticated operator – and those are in short supply • So . . . CarolinaCon 11
  • 5. Taking a Different Approach CarolinaCon 11 • Could we do for raw data what GUIs did for computers and revolutionize human interaction with data? • Complex streaming analytics are not tractable to the human • The “last mile” requires a user interface that creates flow for the human analyst out of data • Harness the power of metaphor to explain complex concepts to the human analyst (e.g. Windows) • Streaming Analytics + Streaming User Experience = “Data Looming” • Can we really make a prosthetic for the brain?
  • 6. What? Don’t Flip Out . . . CarolinaCon 11
  • 7. Data Looming • Can you point out every individual thread and show me how it is woven? Probably not. • Can you tell me what it is? I sure hope so! CarolinaCon 11 Data Looming Watch threads on a loom – to the naked eye, the loom is too complex and moving too quickly for you to pick out the details, but you can quickly see when the overall pattern changes – usually within very few iterations. A simple, intuitive, scalable visualization of streaming analytics allows the human analyst to connect the “last mile” of disconnected events and is at the heart of what we are doing – merging complex streaming analytics with the sparse pattern detection capabilities of the human brain.
  • 8. Pattern Recognition is For the Birds A child can learn to recognize this pattern in 15 seconds, but a computer still can’t. #1 - Eagle #2 - Swan #3 - ???? CarolinaCon 11
  • 9. Getting to The Big Idea Zero Day Work William Gibson’s Neuromancer The Matrix John Maeda’s Simplicity by Design Open Source Network Expertise Data Science Expertise Crowdsourcing Hacktastic Innovation Explosion!!! CarolinaCon 11
  • 10. How I Did It by Victor Frankenstein • Accelerate data analysis by extending streaming analytics to broader groups of less skilled human analysts • Combine the speed, precision and recall of a computer, through an immersive interface, with the inherent sparse pattern recognition capabilities of the human brain • Streaming Analytics allow for rapid, real time adjudication of data and make the user experience dynamic • An immersive user experience makes complex analytics data “real” to the human and enables experiential learning • Combining them in a single environment enables sparse pattern recognition in dynamic systems CarolinaCon 11
  • 11. How I Did It Continued (Abby Normal) • Data: Streaming data from sensors, collectors, files, etc. • Platform: Streaming analytics process and analyze these data, including attribution to the real world • Visual Language Construct: Integrates streaming data, streaming analytics, and streaming user experience in a pluggable architecture • Streaming User Experience: Immersive 3-D user experience allows analysts to interact directly with streaming data and analytics CarolinaCon 11
  • 12. Architecture (Meet the Architect) Data Sensor (N+1) Data Collector (N+1) Kafka Zookeeper Kafka Queue Nimbus Worker Node Storm Trident-ML Analytics Platform Visual Language Construct Streaming User Experience Analytics and Countermeasures Game Players CarolinaCon 11
  • 13. Design Principles Principle Enables Open Source Components Supports integration of streaming analytics and immersive user experience to create a dynamic feedback loop –rapidly adapt the platform from lessons learned from human experience Streaming Analytics Accelerating analytics to keep pace with data collection (facilitating high collection rate) Immersive Streaming User Experience Extending the user interface to allow broader groups of analysts to use sophisticated analytics (addressing the recruiting challenge) Pluggable Architecture “Bring your own” tools and analytics supports crowdsourcing and allows for aggressive exploitation of new analytics and user experience paradigms CarolinaCon 11
  • 14. Larry Byrd: Network Defender of the Future A basketball player can watch your network. When an attack occurs, our player can quickly identify pattern shift using the same brain computation as when the player identifies a shift in the offensive strategy of the opposing basketball team. Think about this as a data prosthetic for the human brain. CarolinaCon 11
  • 15. Enough of Us Talking at You • Fight fire with fire – crowdsource all comers and create an asymmetric defense • Align economic incentives, human behaviors, and defense objectives • Do for data what GUIs did for computers – make it accessible! • This isn’t about technology . . . it’s about revolutionizing the way humans interact with data to enable a game-changing leap forward CarolinaCon 11
  • 16. Innovation Is Often Strange CarolinaCon 11
  • 17. But Wait, There’s More! Altamira Technologies Corporation 2014 CarolinaCon 11
  • 18. Demo Concept Concept • Normal work environment – “normal” patterns give way to aberrations • This behavior is focused on network data, but could easily be any other streaming data Design • Analytics cluster traffic based on source and destination port patterns over time using k-means clustering • Cubes represent nodes on the network; streaming spheres represent packets • Colors represent the behavior of nodes / packets based upon traffic – Green is a client, Blue is a Server, Yellow is “undetermined behavior” CarolinaCon 11 Green (client) Blue (server) Yellow (??) Source Centroid 54760 1001 5066 Dest Centroid 791 54518 5511
  • 19. Questions I Can Ask • Is a given node on the network behaving as expected? • Watch the node colors - they should be consistent in a normal network: some white nodes, a lot of blue (client) nodes, and some green nodes. What happens over time? • Does my use of source and destination ports mark me out as a client or server? Does my role appear consistent or change? • The node colors indicate what they are – watch the colors of the nodes – machines should have clear and consistent roles • Is my pattern of nodes that I am interacting with consistent? Am I interacting with different partners? • Watch the stream patterns – machines should interact with consistent groups • Do my behaviors adhere to regular time cycles? Can I apply time cycles to any of the above (e.g., a workday)? • Watch the patterns change as cyclical time progresses in our “workday” CarolinaCon 11
  • 20. DEMO TIME! Altamira Technologies Corporation 2014 CarolinaCon 11
  • 21. About Rob and John • Rob Weiss is a senior systems engineer at G2 (www.g2- inc.com) with over 24 years of experience in government and commercial markets. He started with Legos and is now a tool builder and problem solver. Currently runs the Altamira Red Team and performs information security research, looking for hard problems to solve. Twitter: @3XPlo1T2 • John Eberhardt is a Data Scientist at 3E Services (www.3eservicesllc.com) with 20 years of quantitative problem solving and a penchant for trying to decipher symbolism in obscure 16th century literature. John has experience in analytical problem solving in healthcare, life sciences, security, financial services, consumer products, and transportation. Twitter: @JohnSEberhardt3 CarolinaCon 11
  • 22. Repositories • Apache Storm: https://github.com/apache/storm • Trident-ML: https://github.com/pmerienne/trident-ml • Rob Weiss: https://github.com/j105rob CarolinaCon 11
  • 23. Squiggly (probably won’t use this) • A self organizing system consists of groups A, B, and C interacting • Hence, the current state of A is {A|B,C} • They influence each other {B|A,C}, {C|A,B} which means the system is described by f{{A|B,C},{B|A,C},{C|A,B}} • However these groups are neither unitary nor static, which means at any given time they can have sub- attributes {Ai...An}, {Bi...Bn}, {Ci...Cn} that are unknown • So now the system is described by f{{Ai | {Bi...Bn}, {Ci...Cn}},{Bi |{Ai...An}, {Ci...Cn}},{Ci |{Ai...An}, {Bi...Bn}}} • How do you solve this np-hard problem?