SlideShare a Scribd company logo
A Software Design and Algorithms for Multicore Capture in Data Center Forensics
.
On the Way IN: DC Forensics
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 2/28
...
2/28
.
Forensics Basics
.
(traditional) Forensics Stages...
..
.... are collection, examination, analysis, and reporting
• many challenges in data centers
• collection: realtime is really really really difficult
• examiation: you can't examine what you can't collect, also flexibility is important
• analysis: deeper form of examination, same problems
• reporting: that part is actually easy, but DCs have no standards
◦ one standard is offered later in this presentation
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 3/28
...
3/28
.
Forensics : All is Traffic
.
Statement
..
.All information in data centers can be reduced to the traffic form
• logs are information carried on packets
• logging, storage, etc. are distributed -- have to be communicate using
traffic
• a corrolary: if something is not traffic, it might be useful to convert it into
traffic
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 4/28
...
4/28
.
Practical DC Forensics
• we want Deep Packet Inspection (DPI) back on the table
• we want to not use sampling, but capture everything
• we want to differentiate attention spent to different classes of
traffic
◦ called context-based sampling
◦ probability of capture/inspection depends on current context
• note: all these are gradually removed from practice for infeasibility
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 5/28
...
5/28
.
Conventional Multicore
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 6/28
...
6/28
.
Generic Multicore Design
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 7/28
...
7/28
.
Generic Multicore Capture
• 2 roles: manager and
core
• traditional parallel
processing: message
passing or shared
memory 05 06
05 M.Aldinucci+2 "FastFlow: Efficient Parallel Streaming Applications on Multi-core" U.Pisa Techreport (2009)
06 R.Brightwell "Workshop on Managed Many-Core Systems" 1st Managed Many-Core Systems (2008)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 8/28
...
8/28
.
Conventional Shortcomings
.
Reality is...
..
.
... that traditional parallel processing designs are extremely inefficient
on multicore
• overhead from parallelization is too high
• unit of processing is too small
• streamline designs are rare but are recently discussed in BigData 08
.
The solution is...
..
.... to use a lockfree (message-less) parallelization design
08 R.Chen+2 "Tiled-MapReduce: Optimizing Resource Usages ... on Multicore with Tiling" 19th PACT (2010)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 9/28
...
9/28
.
Conventional → Proposed
• spawn, but don't wait to merge
• collect results form cores
continuously to avoid lumps
• get used to not being able to
communicate to cores (no
messages)
◦ relatively short tasks diminish this
effect 02
02
myself+0 "Experiments with Practical On-Demand Multi-Core
Packet Capture" APNOMS (2013)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 10/28
...
10/28
.
Proposal : the New Multicore
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 11/28
...
11/28
.
Proposal : Mission Statement
.
Proposal Components
..
.
• lockfree design
• tasks-into-cores packing problem and optimization
• implementation that support lockfree design
• remember: the easiest way to aggregate traffic is to use IP address prefixes
• again, generic, so we do not care about the contents
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 12/28
...
12/28
.
Proposal : Shared Memory
• communication happens over
shared memory
04
• C/C++ implementation
is common, but will work in
other languages as well
• shared memory is persistent,
but cores come and go
04 K.Michael "The Linux Programming Interface" No Starch Press (2010)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 13/28
...
13/28
.
Proposal : DLL is Key
.
DLL stands for...
..
.... Double Linked List
• common in C/C++
designs
• extremely flexible --
you can swap
elements by
reassigning pointers
• sideways
DLL is a method
to avoid collisions in
hashing
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 14/28
...
14/28
.
Optimization
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 15/28
...
15/28
.
Optimization Targets
• few cores, many data units
• need to pack latter into former
• moreover: scheduling problem, which is packing but along the timeline
• moreover(2) : when packing, do you randomize input or not -- hashing
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 16/28
...
16/28
.
Prefix Packing Problem
minimize w1count(P) + w2max(M) + w3var(C)
subject of k1 < pi < k2 ∀ pi ∈ P.
Hashkey
- 32 bits0 -
k1
(shortest) k2
(longest)
Effective
range
Core0 Core1 Core2 …
p
(prefix)
p1
p3
p2 p4 p5
p6
p8
p7
m
(max)
n
Prefix Packing Problem
• prefix length between k1 and k2s
◦ hashkey or raw
◦ fixed in each run in this paper
• pi is a pack (group) of items
• n total items, mapped to set M of
prefixes in each of m cores
• C a set of item counts c across
prefixes,
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 17/28
...
17/28
.
Prefix Packing GA Heuristic
• Generic Algorithm (GA) 12
• chromosome is a tuple of prefixes packed into one core
gi = ⟨pi,1, pi,2, ..., pi,m⟩. (1)
• one gene (whole solution) is a tuple containing all chromosomes
Gj = ⟨g1, g2, ..., gn⟩. (2)
12 D.Knysh+1 "Parallel Genetic Algorithms: a Survey and Problem State of the Art" IJCSS (2010)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 18/28
...
18/28
.
Analysis
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 19/28
...
19/28
.
Analysis Setup
• actual packet traces -- trace-based simulation 16
• input: 2 cases -- hashing verus raw
• items are individual packets
◦ packets are packed into prefixes
◦ prefixes are packed into cores
• the above GA optimization heuristic
16 myself "MAWI Working Group Traffic Archive" http://mawi.wide.ad.jp/mawi (2014)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 20/28
...
20/28
.
Analysis (1) Cores
0 1 2 3 4 5 6 7 8 9
Time sequence
4.6
4.7
4.8
4.9
5
5.1
5.2
5.3
5.4
5.5
log(maxitemcount/core)
1 core
2 cores
3 cores
4 cores
5 cores
6 cores
7 cores
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 21/28
...
21/28
.
Analysis (2) Hashing
0 0.2 0.4 0.6 0.8 1
Increasing cutoff parameter
0
40
80
120
160
200
240
Numberofuniqueprefixes
hashed
raw
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 22/28
...
22/28
.
Forensics 2.0
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 23/28
...
23/28
.
Forensics 2.0
• reporting part: let's use sketches from data streaming 11
Core 1
Core 1
Core X
TABID
Manager
Now(replay)
….
BIG DATA TIMELINE
Cursor
Time
Direction
One Sketch One SketchOne Sketch
Start End End End
Read/prepare
Shared Memory
Start
11 M.Sung+3 "Scalable and Efficient Data Streaming Algorithms for Detecting Common Content..." ICDE (2006)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 24/28
...
24/28
.
Wrapup
• a natively multicore technology is proposed
• performance is opitimized using a packing heuristic
• raw input is found to be preferable to randomization
• future topics:
1. variable-length prefixes
2. optimization along the timeline
3. jitter minimization (fewer reasignments)
4. further lookup optimiation -- fast hashing
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 25/28
...
25/28
.
That’s all, thank you ...
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28
...
26/28
.
[01] myself+0 (2013)
...community-based architecture for measuring E2E QoS at DCc
IJCSE
[02] myself+0 (2013)
Experiments with Practical On-Demand Multi-Core Packet Capture
APNOMS
[03] myself+1 (2013)
A Graphical Method for Detection of Flash Crowds in Traffic
Telecom. Systems (TM)
[04] K.Michael (2010)
The Linux Programming Interface
No Starch Press
[05] M.Aldinucci+2 (2009)
FastFlow: Efficient Parallel Streaming Applications on Multi-core
U.Pisa Techreport
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28
...
26/28
.
[06] R.Brightwell (2008)
Workshop on Managed Many-Core Systems
1st Managed Many-Core Systems
[07] X.Sui+3 (2010)
Parallel Graph Partitioning on Multicore Architectures
23rd LCPC
[08] R.Chen+2 (2010)
Tiled-MapReduce: Optimizing Resource Usages ... on Multicore with Tiling
19th PACT
[09] I.Machdi+2 (2009)
Executing parallel TwigStack algorithm on a multi-core system
11th IIWAS
[10] S.Stoichev+1 (2009)
Parallel Algorithm for Integer Sorting with Multi-Core Processors
IT and Control
[11] M.Sung+3 (2006)
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28
...
26/28
.
Scalable and Efficient Data Streaming Algorithms for Detecting Common Content...
ICDE
[12] D.Knysh+1 (2010)
Parallel Genetic Algorithms: a Survey and Problem State of the Art
IJCSS
[13] Luca Deri (2009)
Modern Packet Capture and Analysis: Multi-Core, Multi-Gigabit, and Beyond
IM
[14] myself (2014)
MCoreMemory project page
https://github.com/maratishe/mcorememory
[15] myself (2013)
Rings-on-Cores project page
https://github.com/maratishe/ringsNcores
[16] myself (2014)
MAWI Working Group Traffic Archive
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28
...
26/28
.
http://mawi.wide.ad.jp/mawi
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 27/28
...
27/28
.
Extras (1) Per-Unit Cost
Hashing
Increasing
Per-Unit Cost
Manager
Prefix
Matching
Cores that
do not
match
Process
Stage 1 Stage 2 Stage 3
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 27/28
...
27/28
.
Extras (2) Share Memory Trick
M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 28/28
...
28/28

More Related Content

PDF
C++ neural networks and fuzzy logic
PDF
Solving Endgames in Large Imperfect-Information Games such as Poker
PDF
Real-time applications on IntelXeon/Phi
PDF
IPC in Microkernel Systems, Capabilities
PDF
HTCC poster for CERN Openlab opendays 2015
PPTX
Lec06 memory
PDF
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...
PDF
Introduction to National Supercomputer center in Tianjin TH-1A Supercomputer
C++ neural networks and fuzzy logic
Solving Endgames in Large Imperfect-Information Games such as Poker
Real-time applications on IntelXeon/Phi
IPC in Microkernel Systems, Capabilities
HTCC poster for CERN Openlab opendays 2015
Lec06 memory
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...
Introduction to National Supercomputer center in Tianjin TH-1A Supercomputer

What's hot (7)

PDF
Unikernels, Multikernels, Virtual Machine-based Kernels
PPTX
RL-Cache: Learning-Based Cache Admission for Content Delivery
PDF
(Im2col)accelerating deep neural networks on low power heterogeneous architec...
PPTX
Data-Centric Parallel Programming
PDF
Cache Optimization Techniques for General Purpose Graphic Processing Units
PDF
Cloud, Fog, or Edge: Where and When to Compute?
PDF
Deep Learning Initiative @ NECSTLab
Unikernels, Multikernels, Virtual Machine-based Kernels
RL-Cache: Learning-Based Cache Admission for Content Delivery
(Im2col)accelerating deep neural networks on low power heterogeneous architec...
Data-Centric Parallel Programming
Cache Optimization Techniques for General Purpose Graphic Processing Units
Cloud, Fog, or Edge: Where and When to Compute?
Deep Learning Initiative @ NECSTLab
Ad

Viewers also liked (20)

PDF
3-Way Scripts as a Base Unit for Flexible Scale-Out Code
PDF
On Context Management Using Metro Maps
PDF
The Vobile Cloud : Cars are Better for Mobile Clouds and Group Connect
PDF
YALMS: A Data Structure for Storage and Replay of Multimedia Content in Digit...
PDF
The All-In-One Package for Massively Multicore, Heterogeneous Jobs with Hotsp...
PDF
High Availability Cloud Storage as a Software Package with Social Graph, Thro...
PDF
MetroMaps versus Facets: What Exactly is the Ontological Context?
PDF
ReBot: a Recommendation Robot as a Fundamentally Distinct Approach towards Co...
PDF
Coins in Cloud Drives Can Use OAuth for Micropayments and Resource Metering A...
PDF
Cloud from Scratch / ゼロからクラウド構築
PDF
Reverse Engineering Twitter Hashtag Algorithm
PDF
Clientside/Offline (onefile) Lecture Player in a Web Browser
PDF
高インパクト発表資料作成法
PPT
Putting the pieces together: The MDA Framework
PPT
Guia argentina de tratamiento de la EPOC
PDF
Replayable BigData for Multicore Processing and Statistically Rigid Sketching
PDF
Halal Toronto second edition
PDF
Newsletter december 2010
PPT
Real food challenge workshop material
DOC
IPHA Resolution Supporting Principles of a Healthy, Sustainable Food System
3-Way Scripts as a Base Unit for Flexible Scale-Out Code
On Context Management Using Metro Maps
The Vobile Cloud : Cars are Better for Mobile Clouds and Group Connect
YALMS: A Data Structure for Storage and Replay of Multimedia Content in Digit...
The All-In-One Package for Massively Multicore, Heterogeneous Jobs with Hotsp...
High Availability Cloud Storage as a Software Package with Social Graph, Thro...
MetroMaps versus Facets: What Exactly is the Ontological Context?
ReBot: a Recommendation Robot as a Fundamentally Distinct Approach towards Co...
Coins in Cloud Drives Can Use OAuth for Micropayments and Resource Metering A...
Cloud from Scratch / ゼロからクラウド構築
Reverse Engineering Twitter Hashtag Algorithm
Clientside/Offline (onefile) Lecture Player in a Web Browser
高インパクト発表資料作成法
Putting the pieces together: The MDA Framework
Guia argentina de tratamiento de la EPOC
Replayable BigData for Multicore Processing and Statistically Rigid Sketching
Halal Toronto second edition
Newsletter december 2010
Real food challenge workshop material
IPHA Resolution Supporting Principles of a Healthy, Sustainable Food System
Ad

Similar to A Software Design and Algorithms for Multicore Capture in Data Center Forensics (20)

PDF
A proposed architecture for network
DOCX
SDOT Secure Hash, Semantic Keyword Extraction, and Dynamic Operator Pattern-B...
PPTX
Accelerating forensic and incident response workflow: the case for a new stan...
PPTX
Realtime traffic analyser
PPTX
Digital forensics lessons
PPTX
First Responders Course - Session 4 - Forensic Readiness [2004]
PPTX
Lect 4 computer forensics
PPT
Digital Forensics in the Archive
PPTX
Digital forensic tools
PPTX
Combining Phase Identification and Statistic Modeling for Automated Parallel ...
PPTX
Digital forensics research: The next 10 years
PPTX
Network and computer forensics
DOCX
Digital forensics Steps
PDF
Parallel Computing - Lec 4
PPT
computer forensics, involves the preservation, identification, extraction, an...
PDF
CNIT 121: 11 Analysis Methodology
PPT
Digital Forensics: The next 10 years
PPT
Parallel architecture
PPTX
Data acquisition and storage in Wireless Sensor Network
PDF
Proactive Data Containers (PDC): An Object-centric Data Store for Large-scale...
A proposed architecture for network
SDOT Secure Hash, Semantic Keyword Extraction, and Dynamic Operator Pattern-B...
Accelerating forensic and incident response workflow: the case for a new stan...
Realtime traffic analyser
Digital forensics lessons
First Responders Course - Session 4 - Forensic Readiness [2004]
Lect 4 computer forensics
Digital Forensics in the Archive
Digital forensic tools
Combining Phase Identification and Statistic Modeling for Automated Parallel ...
Digital forensics research: The next 10 years
Network and computer forensics
Digital forensics Steps
Parallel Computing - Lec 4
computer forensics, involves the preservation, identification, extraction, an...
CNIT 121: 11 Analysis Methodology
Digital Forensics: The next 10 years
Parallel architecture
Data acquisition and storage in Wireless Sensor Network
Proactive Data Containers (PDC): An Object-centric Data Store for Large-scale...

More from Tokyo University of Science (20)

PDF
A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...
PDF
Ultrasound Relative Positioning for IoT Devices in Dense Wireless Spaces
PDF
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...
PDF
What if We Atomize Student Data and Apps and Put Them on Docker Containers?
PDF
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...
PDF
On Performance Under Hotspots in Hadoop versus Bigdata Replay Platforms
PDF
Taking the Step from Software to Product Development \\ when teaching PBL at ...
PDF
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...
PDF
The Switchboard Optimization Problem and Heuristics for Cut-Through Networking
PDF
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...
PDF
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless Spaces
PDF
Fog Cloud Caching at Network Edge via Local Hardware Awareness Spaces
PDF
On a Hybrid Packets-and-Circuits Switching Logic
PDF
Image-Related Uses for Roadside Infrastructure \\ based on Wireless Beacons
PDF
Complexity Resolution Control for Context Based on Metromaps
PDF
The Declarative-Coordinated Model for Self-Optimization of Service Networks
PDF
3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds
PDF
Towards Social Robotics on Smartphones with Simple XYZV Sensor Feedback
PDF
Browser Visualization using PNGs Generated by HTML5 Workers on Multicore
PDF
Population Management in Clouds is a Do-It-Yourself Technology
A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...
Ultrasound Relative Positioning for IoT Devices in Dense Wireless Spaces
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...
What if We Atomize Student Data and Apps and Put Them on Docker Containers?
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...
On Performance Under Hotspots in Hadoop versus Bigdata Replay Platforms
Taking the Step from Software to Product Development \\ when teaching PBL at ...
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...
The Switchboard Optimization Problem and Heuristics for Cut-Through Networking
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless Spaces
Fog Cloud Caching at Network Edge via Local Hardware Awareness Spaces
On a Hybrid Packets-and-Circuits Switching Logic
Image-Related Uses for Roadside Infrastructure \\ based on Wireless Beacons
Complexity Resolution Control for Context Based on Metromaps
The Declarative-Coordinated Model for Self-Optimization of Service Networks
3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds
Towards Social Robotics on Smartphones with Simple XYZV Sensor Feedback
Browser Visualization using PNGs Generated by HTML5 Workers on Multicore
Population Management in Clouds is a Do-It-Yourself Technology

Recently uploaded (20)

PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
KodekX | Application Modernization Development
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Cloud computing and distributed systems.
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Approach and Philosophy of On baking technology
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
NewMind AI Monthly Chronicles - July 2025
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Diabetes mellitus diagnosis method based random forest with bat algorithm
KodekX | Application Modernization Development
Chapter 3 Spatial Domain Image Processing.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Machine learning based COVID-19 study performance prediction
MYSQL Presentation for SQL database connectivity
Cloud computing and distributed systems.
Reach Out and Touch Someone: Haptics and Empathic Computing
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Review of recent advances in non-invasive hemoglobin estimation
Approach and Philosophy of On baking technology
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
NewMind AI Monthly Chronicles - July 2025
The AUB Centre for AI in Media Proposal.docx
Understanding_Digital_Forensics_Presentation.pptx

A Software Design and Algorithms for Multicore Capture in Data Center Forensics

  • 2. . On the Way IN: DC Forensics M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 2/28 ... 2/28
  • 3. . Forensics Basics . (traditional) Forensics Stages... .. .... are collection, examination, analysis, and reporting • many challenges in data centers • collection: realtime is really really really difficult • examiation: you can't examine what you can't collect, also flexibility is important • analysis: deeper form of examination, same problems • reporting: that part is actually easy, but DCs have no standards ◦ one standard is offered later in this presentation M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 3/28 ... 3/28
  • 4. . Forensics : All is Traffic . Statement .. .All information in data centers can be reduced to the traffic form • logs are information carried on packets • logging, storage, etc. are distributed -- have to be communicate using traffic • a corrolary: if something is not traffic, it might be useful to convert it into traffic M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 4/28 ... 4/28
  • 5. . Practical DC Forensics • we want Deep Packet Inspection (DPI) back on the table • we want to not use sampling, but capture everything • we want to differentiate attention spent to different classes of traffic ◦ called context-based sampling ◦ probability of capture/inspection depends on current context • note: all these are gradually removed from practice for infeasibility M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 5/28 ... 5/28
  • 6. . Conventional Multicore M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 6/28 ... 6/28
  • 7. . Generic Multicore Design M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 7/28 ... 7/28
  • 8. . Generic Multicore Capture • 2 roles: manager and core • traditional parallel processing: message passing or shared memory 05 06 05 M.Aldinucci+2 "FastFlow: Efficient Parallel Streaming Applications on Multi-core" U.Pisa Techreport (2009) 06 R.Brightwell "Workshop on Managed Many-Core Systems" 1st Managed Many-Core Systems (2008) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 8/28 ... 8/28
  • 9. . Conventional Shortcomings . Reality is... .. . ... that traditional parallel processing designs are extremely inefficient on multicore • overhead from parallelization is too high • unit of processing is too small • streamline designs are rare but are recently discussed in BigData 08 . The solution is... .. .... to use a lockfree (message-less) parallelization design 08 R.Chen+2 "Tiled-MapReduce: Optimizing Resource Usages ... on Multicore with Tiling" 19th PACT (2010) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 9/28 ... 9/28
  • 10. . Conventional → Proposed • spawn, but don't wait to merge • collect results form cores continuously to avoid lumps • get used to not being able to communicate to cores (no messages) ◦ relatively short tasks diminish this effect 02 02 myself+0 "Experiments with Practical On-Demand Multi-Core Packet Capture" APNOMS (2013) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 10/28 ... 10/28
  • 11. . Proposal : the New Multicore M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 11/28 ... 11/28
  • 12. . Proposal : Mission Statement . Proposal Components .. . • lockfree design • tasks-into-cores packing problem and optimization • implementation that support lockfree design • remember: the easiest way to aggregate traffic is to use IP address prefixes • again, generic, so we do not care about the contents M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 12/28 ... 12/28
  • 13. . Proposal : Shared Memory • communication happens over shared memory 04 • C/C++ implementation is common, but will work in other languages as well • shared memory is persistent, but cores come and go 04 K.Michael "The Linux Programming Interface" No Starch Press (2010) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 13/28 ... 13/28
  • 14. . Proposal : DLL is Key . DLL stands for... .. .... Double Linked List • common in C/C++ designs • extremely flexible -- you can swap elements by reassigning pointers • sideways DLL is a method to avoid collisions in hashing M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 14/28 ... 14/28
  • 15. . Optimization M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 15/28 ... 15/28
  • 16. . Optimization Targets • few cores, many data units • need to pack latter into former • moreover: scheduling problem, which is packing but along the timeline • moreover(2) : when packing, do you randomize input or not -- hashing M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 16/28 ... 16/28
  • 17. . Prefix Packing Problem minimize w1count(P) + w2max(M) + w3var(C) subject of k1 < pi < k2 ∀ pi ∈ P. Hashkey - 32 bits0 - k1 (shortest) k2 (longest) Effective range Core0 Core1 Core2 … p (prefix) p1 p3 p2 p4 p5 p6 p8 p7 m (max) n Prefix Packing Problem • prefix length between k1 and k2s ◦ hashkey or raw ◦ fixed in each run in this paper • pi is a pack (group) of items • n total items, mapped to set M of prefixes in each of m cores • C a set of item counts c across prefixes, M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 17/28 ... 17/28
  • 18. . Prefix Packing GA Heuristic • Generic Algorithm (GA) 12 • chromosome is a tuple of prefixes packed into one core gi = ⟨pi,1, pi,2, ..., pi,m⟩. (1) • one gene (whole solution) is a tuple containing all chromosomes Gj = ⟨g1, g2, ..., gn⟩. (2) 12 D.Knysh+1 "Parallel Genetic Algorithms: a Survey and Problem State of the Art" IJCSS (2010) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 18/28 ... 18/28
  • 19. . Analysis M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 19/28 ... 19/28
  • 20. . Analysis Setup • actual packet traces -- trace-based simulation 16 • input: 2 cases -- hashing verus raw • items are individual packets ◦ packets are packed into prefixes ◦ prefixes are packed into cores • the above GA optimization heuristic 16 myself "MAWI Working Group Traffic Archive" http://mawi.wide.ad.jp/mawi (2014) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 20/28 ... 20/28
  • 21. . Analysis (1) Cores 0 1 2 3 4 5 6 7 8 9 Time sequence 4.6 4.7 4.8 4.9 5 5.1 5.2 5.3 5.4 5.5 log(maxitemcount/core) 1 core 2 cores 3 cores 4 cores 5 cores 6 cores 7 cores M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 21/28 ... 21/28
  • 22. . Analysis (2) Hashing 0 0.2 0.4 0.6 0.8 1 Increasing cutoff parameter 0 40 80 120 160 200 240 Numberofuniqueprefixes hashed raw M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 22/28 ... 22/28
  • 23. . Forensics 2.0 M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 23/28 ... 23/28
  • 24. . Forensics 2.0 • reporting part: let's use sketches from data streaming 11 Core 1 Core 1 Core X TABID Manager Now(replay) …. BIG DATA TIMELINE Cursor Time Direction One Sketch One SketchOne Sketch Start End End End Read/prepare Shared Memory Start 11 M.Sung+3 "Scalable and Efficient Data Streaming Algorithms for Detecting Common Content..." ICDE (2006) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 24/28 ... 24/28
  • 25. . Wrapup • a natively multicore technology is proposed • performance is opitimized using a packing heuristic • raw input is found to be preferable to randomization • future topics: 1. variable-length prefixes 2. optimization along the timeline 3. jitter minimization (fewer reasignments) 4. further lookup optimiation -- fast hashing M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 25/28 ... 25/28
  • 26. . That’s all, thank you ... M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28 ... 26/28
  • 27. . [01] myself+0 (2013) ...community-based architecture for measuring E2E QoS at DCc IJCSE [02] myself+0 (2013) Experiments with Practical On-Demand Multi-Core Packet Capture APNOMS [03] myself+1 (2013) A Graphical Method for Detection of Flash Crowds in Traffic Telecom. Systems (TM) [04] K.Michael (2010) The Linux Programming Interface No Starch Press [05] M.Aldinucci+2 (2009) FastFlow: Efficient Parallel Streaming Applications on Multi-core U.Pisa Techreport M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28 ... 26/28
  • 28. . [06] R.Brightwell (2008) Workshop on Managed Many-Core Systems 1st Managed Many-Core Systems [07] X.Sui+3 (2010) Parallel Graph Partitioning on Multicore Architectures 23rd LCPC [08] R.Chen+2 (2010) Tiled-MapReduce: Optimizing Resource Usages ... on Multicore with Tiling 19th PACT [09] I.Machdi+2 (2009) Executing parallel TwigStack algorithm on a multi-core system 11th IIWAS [10] S.Stoichev+1 (2009) Parallel Algorithm for Integer Sorting with Multi-Core Processors IT and Control [11] M.Sung+3 (2006) M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28 ... 26/28
  • 29. . Scalable and Efficient Data Streaming Algorithms for Detecting Common Content... ICDE [12] D.Knysh+1 (2010) Parallel Genetic Algorithms: a Survey and Problem State of the Art IJCSS [13] Luca Deri (2009) Modern Packet Capture and Analysis: Multi-Core, Multi-Gigabit, and Beyond IM [14] myself (2014) MCoreMemory project page https://github.com/maratishe/mcorememory [15] myself (2013) Rings-on-Cores project page https://github.com/maratishe/ringsNcores [16] myself (2014) MAWI Working Group Traffic Archive M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 26/28 ... 26/28
  • 30. . http://mawi.wide.ad.jp/mawi M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 27/28 ... 27/28
  • 31. . Extras (1) Per-Unit Cost Hashing Increasing Per-Unit Cost Manager Prefix Matching Cores that do not match Process Stage 1 Stage 2 Stage 3 M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 27/28 ... 27/28
  • 32. . Extras (2) Share Memory Trick M.Zhanikeev -- maratishe@gmail.com -- Design and Algorithms for Multicore Capture in Data Center Forensics-- http://bit.do/marat140603 -- 28/28 ... 28/28