Databases and graph analysis - back to the future?
Krystian Piećko
CTO & Co-founder, PiLab S.A.
2015-09-30
Introduction
2
A few words about PiLab
Mother of God…
3
Databases in
Enterprise Environment
Theory vs. Practice
How enterprises are using data
5
Two data modellers in the same room
Better fight than Pride with Nastula
6
Never forget about it
7
NoSQL
NoSQL rises
2009
• SQL is hard
• RDBMSs can’t fetch data
fast enough
• Schemas are for
grandpas
• Who needs SQL - we have
got Java and Python
• Analytics is too slow on
RDBMS
• Unstructured data is growing
• other?
9
It is very simple to dim the lights in the server
room
10
Power to the user
Limited access is the problem
11
NoSQL in practice
12
NoSQL
There are fewer Java/Python/whatever programmers than
people who know how to use SQL
13
NoSQL in practice
Do you all know this?
14
Schema vs. schemaless
15
NoSQL with SQL interface
Many of the new ones are missing here
16
After a few years
Some people tried to change the meaning to “Not Only SQL”
Short learning cycle :)
17
NewSQL
And it has begun
The shift showed the “old school” DB guys that there is an
opportunity in the marketplace
19
An inefficient query that runs against NewSQL
is still inefficient, but it runs faster :)
20
Truth ;)
21
The largest RDBMS
implementation I know
22
Facebook facts
2014
• 60k+ servers
• MySQL database
cluster
• Memcache
• 2.4+ billion pieces of
content and 750TB+ of
data every day
• 35TB+/h daily ingest into
HP Vertica, with an SLA on
that
• 300+ identical nodes
with 10Gigs/s
connectivity
23
Probably true
The cost vs. value
24
Only some industries need it,
but all need to think about it
25
In 90% of major-vendor DB deployments,
the database is smaller than 900GB
26
Graph analysis
Graph is how we think
28
6 degrees of separation
29
6 degrees of separation
30
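As a rough illustration of the "degrees of separation" idea, the number of hops between two people can be found with a breadth-first search over an acquaintance graph. The sketch below is not from the talk: the `friends` data and the function name `degrees_of_separation` are made up for the example, and the graph is assumed to be small enough to fit in memory.

```python
from collections import deque


def degrees_of_separation(graph, source, target):
    """Return the hop count of a shortest path, or None if unreachable."""
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        person, dist = queue.popleft()
        for friend in graph.get(person, ()):
            if friend == target:
                return dist + 1
            if friend not in seen:
                seen.add(friend)
                queue.append((friend, dist + 1))
    return None


# Hypothetical acquaintance graph (undirected, stored as adjacency sets).
friends = {
    "Anna": {"Bartek"},
    "Bartek": {"Anna", "Celina"},
    "Celina": {"Bartek", "Dawid"},
    "Dawid": {"Celina"},
    "Ewa": set(),
}

print(degrees_of_separation(friends, "Anna", "Dawid"))
print(degrees_of_separation(friends, "Anna", "Ewa"))
```

Breadth-first search visits people in order of distance, so the first time the target is reached the hop count is minimal.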
Cluster/filter/optimize
31
Graphs are growing
An entity-relationship model is a graph
32
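One way to read the claim that an entity-relationship model is a graph: every row is a node and every foreign key is an edge. The sketch below assumes three hypothetical tables (`customers`, `accounts`, `transfers`) held as lists of dicts; the table and column names are invented for illustration and do not come from the talk.

```python
# Hypothetical relational rows; in practice these would come from a database.
customers = [
    {"id": 1, "name": "Alice"},
    {"id": 2, "name": "Bob"},
]
accounts = [
    {"id": 10, "customer_id": 1},
    {"id": 11, "customer_id": 2},
]
transfers = [
    {"id": 100, "from_account": 10, "to_account": 11},
]


def build_graph(customers, accounts, transfers):
    """Turn rows into nodes and foreign keys into undirected edges."""
    edges = {}

    def link(a, b):
        edges.setdefault(a, set()).add(b)
        edges.setdefault(b, set()).add(a)

    # Every customer becomes a node, even one without accounts.
    for c in customers:
        edges.setdefault(("customer", c["id"]), set())
    # accounts.customer_id is a foreign key: customer <-> account edge.
    for acc in accounts:
        link(("customer", acc["customer_id"]), ("account", acc["id"]))
    # Each transfer row connects two accounts.
    for t in transfers:
        link(("account", t["from_account"]), ("account", t["to_account"]))
    return edges


graph = build_graph(customers, accounts, transfers)
print(sorted(graph[("account", 10)]))
```

Nodes are tagged with their table name so that ids from different tables cannot collide.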
Investigations
Techniques
• There is no official data
about the vendors and the
software used, just rumours
• Software reportedly used
was i2, Palantir, and
other custom-made tools
• Huge amount of work on
unstructured data
• 7 ways to write Al-Qaeda
• How to investigate a tip?
• Manual tagging of
unstructured data
• Social media cross join
34
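The "7 ways to write Al-Qaeda" problem can be sketched as spelling-variant matching: normalise each name, then compare it to a canonical form with a similarity score. The example below uses Python's standard `difflib.SequenceMatcher`; the canonical spelling, the 0.75 threshold, and the function names are assumptions chosen for illustration, not the technique used in the investigations the slide describes.

```python
import re
from difflib import SequenceMatcher

CANONICAL = "alqaeda"  # assumed canonical form after normalisation
THRESHOLD = 0.75       # assumed cut-off; would need tuning on real data


def normalise(name):
    """Lowercase and drop everything except ASCII letters and digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def matches_watchlist(name):
    """True if the normalised name is close enough to the canonical form."""
    score = SequenceMatcher(None, normalise(name), CANONICAL).ratio()
    return score >= THRESHOLD


for candidate in ["Al-Qaeda", "al Qaida", "Al-Qa'ida", "AL QAEDA", "Alabama"]:
    print(candidate, matches_watchlist(candidate))
```

A character-level score like this also produces false positives on unrelated words that happen to share letters, which is one reason manual tagging and review still take so much work.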
Being able to connect the dots
Sharing the investigation information was
35
Field investigation was the clue
36
The key to the case was visualisation
and ad-hoc querying by non-technical users
#SELECT * FROM users WHERE clue > 0
37
Banks
The ~40th-largest bank in the US
is the size of the largest Polish bank
39
IT companies in SV
40
Almost every part of the banking industry
market has been taken
Only a few spots left
41
At least 10 different software products involved
Regulatory issues dominate
Alerting system
Investigation system
Case management system
Twist & Tune
42
About PiLab
Technology
44
Technology is not a product
45
A product is something
that solves a customer-specified need and that the customer
wants to pay for
46
Finding a niche
47
Short learning cycle
48
User with the data
49
Demo?
50
Thank you for your
attention
Krystian Piećko
krystian.piecko@pilab.pl
