©2013-2015 FlockData LLC - All rights reserved
Open-source information management and data integration
COLLECT | CONNECT | COMPARE
TURNING DATA INTO INFORMATION
Companies resistant to data do not thrive.
Even if you have the desire…
you can’t get the data…
It exists in 27 different computer systems with different structures.
What do the world’s largest & most
complex data stores have in common?
Sources:
https://gigaom.com/2013/06/06/heres-how-the-nsa-analyzes-all-that-call-data/
https://gigaom.com/2013/06/07/under-the-covers-of-the-nsas-big-data-effort/
http://neo4j.com/blog/why-the-most-important-part-of-facebook-graph-search-is-graph/
• multiple instances each storing tens of petabytes
• backend of the agency’s most widely used analytical capabilities
• Accumulo is especially adept at analyzing trillions of data points in order to build massive graphs
• Technology giants such as Facebook, Google, and Twitter have all built graph technologies from the ground up to differentiate and grow their business. Building and maintaining one’s own database management system however is not a practical solution if you’re not Facebook.
• PageRank changed the fundamentals of web search - taking into account how the pages are connected
• Facebook = Social graph
• Google = Knowledge graph
• Twitter = Interest graph
Multi-model is the key to
modern and future data
Connections Analytics Timeline
Graph Search Document
Graph Search Document
Any data source(s)
HTTP RESTful API integration
Graph Search Document
Reports Apps Structure
Any data source(s)
HTTP RESTful API integration
FlockData provides a single, unified multi-model access point for both data storage and information retrieval
Connect Meta Data
Disclaimer: image not generated by AuditBucket
Meta-data captured and stored
Connections are made for analysis
See hidden relationships fast!
Dashboards
Versions of Data
Industries:
- Online media
- Telecommunications
- Financial Services
- Healthcare
- Logistics
©2013 AuditBucket Pty Ltd & Entiviti LLC - Proprietary & Confidential - DO NOT SHARE
Solution Categories:
- Recommendation Engines
- Network mapping & analysis
- Cross-source analytics
- Data-driven apps
- Universal search & audit
Recommendation Engine
What is a recommendation engine?
An algorithm that:
- Incorporates any number of factors about users, notably including products or services consumed
- Leverages multiple related factors (similar products, similar users, etc.)
- Traverses these factors as connections
- Returns the most connected nodes as recommended products or services
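The description above can be sketched in a few lines of Python. This is only an illustration, not FlockData's implementation: the `purchases` data, the `recommend` function, and the scoring rule (one point per shared product on the path from the user to a candidate) are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical sample data: user -> products or services consumed.
purchases = {
    "ann": {"tent", "stove", "lantern"},
    "bob": {"tent", "stove", "boots"},
    "cat": {"tent", "boots", "map"},
    "dan": {"novel"},
}


def recommend(user, purchases, top_n=3):
    """Rank products the user lacks by how strongly they are connected.

    Each path user -> shared product -> similar user -> candidate product
    adds one point to the candidate's score.
    """
    owned = purchases[user]
    scores = Counter()
    for other, items in purchases.items():
        if other == user:
            continue
        shared = owned & items  # connections between the two users
        if not shared:
            continue  # no connection, so this user contributes nothing
        for candidate in items - owned:
            scores[candidate] += len(shared)
    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [product for product, _ in ranked[:top_n]]


print(recommend("ann", purchases))
```

Adding another factor (views, ratings, social links) would mean adding another kind of connection to traverse, which is the property the slide emphasizes.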
Why build on graph data?
- Only need to specify which type of relationships to use
- As little as 2-line queries
- Performance: 1M rows -> ~20ms, with very little scale effect
- Fast enough for real-time performance
- Efficient and flexible for expanded use
Lookalikes
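To illustrate what "only need to specify which type of relationships to use" can mean in practice, here is a minimal in-memory sketch. The edge list, the relationship names `BOUGHT` and `VIEWED`, and the function `lookalike_items` are hypothetical; a real deployment would run an equivalent query against a graph database rather than Python dictionaries, and this sketch makes no claim about the performance figures quoted above.

```python
from collections import Counter, defaultdict

# Hypothetical typed edges: (source, relationship type, target).
edges = [
    ("ann", "BOUGHT", "tent"),
    ("bob", "BOUGHT", "tent"),
    ("bob", "BOUGHT", "boots"),
    ("ann", "VIEWED", "map"),
    ("cat", "VIEWED", "map"),
    ("cat", "VIEWED", "compass"),
    ("cat", "BOUGHT", "compass"),
]


def build_index(edges):
    """Index edges by (node, relationship type) in both directions."""
    out_edges = defaultdict(set)  # (source, rel) -> targets
    in_edges = defaultdict(set)   # (target, rel) -> sources
    for src, rel, dst in edges:
        out_edges[(src, rel)].add(dst)
        in_edges[(dst, rel)].add(src)
    return out_edges, in_edges


def lookalike_items(user, rel_types, edges):
    """Score items reached via user -> item -> peer -> item,
    following only the relationship types named in rel_types."""
    out_edges, in_edges = build_index(edges)
    scores = Counter()
    for rel in rel_types:
        mine = out_edges[(user, rel)]
        for item in mine:
            for peer in in_edges[(item, rel)] - {user}:
                for candidate in out_edges[(peer, rel)] - mine:
                    scores[candidate] += 1
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


print(lookalike_items("ann", ["BOUGHT"], edges))
print(lookalike_items("ann", ["BOUGHT", "VIEWED"], edges))
```

Switching the recommendation from purchase-based to view-based, or combining both, changes only the `rel_types` argument; the traversal itself stays the same.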
Why build a recommendation engine?
Original search
Bought together = up-sell
Also bought = up-sell
Targeted ads = cross-sell
Also viewed = conversion
Overlay social graph of users
Insert taxonomy here
Add as many factors as you have
Each factor improves quality
Merged graph
Graph as the basis for recommendations
Case Study: Macro view of Ebola
- Obtain a sample data set of 48K Twitter posts
- Send tweets through an NLP engine for tag capture, entity and concept extraction, and sentiment analysis
- FlockData JSON transformation and import definition in under 1 day
- Leverage our automatic analysis tools (word cloud, graph, visualizations) to find connections
- Use dashboards to get an overview of the breakdown
- Use cluster analysis to find “hot spots” in the data
Quick findings: From concept to insights in under 2 days
Sentiment, tags and concepts are sortable, reportable, and can be integrated with real-time data feeds
Geo-location of user gives automatic mapping
Quick findings: Locate hot spots
Data categories sorted by co-occurrence - shows organizations where to focus for maximum impact
FlockData data profiling during data load is used to drive reporting
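Sorting categories by co-occurrence can be approximated with a simple pair count. The sketch below assumes each post has already been reduced to a set of tags by an upstream NLP step; the sample `posts` and the function name `cooccurrence` are invented for illustration and do not come from the case study data.

```python
from collections import Counter
from itertools import combinations

# Hypothetical tagged posts: each post is the set of tags extracted from it.
posts = [
    {"ebola", "liberia", "quarantine"},
    {"ebola", "liberia"},
    {"ebola", "vaccine"},
    {"liberia", "quarantine"},
]


def cooccurrence(posts):
    """Count how often each pair of tags appears in the same post,
    returning pairs sorted from most to least frequent."""
    counts = Counter()
    for tags in posts:
        # Sorting the tags gives each pair one canonical order.
        for pair in combinations(sorted(tags), 2):
            counts[pair] += 1
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


for pair, count in cooccurrence(posts):
    print(pair, count)
```

The pairs at the top of the list are the candidate "hot spots": categories that appear together most often in the data.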


FlockData Pitch Overview
