16th November, 2018 | 1st IEEE SmartNets @ Hammamet, Tunisia
The Quest for Scalable and Intelligent
Trajectory Data Analytics Systems: Status
Report and Future Directions
Rim Moussa LaTICE Lab. Univ. of Tunis and University of Carthage
Ahmed Haddad LaTICE Lab. Univ. of Tunis and University of Carthage
Tarek Bejaoui MEDIATRON Lab. University of Carthage
1st IEEE SmartNets @ Hammamet, 16th of November, 2018
Scalable Trips' Records Analysis
●Characteristics of trips' records
»Big volume
»Big velocity
●NYC Cabs' dataset
»http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml
»More than 200GB
»Yellow and Green taxi trips' records from 2009 to now
»csv format
»capturing pick-up and drop-off dates/times, pick-up and
drop-off locations, fares, rate types, payment types, and
driver-reported passenger counts
Goals of Scalable Trips' Records Analysis
●Turn trajectory data into knowledge
»Multi-dimensional analysis of trajectory data
●e.g. Average fare, Average trip duration... for a given
pick-up location and a given drop-off location between
9pm and 10pm
»Mining of Trajectory Patterns
●Hotspots and cold areas
●Frequent/Infrequent trajectory patterns
●Turn knowledge into decisions
●Intelligent urban computing
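
The multi-dimensional query mentioned above (average fare and average trip duration for one pick-up/drop-off pair between 9pm and 10pm) can be sketched in a few lines of Python. This is only an illustration: the sample rows, the column names and the zone labels are assumptions, not the actual schema of the NYC dataset.

```python
import csv
import io
from datetime import datetime

# Hypothetical sample in the spirit of the NYC trip records; the column
# names and zone labels are illustrative assumptions.
SAMPLE_CSV = """pickup_datetime,dropoff_datetime,pickup_zone,dropoff_zone,fare_amount
2014-03-01 21:05:00,2014-03-01 21:25:00,Midtown,JFK,52.0
2014-03-01 21:40:00,2014-03-01 22:10:00,Midtown,JFK,48.0
2014-03-01 18:00:00,2014-03-01 18:30:00,Midtown,JFK,45.0
2014-03-01 21:15:00,2014-03-01 21:30:00,SoHo,Midtown,12.5
"""

FMT = "%Y-%m-%d %H:%M:%S"


def average_fare_and_duration(rows, pickup, dropoff, start_hour, end_hour):
    """Average fare and trip duration (minutes) for one origin/destination
    pair, restricted to pick-ups with start_hour <= hour < end_hour."""
    fares, durations = [], []
    for row in rows:
        t0 = datetime.strptime(row["pickup_datetime"], FMT)
        t1 = datetime.strptime(row["dropoff_datetime"], FMT)
        if (row["pickup_zone"] == pickup and row["dropoff_zone"] == dropoff
                and start_hour <= t0.hour < end_hour):
            fares.append(float(row["fare_amount"]))
            durations.append((t1 - t0).total_seconds() / 60.0)
    if not fares:
        return None  # no trip matches the filters
    return sum(fares) / len(fares), sum(durations) / len(durations)


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
result = average_fare_and_duration(rows, "Midtown", "JFK", 21, 22)
print(result)
```

A single-machine scan like this would not scale to the full dataset; it only makes the shape of the query concrete.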
Outline
●Key Functional Requirements of Intelligent and Scalable
Trajectory Data Analysis
●Overview of state-of-the-art open-source technologies
»Elastic Stack: data shippers + search engine + visualization
»GeoMondrian: spatial relational OLAP engine + relational DBMS
»Leaflet: JavaScript library for mobile-friendly interactive maps +
relational data store
»Neo4j: graph database
●Neo4j Extension
●Conclusions
●Future Directions
Key Functional Requirements
↬ Spatial On-Line Analytical Processing (SOLAP)
●OLAP tools enable users to analyze multidimensional data
interactively from multiple perspectives
●Multi-dimensional data analysis
»Spatial dimension: GPS data, area, ...
»Time dimension: time range, day/night, ...
»Standard dimensions such as number of passengers, payment type, ...
»Measures: count of trips, sum of trips' fares, ...
●OLAP operations
»Drill-down: show trips' details for a combination of dimensions
»Roll-up: aggregate trips' data for a combination of dimensions
»Slice: filter along one dimension
»Dice: filter along more than one dimension
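
The four OLAP operations listed above can be illustrated on a tiny in-memory fact table. This is a minimal sketch under stated assumptions: the rows, the dimension names (`borough`, `zone`, `period`, `payment`) and the helper functions are hypothetical, and a real SOLAP engine would evaluate such queries over a cube rather than a Python list.

```python
from collections import defaultdict

# Hypothetical fact rows: (borough, zone, period, payment_type, fare).
TRIPS = [
    ("Manhattan", "Midtown", "night", "card", 20.0),
    ("Manhattan", "SoHo", "night", "cash", 10.0),
    ("Manhattan", "Midtown", "day", "card", 15.0),
    ("Queens", "JFK", "night", "card", 50.0),
]
# Position of each dimension inside a fact row.
DIMS = {"borough": 0, "zone": 1, "period": 2, "payment": 3}


def aggregate(trips, dims):
    """Group by the given dimensions; measures are (trip count, fare sum)."""
    cube = defaultdict(lambda: [0, 0.0])
    for trip in trips:
        key = tuple(trip[DIMS[d]] for d in dims)
        cube[key][0] += 1
        cube[key][1] += trip[4]
    return {k: tuple(v) for k, v in cube.items()}


def dice(trips, **filters):
    """Keep trips matching every dimension=value filter.
    With a single filter this is a slice."""
    return [t for t in trips
            if all(t[DIMS[d]] == v for d, v in filters.items())]


by_zone = aggregate(TRIPS, ["borough", "zone"])   # finer level (drill-down)
by_borough = aggregate(TRIPS, ["borough"])        # coarser level (roll-up)
night = dice(TRIPS, period="night")               # slice on one dimension
night_card = dice(TRIPS, period="night", payment="card")  # dice on two
print(by_borough)
```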
Key Functional Requirements
↬ Spatial Data Mining
●Algorithms for learning trajectory patterns from historical
data
»Path patterns (infrequent/frequent path patterns, triangle
patterns)
»Hotspots/cold areas
»Co-location patterns, e.g. weather conditions and trip
patterns
»Stay points, trips' trajectory patterns, driving and speed
patterns
●Algorithms for predicting future events such as a car's
destination, future traffic congestion, a trip's cost, etc.
Key Functional Requirements
↬ Geo-visualization
●Geo-visualization
»Interactive Maps
»Change the visual appearance of the map (e.g. colors,
day/night theme)
Key Functional Requirements
↬ Geo-visualization (cont'd)
●Zoom-in and zoom-out
Key Functional Requirements
↬ Processing mode
●Batch processing
»Capacity to process historical data
●Real-time processing
»Capacity to process real-time data
»Stream processing systems
●Lambda architecture
»Combine batch and real-time processing
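
The Lambda architecture idea above (a batch view over historical data, a real-time view over recent events, and queries that merge both) can be sketched as follows. The class name, the method names and the area labels are hypothetical, and the sketch assumes each batch run covers every event the real-time layer has already seen.

```python
from collections import Counter


class LambdaCounts:
    """Trip counts per pick-up area, served from a batch view plus a
    real-time (speed) view. Illustrative only."""

    def __init__(self):
        self.batch_view = Counter()
        self.speed_view = Counter()

    def run_batch(self, historical_areas):
        """Batch layer: recompute the view from all historical records.
        The speed view is reset because this sketch assumes the batch
        input now includes everything the speed layer had counted."""
        self.batch_view = Counter(historical_areas)
        self.speed_view = Counter()

    def on_event(self, area):
        """Speed layer: incrementally account for one new pick-up."""
        self.speed_view[area] += 1

    def query(self, area):
        """Serving layer: merge both views at query time."""
        return self.batch_view[area] + self.speed_view[area]


store = LambdaCounts()
store.run_batch(["dr5ru", "dr5ru", "dr5rs"])  # historical pick-ups
store.on_event("dr5ru")                        # events arriving afterwards
store.on_event("dr5x1")
print(store.query("dr5ru"), store.query("dr5x1"))
```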
Overview of state-of-the-art Technologies
↬ Elastic Stack
●Elasticsearch
»Distributed search engine and document store system
»Distributed inverted indices for querying free text
●Logstash and Beats for data ingest
●Kibana for visualization
●Real-world users: NASA, Uber, Lyft, Tinder, Cisco, New York
Times, eBay, Groupon, Wikipedia, Stack Overflow, GitHub...
Overview of state-of-the-art Technologies
↬ NYC cabs' records exploration with Elastic Stack
(three slides of figures, not reproduced in this transcript)
Overview of state-of-the-art Technologies
↬ Elastic Stack discussion
●Geo-visualization
»Interactive maps
●Spatial OLAP
»Use the Query DSL (Domain Specific Language) to query the
Elasticsearch cluster
●Spatial data mining and trajectory patterns
»Not supported
●Processing mode
»Both batch and real-time
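
As an illustration of the Spatial OLAP point above, the following sketch builds an Elasticsearch Query DSL request body that counts trips and averages fares per geohash cell over a date range. The field names (`pickup_datetime`, `pickup_location`, `fare_amount`) are assumptions about how the trip records might be indexed, with `pickup_location` assumed to be mapped as a `geo_point`. The body is only constructed and printed here; sending it to a cluster is left out.

```python
import json


def trips_per_cell_query(start, end, precision=7):
    """Request body: filter trips by pick-up date range, then bucket them
    by geohash cell and compute the average fare per cell."""
    return {
        "size": 0,  # return aggregations only, no individual documents
        "query": {
            "bool": {
                "filter": [
                    {"range": {"pickup_datetime": {"gte": start, "lt": end}}}
                ]
            }
        },
        "aggs": {
            "pickup_cells": {
                "geohash_grid": {
                    "field": "pickup_location",  # assumed geo_point field
                    "precision": precision,
                },
                "aggs": {
                    "avg_fare": {"avg": {"field": "fare_amount"}}
                },
            }
        },
    }


body = trips_per_cell_query("2014-01-01", "2015-01-01")
print(json.dumps(body, indent=2))
```

Lowering `precision` gives coarser cells, which plays the role of a roll-up along the spatial dimension.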
Overview of state-of-the-art Technologies
↬ JPivot + GeoMondrian + PostgreSQL
Overview of state-of-the-art Technologies
↬ GeoMondrian discussion
●Geo-visualization
»No interactive maps
»Need SOLAP client
●Spatial OLAP
»Use SQL to query the relational store
●Spatial data mining and trajectory patterns
»pgRouting: routing extension of PostgreSQL/PostGIS
●Processing mode
»Batch processing
Overview of state-of-the-art Technologies
↬ Leaflet + relational data store (MySQL)
Visualizing millions of NYC taxi pick-up locations for the year
2014 (spatial points are clustered using superclustering)
Overview of state-of-the-art Technologies
↬ Leaflet discussion
●Geo-visualization
»Interactive maps
●Spatial OLAP
»Use SQL to query the Relational store
●Spatial data mining and trajectory patterns
»Not supported
●Processing mode
»Batch processing
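
Querying a relational store with SQL, as noted in the discussion above, might look like the sketch below. It uses Python's built-in `sqlite3` module as a stand-in for MySQL or PostgreSQL, and the `trips` table, its columns and its rows are hypothetical.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for the relational store.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE trips ("
    "pickup_zone TEXT, dropoff_zone TEXT, pickup_hour INTEGER, fare REAL)"
)
conn.executemany(
    "INSERT INTO trips VALUES (?, ?, ?, ?)",
    [
        ("Midtown", "JFK", 21, 52.0),
        ("Midtown", "JFK", 21, 48.0),
        ("SoHo", "Midtown", 21, 12.5),
        ("Midtown", "SoHo", 9, 9.0),
    ],
)

# Slice on the time dimension (9pm to 10pm), then aggregate per pick-up zone.
summary = conn.execute(
    "SELECT pickup_zone, COUNT(*), AVG(fare) "
    "FROM trips WHERE pickup_hour = ? "
    "GROUP BY pickup_zone ORDER BY pickup_zone",
    (21,),
).fetchall()
conn.close()
print(summary)
```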
Graph-oriented data store solution
↬ Graph-oriented databases: Neo4j, GraphFrames/Spark
●Directed graph design
»Vertices: aggregated spatial locations
»Relationships: bags of trips data
●Extend the Neo4j Cypher query language to support OLAP
operations
»Roll-up graph
»Drill-down graph
●Use of Apache Spark for data preprocessing
»Processing spatial data
●Map each GPS pick-up/drop-off into a geohash
●Geohash length 12 → Area width x height: 3.7cm x 1.9cm
●Geohash length 7 → Area width x height: 152.9m x 152.4m
●Geohash length 4 → Area width x height: 39.1km x 19.5km ...
»Processing time data
●Map each date-time pick-up/drop-off into a timehash
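
The preprocessing step above can be sketched in plain Python. `geohash_encode` follows the standard geohash scheme (interleaved longitude/latitude bits, base-32 alphabet). `time_key` is only a hypothetical stand-in for a hierarchical time encoding and is not the timehash algorithm itself. The slides run this step on Apache Spark; the sketch is single-machine.

```python
from datetime import datetime

BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"  # geohash alphabet (no a, i, l, o)


def geohash_encode(lat, lon, length):
    """Encode a latitude/longitude pair as a geohash of `length` characters."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    chars, bits, value = [], 0, 0
    use_lon = True  # bits alternate, starting with longitude
    while len(chars) < length:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            bit = lon >= mid
            lon_lo, lon_hi = (mid, lon_hi) if bit else (lon_lo, mid)
        else:
            mid = (lat_lo + lat_hi) / 2
            bit = lat >= mid
            lat_lo, lat_hi = (mid, lat_hi) if bit else (lat_lo, mid)
        value = (value << 1) | int(bit)
        use_lon = not use_lon
        bits += 1
        if bits == 5:  # every 5 bits form one base-32 character
            chars.append(BASE32[value])
            bits, value = 0, 0
    return "".join(chars)


def cell_size_degrees(length):
    """Width and height of a geohash cell, in degrees."""
    total_bits = 5 * length
    lon_bits = (total_bits + 1) // 2
    lat_bits = total_bits // 2
    return 360.0 / 2 ** lon_bits, 180.0 / 2 ** lat_bits


def time_key(dt, level):
    """Hypothetical hierarchical time key: coarser keys are prefixes of
    finer ones (assumption; not the timehash encoding)."""
    formats = {"year": "%Y", "month": "%Y%m", "day": "%Y%m%d",
               "hour": "%Y%m%d%H"}
    return dt.strftime(formats[level])


cell = geohash_encode(57.64911, 10.40744, 5)
print(cell, cell_size_degrees(7), time_key(datetime(2014, 3, 1, 21, 5), "hour"))
```

Because a shorter geohash is a prefix of a longer one for the same point, truncating the string moves to a coarser cell, which is what makes roll-up along the spatial dimension cheap.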
Graph-oriented data store solution
↬ Graph-oriented databases: Neo4j, GraphFrames/Spark
●Scalable data processing with CAPS (Cypher for Apache
Spark)
●Most trajectory patterns are provided or implemented
using either Neo4j Cypher, Apache Spark/GraphFrames
or MLlib
»Graph traversal algorithms: Breadth First Search is
provided
»Depth First Search is implemented
»Frequent/infrequent trajectory patterns
»Hotspots and cold areas
»PageRank
»Connected components
»Clustering
●Work-in-progress
»Visualization on a world map
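
The graph design described above (vertices as aggregated spatial locations, relationships as bags of trips) and a few of the listed analyses can be illustrated without any graph database. The sketch below is plain Python, not Neo4j Cypher or GraphFrames code. The trips and geohash strings are hypothetical, and roll-up is assumed to work by truncating geohashes to a shorter prefix.

```python
from collections import Counter, defaultdict, deque

# Hypothetical trips as (pickup_geohash, dropoff_geohash, fare) tuples.
TRIPS = [
    ("dr5ru7k", "dr5rsq2", 12.0),
    ("dr5ru7k", "dr5rsq2", 14.0),
    ("dr5rsq2", "dr5x1n7", 40.0),
    ("dr5ruu1", "dr5rsq9", 9.0),
]


def build_graph(trips, length):
    """Directed graph whose vertices are geohash cells truncated to
    `length`; each edge carries the bag of fares it aggregates."""
    edges = defaultdict(list)
    for src, dst, fare in trips:
        edges[(src[:length], dst[:length])].append(fare)
    return dict(edges)


def hotspots(edges, top):
    """Rank drop-off cells by their number of incoming trips."""
    incoming = Counter()
    for (_, dst), fares in edges.items():
        incoming[dst] += len(fares)
    return incoming.most_common(top)


def bfs(edges, start):
    """Breadth-first traversal of the cells reachable from `start`."""
    adjacency = defaultdict(list)
    for src, dst in edges:
        adjacency[src].append(dst)
    seen, order, queue = {start}, [], deque([start])
    while queue:
        node = queue.popleft()
        order.append(node)
        for nxt in adjacency[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return order


fine = build_graph(TRIPS, 7)    # drill-down level: one vertex per 7-char cell
coarse = build_graph(TRIPS, 5)  # roll-up level: cells merged by 5-char prefix
print(hotspots(coarse, 1), bfs(coarse, "dr5ru"))
```

In the rolled-up graph, the two distinct fine-grained routes between neighbouring cells collapse into a single edge whose bag holds all three fares.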
Conclusion & Future work
●Conduct experiments on an HPC platform
»Benchmark NoSQL graph databases: Neo4j vs.
GraphFrames for each defined business query
●Extend graph capabilities in Elastic
●Combine multiple datasets
»e.g. trajectory data with open datasets such as weather
data, crime data in NYC
Thank you for your attention
Q & A
The Quest for Scalable and Intelligent
Trajectory Data Analytics Systems: Status
Report and Future Directions
Rim Moussa, Ahmed Haddad and Tarek Bejaoui
16th of November, 2018, 1st IEEE SmartNets @ Hammamet, Tunisia
