SlideShare a Scribd company logo
Management and Analysis of Large
Scale Heterogeneous Time-Series
Data
Sensor and Government Data: Their Role in Public Policy
Martin Litzenberger
Safety and Security Department
AIT Austrian Institute of Technology
Martin Litzenberger | Senior Engineer | DSS SNI
Motivation
A plethora of heterogeneous data are collected by public institutions
with various sensors today
But the data and their use are (usually) restricted to the domain or
departments they belong to, e.g.
security surveillance, traffic, public transport, air quality, power grids, ...
Reasons: Lack of interoperability and often lack of communication
and cooperation of data owners
223.05.2014
Advantages
Connecting these data or even collecting them on a common
platform would allow for new ways of analysis and insight into
important and interesting mechanisms (e.g. traffic / air quality)
But data are heterogeneous in many aspects such as: format,
update frequency, representation, owner, accessibility .. which
makes a joint analysis a big challenge
Real-time 24/7 processing and availability, not a “one-time”
academic investigation!
323.05.2014
Challenge: Heterogeneity of Data
Temporal heterogeneity
Discrete events versus regular time series
Spatial heterogeneity
„On-site“ versus „as near as possible“
Semantic heterogeneity
The same parameters might have different significance under
different context
Technical heterogeneity
Non-standardized interfaces, formats, etc.
Political heterogeneity
“Owners” of data have different missions and goals
423.05.2014
523.05.2014
Investigating effects of
traffic state (free
flow/stop&go) on local air
quality
Data sources
Traffic monitor for
traffic volume and
acceleration
Black carbon sensor at
road side and a
background station
Meteorological station
Case Study
Case Study: Combined Air Quality and Traffic Monitoring
Different owners
City Council, State AQ Department and projects own sensors
Different data intervals
Traffic: Individual vehicles
(~ 4000 data sets (speed, acceleration, vehicle class)/hour !)
Air Quality & Meteo: fixed frequency, 30 min averages
(48 data points/day)
Pre-processing
Temporal alignment & Aggregation
Goal: Investigating a “black carbon equivalent” for traffic
Accelerating cars have a higher tailpipe emission than “free flowing”
vehicles
Approach:
Q”BC” = Qtotal-vehicles + 6 * Qaccelerating-vehicles
(can be even more complex including weight factors for HGV etc...)
Local (road-side) black carbon concentrations need to be reduced by
“background” values to “isolate” traffic related component
CBC = Croad – Cbackground
And of course wind speed is of interest at the same time ... !
723.05.2014
Solution: What is openUwedat?
OpenUwedat is a toolbox that allows to build Time Series related
Applications
The toolbox contains many ready-made, adaptable programs
The toolbox contains libraries to write your own programs which
integrate seamlessly with the existing ones
Driver
Driver
Database
Driver
configurable
What can I do with openUwedat?
openUwedat allows to interact with any kind of Time Series Device.
You can integrate new devices by writing new modules which act as
„drivers“.
Typical devices are:
Measurement Devices
Data Aquisition Systems (station computers)
Other Time Series Management Systems
Databases (SQL and no-SQL)
…
Implementation in openUwedat
Powerful scripting language “Formula 3”
Real time interfaces and real-time processing pipes
Example code how to implement the BC-Equivalent function in
Formula 3
@A="name=Database;
type=Aggregation;Source=TDS;Sensor=S4.TDS1;Lane=0"
@B="name=Database;
type=Aggregation;Source=TDS;Sensor=S4.TDS1;Lane=1"
<<(A.accCount[i]+B.accCount[i]+A.decCount[i]+B.decCount[i])*6+A.to
talFlow[i]+B.totalFlow[i]>> |
<< sum( _ ]t-60mins..t] ) >> every 60 mins
1023.05.2014
1123.05.2014
Very good correlation! But depending on meteo-conditions. During
episodes of stronger wind, the correlation drops!
Typical Result Traffic / Air Quality
Conclusions
Plenty of heterogeneous data are collected on regular basis by
public authorities day by day
The potential to analyse these data together stays mostly unused
because:
Lack of cooperation between authorities / departments
Lack of interoperability of the systems
Case study on traffic/air quality show potential of how
heterogeneous data analysis creates new insights
AIT’s OpenUwedat data management toolbox allows
Collection of Large Scale Heterogeneous Time-Series Data from
different sources
Complex analysis using a powerful scripting language
1223.05.2014
AIT Austrian Institute of Technology
your ingenious partner
Martin Litzenberger

More Related Content

PDF
Graph hoc cfp
PDF
Graph hoc cfp
PDF
CALL FOR PAPERS - 12th International Conference on Applications of Graph Theo...
PDF
12th International Conference on Applications of Graph Theory in Wireless Ad ...
PDF
13th International Conference on Applications of Graph Theory in Wireless Ad ...
PDF
Call for papers - 12th International Conference on Applications of Graph Theo...
PDF
Call for papers - 12th International Conference on Applications of Graph Theo...
PDF
Call for papers - 12th International Conference on Applications of Graph Theo...
Graph hoc cfp
Graph hoc cfp
CALL FOR PAPERS - 12th International Conference on Applications of Graph Theo...
12th International Conference on Applications of Graph Theory in Wireless Ad ...
13th International Conference on Applications of Graph Theory in Wireless Ad ...
Call for papers - 12th International Conference on Applications of Graph Theo...
Call for papers - 12th International Conference on Applications of Graph Theo...
Call for papers - 12th International Conference on Applications of Graph Theo...

What's hot (15)

PDF
11th International Conference on Applications of Graph Theory in Wireless Ad ...
PPTX
Realtime Big Data Analytics for Event Detection in Highways
PDF
13th International Conference on Applications of Graph Theory in Wireless Ad ...
PDF
11th International Conference on Applications of Graph Theory in Wireless Ad ...
PPTX
The D4Science Infrastructure
PDF
Josep Maria Salanova - Introduction to BDE+SC4
PDF
TranSMART Hackathon Introduction Amsterdam 2015
PDF
Spatial Data Analysis & Visualization with QGIS - Vienna Data Science Meetup
PDF
smart-city-application
PDF
BDE-SC6 Hangout - “Insight into Virtual Currency Ecosystems”
PPTX
BDE SC6 workshop - introduction 2016
PDF
TranSMART Development Highlights Amsterdam 2015
PDF
Luigi Selmi - The Big Data Integrator Platform
PPT
Stochastic kronecker graphs
PDF
SC7 Hangout 3: The BDE Secure Societies Pilot
11th International Conference on Applications of Graph Theory in Wireless Ad ...
Realtime Big Data Analytics for Event Detection in Highways
13th International Conference on Applications of Graph Theory in Wireless Ad ...
11th International Conference on Applications of Graph Theory in Wireless Ad ...
The D4Science Infrastructure
Josep Maria Salanova - Introduction to BDE+SC4
TranSMART Hackathon Introduction Amsterdam 2015
Spatial Data Analysis & Visualization with QGIS - Vienna Data Science Meetup
smart-city-application
BDE-SC6 Hangout - “Insight into Virtual Currency Ecosystems”
BDE SC6 workshop - introduction 2016
TranSMART Development Highlights Amsterdam 2015
Luigi Selmi - The Big Data Integrator Platform
Stochastic kronecker graphs
SC7 Hangout 3: The BDE Secure Societies Pilot
Ad

Similar to Management and Analysis of Large Scale Heterogeneous Time-Series Data (20)

PDF
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
PPT
PPTX
RMC_final
PDF
FIWARE Global Summit - The Digital Single Market - Benefits and Solutions for...
PDF
ENVIROFI for cross domain FI-PPP applications
PPTX
Challenges on geo spatial visual analytics eurographics
PDF
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
PPTX
Collaboration and Decision Making Tool for Emergency & Crises Situations, Ger...
PDF
Open Data Hub - Roberto Cavaliere - Open Data Hub Mobility Data Space
PDF
The NEEDS vs. the WANTS in IoT
PDF
Scientific Cloud Computing: Present & Future
PPTX
Open Government Open Innovation and the Cloud
PPTX
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
PPT
Summer school bz_fp7research_20100708
PDF
Introduction to Cloud Computing
PDF
chapter 4.pdf
DOCX
chapter 4.docx
PDF
TUW-ASE-Summer 2015: Advanced Services Engineering - Introduction
PPTX
Cloud Computing. – Fundamentals.pptx
PPT
Standards Show
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
RMC_final
FIWARE Global Summit - The Digital Single Market - Benefits and Solutions for...
ENVIROFI for cross domain FI-PPP applications
Challenges on geo spatial visual analytics eurographics
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
Collaboration and Decision Making Tool for Emergency & Crises Situations, Ger...
Open Data Hub - Roberto Cavaliere - Open Data Hub Mobility Data Space
The NEEDS vs. the WANTS in IoT
Scientific Cloud Computing: Present & Future
Open Government Open Innovation and the Cloud
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
Summer school bz_fp7research_20100708
Introduction to Cloud Computing
chapter 4.pdf
chapter 4.docx
TUW-ASE-Summer 2015: Advanced Services Engineering - Introduction
Cloud Computing. – Fundamentals.pptx
Standards Show
Ad

More from Danube University Krems, Centre for E-Governance (20)

PPTX
Smart Cities workshop at CeDEM17
PPTX
#CeDEM17 - Towards an Open Data based ICT Reference Architecture for Smart Ci...
PPTX
#CeDEM17 - Financial Payments and Smart Cities
PPTX
#CeDEM2017 Smart Cities of Self-Determined Data Subjects
PPTX
Open Data as Enabler of Public Service Co-creation: Exploring the Drivers and...
PDF
DatalEt-Ecosystem Provider - The DEEP project
PPTX
Towards Open Justice: ICT acceptance in the Greek justice system
PPTX
Using fuzzy cognitive maps as decision support tool for smart cities goraczek
PPTX
Understanding of smartphone divide dal yong
PPTX
The motivations behind open access publishing judith schossboeck
PPTX
Social media as hobed of racism and hate speech kobayashi, kaigo, kwak
PDF
Social media and citizen engagement in asia skoric
PDF
Realizin modeling and evaluation city's enerfy efficiency leonidas anthopoulos
PDF
Post 2015 paris c limate conference politics on the internet manuela hartwig
PPTX
Open government and national sovereignty ivo babaja
PPTX
Health r isk communication in the digital era myojung chung
PPTX
An analysis of japanese local government facebook profiles muneo kaigo
PDF
Datenschutzbeauftragte werden in Zukunft eine wichtige Rolle im Unternehmen s...
Smart Cities workshop at CeDEM17
#CeDEM17 - Towards an Open Data based ICT Reference Architecture for Smart Ci...
#CeDEM17 - Financial Payments and Smart Cities
#CeDEM2017 Smart Cities of Self-Determined Data Subjects
Open Data as Enabler of Public Service Co-creation: Exploring the Drivers and...
DatalEt-Ecosystem Provider - The DEEP project
Towards Open Justice: ICT acceptance in the Greek justice system
Using fuzzy cognitive maps as decision support tool for smart cities goraczek
Understanding of smartphone divide dal yong
The motivations behind open access publishing judith schossboeck
Social media as hobed of racism and hate speech kobayashi, kaigo, kwak
Social media and citizen engagement in asia skoric
Realizin modeling and evaluation city's enerfy efficiency leonidas anthopoulos
Post 2015 paris c limate conference politics on the internet manuela hartwig
Open government and national sovereignty ivo babaja
Health r isk communication in the digital era myojung chung
An analysis of japanese local government facebook profiles muneo kaigo
Datenschutzbeauftragte werden in Zukunft eine wichtige Rolle im Unternehmen s...

Recently uploaded (20)

PPT
Adolescent Health Orientation and Health care
PDF
buyers sellers meeting of mangoes in mahabubnagar.pdf
PDF
Population Estimates 2025 Regional Snapshot 08.11.25
PPTX
GOVERNMENT-ACCOUNTING1. bsa 4 government accounting
PPTX
Introduction_to_the_Study_of_Globalization.pptx
PPTX
怎么办休斯敦大学维多利亚分校毕业证电子版成绩单办理|UHV在读证明信
PPTX
GSA Q+A Follow-Up To EO's, Requirements & Timelines
PDF
Item # 2 - 934 Patterson Specific Use Permit (SUP)
PPTX
26.1.2025 venugopal K Awarded with commendation certificate.pptx
PDF
26.1.2025 venugopal K Awarded with commendation certificate.pdf
PDF
Abhay Bhutada and Other Visionary Leaders Reinventing Governance in India
PPTX
Vocational Education for educational purposes
PDF
PPT Item #s 2&3 - 934 Patterson SUP & Final Review
PPTX
Omnibus rules on leave administration.pptx
PDF
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
PPTX
Quiz - Saturday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
PPT
Quality Management Ssystem PPT - Introduction.ppt
PDF
Courtesy Meeting NIPA and MBS Australia.
PPTX
The DFARS - Part 250 - Extraordinary Contractual Actions
PDF
Item # 3 - 934 Patterson Final Review.pdf
Adolescent Health Orientation and Health care
buyers sellers meeting of mangoes in mahabubnagar.pdf
Population Estimates 2025 Regional Snapshot 08.11.25
GOVERNMENT-ACCOUNTING1. bsa 4 government accounting
Introduction_to_the_Study_of_Globalization.pptx
怎么办休斯敦大学维多利亚分校毕业证电子版成绩单办理|UHV在读证明信
GSA Q+A Follow-Up To EO's, Requirements & Timelines
Item # 2 - 934 Patterson Specific Use Permit (SUP)
26.1.2025 venugopal K Awarded with commendation certificate.pptx
26.1.2025 venugopal K Awarded with commendation certificate.pdf
Abhay Bhutada and Other Visionary Leaders Reinventing Governance in India
Vocational Education for educational purposes
PPT Item #s 2&3 - 934 Patterson SUP & Final Review
Omnibus rules on leave administration.pptx
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
Quiz - Saturday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
Quality Management Ssystem PPT - Introduction.ppt
Courtesy Meeting NIPA and MBS Australia.
The DFARS - Part 250 - Extraordinary Contractual Actions
Item # 3 - 934 Patterson Final Review.pdf

Management and Analysis of Large Scale Heterogeneous Time-Series Data

  • 1. Management and Analysis of Large Scale Heterogeneous Time-Series Data Sensor and Government Data: Their Role in Public Policy Martin Litzenberger Safety and Security Department AIT Austrian Institute of Technology Martin Litzenberger | Senior Engineer | DSS SNI
  • 2. Motivation A plethora of heterogeneous data are collected by public institutions with various sensors today But the data and their use are (usually) restricted to the domain or departments they belong to, e.g. security surveillance, traffic, public transport, air quality, power grids, ... Reasons: Lack of interoperability and often lack of communication and cooperation of data owners 223.05.2014
  • 3. Advantages Connecting these data or even collecting them on a common platform would allow for new ways of analysis and insight into important and interesting mechanisms (e.g. traffic / air quality) But data are heterogeneous in many aspects such as: format, update frequency, representation, owner, accessibility .. which makes a joint analysis a big challenge Real-time 24/7 processing and availability, not a “one-time” academic investigation! 323.05.2014
  • 4. Challenge: Heterogeneity of Data Temporal heterogeneity Discrete events versus regular time series Spatial heterogeneity „On-site“ versus „as near as possible“ Semantic heterogeneity The same parameters might have different significance under different context Technical heterogeneity Non-standardized interfaces, formats, etc. Political heterogeneity “Owners” of data have different missions and goals 423.05.2014
  • 5. 523.05.2014 Investigating effects of traffic state (free flow/stop&go) on local air quality Data sources Traffic monitor for traffic volume and acceleration Black carbon sensor at road side and a background station Meteorological station Case Study
  • 6. Case Study: Combined Air Quality and Traffic Monitoring Different owners City Council, State AQ Department and projects own sensors Different data intervals Traffic: Individual vehicles (~ 4000 data sets (speed, acceleration, vehicle class)/hour !) Air Quality & Meteo: fixed frequency, 30 min averages (48 data points/day) Pre-processing Temporal alignment & Aggregation
  • 7. Goal: Investigating a “black carbon equivalent” for traffic Accelerating cars have a higher tailpipe emission than “free flowing” vehicles Approach: Q”BC” = Qtotal-vehicles + 6 * Qaccelerating-vehicles (can be even more complex including weight factors for HGV etc...) Local (road-side) black carbon concentrations need to be reduced by “background” values to “isolate” traffic related component CBC = Croad – Cbackground And of course wind speed is of interest at the same time ... ! 723.05.2014
  • 8. Solution: What is openUwedat? OpenUwedat is a toolbox that allows to build Time Series related Applications The toolbox contains many ready-made, adaptable programs The toolbox contains libraries to write your own programs which integrate seamlessly with the existing ones Driver Driver Database Driver configurable
  • 9. What can I do with openUwedat? openUwedat allows to interact with any kind of Time Series Device. You can integrate new devices by writing new modules which act as „drivers“. Typical devices are: Measurement Devices Data Aquisition Systems (station computers) Other Time Series Management Systems Databases (SQL and no-SQL) …
  • 10. Implementation in openUwedat Powerful scripting language “Formula 3” Real time interfaces and real-time processing pipes Example code how to implement the BC-Equivalent function in Formula 3 @A="name=Database; type=Aggregation;Source=TDS;Sensor=S4.TDS1;Lane=0" @B="name=Database; type=Aggregation;Source=TDS;Sensor=S4.TDS1;Lane=1" <<(A.accCount[i]+B.accCount[i]+A.decCount[i]+B.decCount[i])*6+A.to talFlow[i]+B.totalFlow[i]>> | << sum( _ ]t-60mins..t] ) >> every 60 mins 1023.05.2014
  • 11. 1123.05.2014 Very good correlation! But depending on meteo-conditions. During episodes of stronger wind, the correlation drops! Typical Result Traffic / Air Quality
  • 12. Conclusions Plenty of heterogeneous data are collected on regular basis by public authorities day by day The potential to analyse these data together stays mostly unused because: Lack of cooperation between authorities / departments Lack of interoperability of the systems Case study on traffic/air quality show potential of how heterogeneous data analysis creates new insights AIT’s OpenUwedat data management toolbox allows Collection of Large Scale Heterogeneous Time-Series Data from different sources Complex analysis using a powerful scripting language 1223.05.2014
  • 13. AIT Austrian Institute of Technology your ingenious partner Martin Litzenberger