Mathias Herberts – CTO Cityzen Data
@CityzenSciences @CityzenData
@herberts
We are french startups
Yes, France has other cities than Paris!
Opening an international office soon
IoT Silicon Valley - Cityzen Sciences and Cityzen Data presentation
Members
Partners
Technology Provider
Conductive, elastic yarn
Technology Provider
Tooling for yarn preparation (gimp)
Technology Provider
Tooling for weaving the conductive yarn in fabric
Technology Provider
Flexible micro sensors
Technology Provider
All at industrial scale, ready for mass production
Innovation Enabler
We work with brands to help them integrate our technologies
Fabric is everywhere
D-Shirt™
Embedded EKG sensor
#FFN
On a flex PCB in phase 1
More integrated in phase 2
Less than 50 grams
Battery to power the sensors (10 hours battery life)
Accelerometers/Gyroscopes
GPS
Altimeter
Data logger
BT and BLE connectivity
Open platform based on OSGi
Gateway
Energy Harvesting
Companion App
IoT Silicon Valley - Cityzen Sciences and Cityzen Data presentation
Platform tailored for sensor data
Machines produce massive amounts of data, far more than humans
Our initial focus
D-Shirt ™
A space-time continuum
Our platform handles Geo Time Series
Performance requirements
D-Shirt ™
200,000 data points per hour
A platform for creating apps for a large user base, not a select group of analysts
Performance evaluation
TS 101M (10M readings from 1M different sensors)
TS 101 (10M readings from a single sensor)
125,000 readings/s ingestion rate per core in the worst case
200,000 readings/s retrieval rate
500,000 readings/s ingestion rate per core
500,000 readings/s retrieval rate
Security considerations
a language dedicated to manipulating Geo Time Series
Data manipulation environment
Where do we fit in?
Beyond Smart Fabric
0111100011001010010110111110110100110111110111010100101111111111101101110110101111111011110010100111011010000110
0101111010010001000101101110001001100110100111101011111100101110100100110101001001101000011110110110010001001001
1001101110101000001011000101110101010111110000100101100110110101101011110101001001110010011001011101011101011111
0101111110011000100111001101010101110111000111010101101100101110111100100110111001000101011001011110101001110101
0101111011101100100001000111010111111010011011010111111111001001110111100010111100001100100111111110110001101101
0101110010011000111101101011011010011101011101111001001001111011100111100001111101011000001111110101110101011001
0001110011111011010001111100111111110001000100000011000100011111011000011010111111111111100001011010101010101111
1011010001010110100010111100100111010001110011010111101101000001010010111110101101100011000110010111001011001011
1011100010010101100011011101001111100100011011000000011000111011000101101111001110111110001010111101001110001010
0111111110100010101001000011001101110010110111101111100110011010101000010010011101011011011110011010001100000110
Consumer IoT
Complete control on data usage
Ecosystem of third party applications
Apps must have user consent - data producer advantage
// Stack based language
// RPL like language (language of HP RPN calculators)
// Script results are returned as a JSON array of stack levels
// Has macros, conditionals, loops
// Has close to 250 functions, both simple and complex ones
// Includes data manipulation frameworks
// Scripts execute server side, where the data resides
Geo Time Series have a class and a set of valued labels
Data model
class{label0=v0,label1=v1}
Readings have one of four types
LONG DOUBLE BOOLEAN STRING
Example Result
[
{"c":"com.cityzendata.test","l":{".app":"doc.data","origin":"random"},"a":{},"v":
[[1395145993000500,97.5],[1395145993000499,99],[1395145993000498,103.5],
[1395145993000497,104],[1395145993000496,103.5],[1395145993000495,100],
[1395145993000494,99.5],[1395145993000493,101],[1395145993000492,97.5],
[1395145993000491,100],[1395145993000490,99.5],[1395145993000489,100],
[1395145993000488,96.5],[1395145993000487,98],[1395145993000486,98.5],[1395145993000485,94],
[1395145993000484,98.5],[1395145993000483,103],[1395145993000482,100.5],
[1395145993000481,98],[1395145993000480,101.5],[1395145993000479,101]},
{"c":"mozfest.light","l":
{"area":"1",".app":"doc.data","xbeeId":"XBee_40670F0D","moteId":"53"},"a":{},"v":
[[1382714858837000,51.50198796764016,0.005952995270490646,381],
[1382714858830000,51.50198796764016,0.005952995270490646,291],
[1382714828049000,51.50198796764016,0.005952995270490646,230],
[1382714797974000,51.50198796764016,0.005952995270490646,203]}
]
MAP – Do computations on a sliding window
BUCKETIZE – Normalize the ticks of series
REDUCE – Do computations on series partitioned by labels
APPLY – Apply n-ary functions to partitions of n lists of series
FILTER – Filter series previously partitioned by labels
Einstein Frameworks
POST /api/v0/update HTTP/1.0
Host: api.cityzendata.net
X-CityzenData-Token: <TOKEN>
<TIMESTAMP>/<LAT>:<LON>/<ELEV> <CLASS>{<LABELS} <VALUE>
...
1400000000000000/37.7833:-122.4167/90000 temp{th=1} 51
1400000060000000/37.7833:-122.4167/90000 temp{th=1} 51
1400000120000000/37.7833:-122.4167/90000 temp{th=1} 52
1400000180000000/37.7833:-122.4167/90000 temp{th=1} 53
1400000240000000/37.7833:-122.4167/90000 temp{th=1} 51
...
1400244098138164// 心拍数 {id=1} 85
Data ingestion using HTTP
Exponential Smoothing
// Retrieve data
'TOKEN'
'~linux.*loadavg.1' // Regexp for selecting class
'host' '10.0.0.1' 2 ->MAP // Label selector
'2014-01-01T00:00:00.000Z' '2014-05-01T00:00:00.000Z'
5 ->LIST FETCH
// Apply Single Exponential Smoothing
0.0025 SINGLEEXPONENTIALSMOOTHING // Alpha=0.0025
// Only retain 1000 values
bucketizer.mean 0 0 1000 5 ->LIST BUCKETIZE
Exponential Smoothing
Raw data
Smoothed data
Pattern Detection
// Retrieve data
...
FETCH
// Detect pattern (pattern extracted via PATTERNS)
'......2z' 1 ->LIST
160 // Sliding window
8 // Pattern length
64 // Quantization scale
PATTERNDETECTION
Pattern Detection
Discord detection
// Retrieve some data
...
FETCH
64 // Size of discord detection window
16 // Pattern length
512 // Quantization scale
1 // We want to detect a single discord
false // Discords are not allowed to overlap (N/A here)
2.0 // Distance ratio between nearest neighbors
DISCORDS
Discord detection
Discord detection
Discord detection
Ability to use Einstein on very large numbers of series
Hadoop integration
Publish/Subscribe mechanism based on WebSockets
Plasma
Real Time Dashboards
Complex Event Processing
Storm Esper
R integration
Machine learning functions (anomaly detection and forecasting)
Extensions to Einstein
WebComponents for dataviz
...
Roadmap
www.cityzensciences.fr/en
www.smartsensing.fr/en
www.cityzendata.com
@CityzenSciences @CityzenData

More Related Content

PPT
Triple screen viewing practices : diversification or compartmentalization?
PDF
MISSILE TELEMETRY DATALINK CALCULATION (A MATLAB PROGRAM)
PDF
Warp 10 Platform Presentation - Criteo Beer & Tech 2016-02-03
PPTX
How to find what you didn't know to look for, oractical anomaly detection
PDF
Ellen Friedman - Keynote NoSQL matters Barcelona 2014
PDF
No more (unsecure) secrets, Marty
PDF
Big Data - Open Coffee Brest - 20121121
PDF
This is not your father's monitoring.
Triple screen viewing practices : diversification or compartmentalization?
MISSILE TELEMETRY DATALINK CALCULATION (A MATLAB PROGRAM)
Warp 10 Platform Presentation - Criteo Beer & Tech 2016-02-03
How to find what you didn't know to look for, oractical anomaly detection
Ellen Friedman - Keynote NoSQL matters Barcelona 2014
No more (unsecure) secrets, Marty
Big Data - Open Coffee Brest - 20121121
This is not your father's monitoring.

Viewers also liked (13)

PDF
Artimon - Apache Flume (incubating) NYC Meetup 20111108
PDF
The Hadoop Ecosystem
PDF
Programmation fonctionnelle
PPTX
Unit 1 transport systems in animals
PDF
Smart Fabric - This shirt knows when you are tired, excited or unwell! - City...
PDF
Scala : programmation fonctionnelle
PDF
Time Series Analysis with Spark
PPTX
Frustration-Reduced Spark: DataFrames and the Spark Time-Series Library
PDF
The Lambda Calculus and The JavaScript
PDF
Analyzing Time Series Data with Apache Spark and Cassandra
PDF
Programmation fonctionnelle en JavaScript
PDF
Comprendre la programmation fonctionnelle, Blend Web Mix le 02/11/2016
PDF
Mathias Herberts fait le retour d'expérience Hadoop au Crédit Mutuel Arkéa
Artimon - Apache Flume (incubating) NYC Meetup 20111108
The Hadoop Ecosystem
Programmation fonctionnelle
Unit 1 transport systems in animals
Smart Fabric - This shirt knows when you are tired, excited or unwell! - City...
Scala : programmation fonctionnelle
Time Series Analysis with Spark
Frustration-Reduced Spark: DataFrames and the Spark Time-Series Library
The Lambda Calculus and The JavaScript
Analyzing Time Series Data with Apache Spark and Cassandra
Programmation fonctionnelle en JavaScript
Comprendre la programmation fonctionnelle, Blend Web Mix le 02/11/2016
Mathias Herberts fait le retour d'expérience Hadoop au Crédit Mutuel Arkéa
Ad

Similar to IoT Silicon Valley - Cityzen Sciences and Cityzen Data presentation (20)

PDF
Testing in a distributed world
PDF
Phil Day [Configured Things] | Policy-Driven Real-Time Data Filtering from Io...
PDF
Testing in the IoT Era
PDF
WSO2 Complex Event Processor - Product Overview
PDF
TEMS™ Pocket | Datasheet | Infovista
PDF
TEMS™ Pocket – Portable network testing and troubleshooting solution | Datash...
PPTX
Industrial Edge.pptx
PPT
Experimenting with the OSGi platform in the Aspire RFID middleware
PPTX
Flink Forward San Francisco 2018: David Reniz & Dahyr Vergara - "Real-time m...
PDF
Extended QA testing to enhance productivity of a leading networking solution ...
PDF
TEMS™ Cloud Datasheet | Infovista
PDF
Functional Verification of Large-integers Circuits using a Cosimulation-base...
PDF
TEMS™ Investigation | Datasheet | Infovista
PPT
CloudSME EU project by Tamas Kiss
PDF
Internet of Things: Understanding the Business Impact
PDF
Synergie-Cad we know how to connect
PDF
Fire Brochure 2015
PDF
Keynote: Trends in Modern Application Development - Gilly Dekel, IBM
PDF
Ernest: Efficient Performance Prediction for Advanced Analytics on Apache Spa...
Testing in a distributed world
Phil Day [Configured Things] | Policy-Driven Real-Time Data Filtering from Io...
Testing in the IoT Era
WSO2 Complex Event Processor - Product Overview
TEMS™ Pocket | Datasheet | Infovista
TEMS™ Pocket – Portable network testing and troubleshooting solution | Datash...
Industrial Edge.pptx
Experimenting with the OSGi platform in the Aspire RFID middleware
Flink Forward San Francisco 2018: David Reniz & Dahyr Vergara - "Real-time m...
Extended QA testing to enhance productivity of a leading networking solution ...
TEMS™ Cloud Datasheet | Infovista
Functional Verification of Large-integers Circuits using a Cosimulation-base...
TEMS™ Investigation | Datasheet | Infovista
CloudSME EU project by Tamas Kiss
Internet of Things: Understanding the Business Impact
Synergie-Cad we know how to connect
Fire Brochure 2015
Keynote: Trends in Modern Application Development - Gilly Dekel, IBM
Ernest: Efficient Performance Prediction for Advanced Analytics on Apache Spa...
Ad

More from Mathias Herberts (7)

PDF
2019-09-25 Paris Time Series Meetup - Warp 10 - Advanced Time Series Technolo...
PDF
20170516 hug france-warp10-time-seriesanalysisontopofhadoop
PDF
Big Data Tribute
PDF
Hadoop Pig Syntax Card
PDF
Hadoop Pig
PDF
WebScale Computing and Big Data a Pragmatic Approach
PDF
Leveraging Hadoop for Legacy Systems
2019-09-25 Paris Time Series Meetup - Warp 10 - Advanced Time Series Technolo...
20170516 hug france-warp10-time-seriesanalysisontopofhadoop
Big Data Tribute
Hadoop Pig Syntax Card
Hadoop Pig
WebScale Computing and Big Data a Pragmatic Approach
Leveraging Hadoop for Legacy Systems

Recently uploaded (20)

PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Five Habits of High-Impact Board Members
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PDF
Unlock new opportunities with location data.pdf
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
A comparative study of natural language inference in Swahili using monolingua...
DOCX
search engine optimization ppt fir known well about this
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
Architecture types and enterprise applications.pdf
PPT
What is a Computer? Input Devices /output devices
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
Enhancing emotion recognition model for a student engagement use case through...
PPTX
observCloud-Native Containerability and monitoring.pptx
PPT
Geologic Time for studying geology for geologist
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Tartificialntelligence_presentation.pptx
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Five Habits of High-Impact Board Members
Hindi spoken digit analysis for native and non-native speakers
Zenith AI: Advanced Artificial Intelligence
Taming the Chaos: How to Turn Unstructured Data into Decisions
Unlock new opportunities with location data.pdf
A contest of sentiment analysis: k-nearest neighbor versus neural network
A comparative study of natural language inference in Swahili using monolingua...
search engine optimization ppt fir known well about this
Getting started with AI Agents and Multi-Agent Systems
Architecture types and enterprise applications.pdf
What is a Computer? Input Devices /output devices
NewMind AI Weekly Chronicles – August ’25 Week III
Developing a website for English-speaking practice to English as a foreign la...
WOOl fibre morphology and structure.pdf for textiles
Enhancing emotion recognition model for a student engagement use case through...
observCloud-Native Containerability and monitoring.pptx
Geologic Time for studying geology for geologist
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Tartificialntelligence_presentation.pptx

IoT Silicon Valley - Cityzen Sciences and Cityzen Data presentation