SlideShare a Scribd company logo
Geospatial Big Data
Business Cases from proDataMarket
Dumitru Roman
dumitru.roman@sintef.no
Geospatial Big Data
Property Data and the proDataMarket project
Example Business Cases
Data Marketplace
http://www.millennium-project.org/
Geospatial Big Data
are
societal
opportunities
Geospatial Big Data
Raster Vector Sensors Mobile
It’s easier than ever to collect geospatial data,
but how can we exploit these geospatial big data?
Example: Property data
One of the most valuable datasets managed by
governments worldwide
Extensively used in various domains by private and
public organizations
Challenges in working with property data
• Difficult to access
• Cross-sectors
• Data is highly heterogeneous and possibly large
• Data quality
• Time-consuming integration
• Lack of innovation
• …
How can we innovate (and make money)
with property-related (Open) Data?
proDataMarket project goals
• To make property data more accessible,
more usable and easier to understand
• To make it easier for:
• Property data providers to publish and
distribute their data
• Data consumers to find and access
property data needed for their businesses
2.5 Years
(2015-2017)
€4.5M
20+
Datasets
proDataMarket deliveries
7
data-driven business
products and
services
1
data
marketplace
Example business case #1
Objective evaluation of the real estate properties
Business Intelligence companies
(e.g. Cerved)
Automation and cost-reduction in
property valuations, new services
Public administration Fact-driven social policy
Real estate agencies Speed up evaluation of properties, more
objective estimation of properties
Property buyers/sellers Eliminate intermediaries
Example business case #1 (cont’)
Objective evaluation of the real estate properties
in Italy, by
Istat Census
Snapshot of Italy, socio-
demographic data about: house
(its characteristics), people of the
family (personal data, education,
profession, work / study place)
people that live in house (guests)
OpenStreetMap
Point of interest of the city
about transport, downtown,
environment
Cadastral report
Property details (surface,
cadastral category, quality
status, age, ownership details)
~ 10M buildings
The evaluation of real estate
property
An up-to-date, objective evaluation
of the real estate properties in
territories in Italy
=
Market price €
++
=+
SYNTETIC INDEX ISTAT = -0.23 - (0.12 * UNEMPLOYED) + (0.2 * HISTORIC_BUILDINGS) + (0.58 * GRADUATES_ON_RESIDENTS) + (0.6 * STUDENTS_ON_RESIDENTS)
SYNTETIC INDEX POI = -0.5 + (0.15 * closest_metro_station) + (0.14 * closest_railway_station) + (0.24 * n_bus_stops_within_800m) +
(0.6 * n_small_green_areas_pois_within_800m) + (0.02 * n_pedestrian_paths_within_1000m) - (0.05 * closest_airport)
Example business case #1 (cont’)
Objective evaluation of the real estate properties
in Italy, by
Example business case #1 (cont’)
Objective evaluation of the real estate properties
in Italy, by
Sample technical challenges
Semantic data heterogeneity
How to translate a point of interest
into an OSM query?
How to retrieve data from the
whole Italy?
Structural data heterogeneity
How to compute indicators on
different data structures?
Messy data
How to exclude from computation
duplicated annotations of the same
real-world entities?
• Stakeholders:
• Public administration (e.g. FEGA
in Spain)
• Farmers and land owners
• Intermediaries (e.g. service
providers)
• Problems:
• Unfair grant assignment and
expenditure on audits
• Incorrect grant assignments
• Features defined subjectively
Example business case #2
Common Agriculture Policy (CAP) funds assignments
in Spain, by
Cadaster Information
Parcels and their features:
surfaces, limits, slope….
EFAs & LEs
Ecological focused areas and
Landscape elements accurately
defined using LIDAR
Satellite
Kind of crops, Health
status, Set aside zones,
Nitrogen fixing crops, CO2
fixing crops…
Accurately defined
CAP parameters objectively
defined, Automated process to
create new datasets related to
CAP Funds, Less errors, Less
audits and field visits…
=
CAP Funds
++
Fund assignment rules examples
• Crop Diversification
• Kind, density and surface of Ecological Focus Areas
• Conditionality
Example business case #2 (cont’)
Common Agriculture Policy (CAP) funds assignments
in Spain, by
4) There are patterns:
Groups, lines, isolated
trees, etc.
5) Trees in line, hedges
Non-aligned groups,
copses
6) A viewer
2) Classified points by
their height
1) Raw datasets,
just points
3) Points are grouped: Yellow
(soil), Green (trees), Orange
(bushes)
Example business case #2 (cont’)
Common Agriculture Policy (CAP) funds assignments
in Spain, by
Example business case #3 (cont’)
Augment Reality (AR) for Property-related Data
in Norway, by
AR for buildings AR for underground infrastructure
What’s the impact of a new
building on its surroundings?
Where are the underground pipes?
Geospatial Big Data: Business Cases from proDataMarket
Geospatial Big Data: Business Cases from proDataMarket
• A hard copy of 314 pages and as a PDF
file
• 6 Person-Months
• Data collection with spreadsheets
• Quality assurance through e-mails and
phone correspondence
Pains: Time consuming, Poor data quality,
Static report without live updating
• Live service
• Efficient sharing of data
• Simplified integration with
external datasets
• Live updating
• Reliable access
• …
• Risk and vulnerability analysis, e.g.
buildings affected by flooding
• Analysis of leasing prices
Report Reporting Service 3rd party services
Example business case #4
Reporting state-owned real estate properties
in Norway, by
https://datagraft.net 21
Linked Data Approach: DataGraft
DataGraft: Data Transformation and
Knowledge Graph Publication Process
• Interactive design of transformations
• Repeatable transformations
• Reuse/share transformations (user-
based access)
• Cloud-based deployment of
transformations
• Self-serviced process
• Data and Transformation as-a-Service
22
Transform
Generate
RDF
Ontology X
Ontology X
Ontology X
Ontology
mapping
RDF Graph
Raw Data Prepared Data
Map
Map
Semantic graph
database
Geospatial Data is BIGthing
Innovation with property-related
data in proDataMarket
Thank you!
Contact: dumitru.roman@sintef.no
http://prodatamarket.eu
https://datagraft.net
@prodatamarket
05.05.2016 25

More Related Content

PPTX
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
PPT
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
PPTX
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
PDF
Industry@RuleML2015 DataGraft
PPTX
From open data to data-driven services
PDF
Service innovation: the hidden value of open data
PPTX
Core Activities
PDF
Wed roman tut_open_datapub
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
Industry@RuleML2015 DataGraft
From open data to data-driven services
Service innovation: the hidden value of open data
Core Activities
Wed roman tut_open_datapub

What's hot (20)

PPTX
Analytical tools
PPTX
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
PDF
Open Data Support - bridging open data supply and demand
PPT
DATA WAREHOUSING AND DATA MINING
PDF
Using the Semantic Web Stack to Make Big Data Smarter
PPTX
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
PPT
Applying Digital Library Metadata Standards
PPTX
web mining
PDF
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
PPTX
Introduction to Metadata
PPTX
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
PPTX
Semantic Information Management using PoolParty 4
PPTX
JOSA TechTalk: Metadata Management
in Big Data
PPTX
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
PPTX
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
PPTX
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
PPTX
PoolParty 4 - From Text Mining to Linked Data
PDF
ENGAGE Workshop at OpenDataWeek2013
PDF
Oracle NoSQL DB & InfiniteGraph - Trends in Big Data and Graph Technology
PPTX
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
Analytical tools
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
Open Data Support - bridging open data supply and demand
DATA WAREHOUSING AND DATA MINING
Using the Semantic Web Stack to Make Big Data Smarter
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
Applying Digital Library Metadata Standards
web mining
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
Introduction to Metadata
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
Semantic Information Management using PoolParty 4
JOSA TechTalk: Metadata Management
in Big Data
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
PoolParty 4 - From Text Mining to Linked Data
ENGAGE Workshop at OpenDataWeek2013
Oracle NoSQL DB & InfiniteGraph - Trends in Big Data and Graph Technology
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
Ad

Viewers also liked (16)

PDF
BNI - Raimund Wiesinger (Spinnraum) - 10 Minuten Präsentation
PPT
Objectius gapi
DOC
Q. leyes de los gases
PDF
Artower Punta del Este
PDF
157 2016 norma colorazione ogive
PDF
Margherita_Bandini_reference_letter_Apr2012
PPTX
Actividad de cierre
PPT
Innovation, Environmental Policy And Lock In Effects
PDF
Eremin 9klass
PPTX
Calculo integral actividad de cierre
PDF
132 2016 esempi infortuni suva- caduta dalla scala
PPTX
Homeopatía en enfermedades respiratorias en niños, puebla
DOC
PDF
It octobus 2016_01
PPT
Aromaticos
PDF
Massive Sensors Array for Precision Sensing
BNI - Raimund Wiesinger (Spinnraum) - 10 Minuten Präsentation
Objectius gapi
Q. leyes de los gases
Artower Punta del Este
157 2016 norma colorazione ogive
Margherita_Bandini_reference_letter_Apr2012
Actividad de cierre
Innovation, Environmental Policy And Lock In Effects
Eremin 9klass
Calculo integral actividad de cierre
132 2016 esempi infortuni suva- caduta dalla scala
Homeopatía en enfermedades respiratorias en niños, puebla
It octobus 2016_01
Aromaticos
Massive Sensors Array for Precision Sensing
Ad

Similar to Geospatial Big Data: Business Cases from proDataMarket (20)

PDF
proDataMarket presentation at "Linked Data Europe: Big Geospatial Data"
PPTX
R3 TREES - Integrated Management of Urban Green Areas
PDF
Building blocks for fair digital society
PDF
Chapter 3 introduction to the smart city concept, AUST 2015
PDF
CONSTRUCTION & CITIES 4.0… A *REAL* WORK IN PROGRESS
PPTX
Internet of Things - Call presentations and hints from presenters
PDF
Data IS the new dollar
PPT
INSPIREd computing for EO Based Services
PPTX
Verso le trusted smart statistics - prospettive di sviluppo e risultati del e...
PPT
Standard geodata models for Energy Performance of Buildings: experiences from...
PDF
ICTs for Green Growth: A Priority for Science Policy? - Richard Labelle, ICTs...
PPTX
CKX: Wellbeing Toronto - More Than Just a Map
PDF
Steps towards a Data Value Chain
PPTX
Bruce Thompson on digital disruption and the environment
PPTX
Big Data in a Digital City. Key Insights from the Smart City Case Study
PDF
Data Ecosystems for Geospatial Data
PPTX
Data sharing between private companies and research facilities
PPTX
DIGITAL TECHNOLOGY
PDF
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
PDF
L'economia europea dei dati. Politiche europee e opportunità di finanziamento...
proDataMarket presentation at "Linked Data Europe: Big Geospatial Data"
R3 TREES - Integrated Management of Urban Green Areas
Building blocks for fair digital society
Chapter 3 introduction to the smart city concept, AUST 2015
CONSTRUCTION & CITIES 4.0… A *REAL* WORK IN PROGRESS
Internet of Things - Call presentations and hints from presenters
Data IS the new dollar
INSPIREd computing for EO Based Services
Verso le trusted smart statistics - prospettive di sviluppo e risultati del e...
Standard geodata models for Energy Performance of Buildings: experiences from...
ICTs for Green Growth: A Priority for Science Policy? - Richard Labelle, ICTs...
CKX: Wellbeing Toronto - More Than Just a Map
Steps towards a Data Value Chain
Bruce Thompson on digital disruption and the environment
Big Data in a Digital City. Key Insights from the Smart City Case Study
Data Ecosystems for Geospatial Data
Data sharing between private companies and research facilities
DIGITAL TECHNOLOGY
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
L'economia europea dei dati. Politiche europee e opportunità di finanziamento...

More from dapaasproject (7)

PDF
DataGraft: Data-as-a-Service for Open Data
PDF
Data-as-a-Service: DataGraft
PDF
"Cerved - A business perspective"
PDF
proDataMarket presentation at "European Data Forum"
PDF
proDataMarket presentation at "Spatial Data on The Web"
PDF
DataGraft: Data-as-a-Service for Open Data
PDF
DataGraft: Data-as-a-Service for Open Data
DataGraft: Data-as-a-Service for Open Data
Data-as-a-Service: DataGraft
"Cerved - A business perspective"
proDataMarket presentation at "European Data Forum"
proDataMarket presentation at "Spatial Data on The Web"
DataGraft: Data-as-a-Service for Open Data
DataGraft: Data-as-a-Service for Open Data

Recently uploaded (20)

PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPT
Quality review (1)_presentation of this 21
PDF
Mega Projects Data Mega Projects Data
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
Lecture1 pattern recognition............
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Database Infoormation System (DBIS).pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Business Acumen Training GuidePresentation.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Quality review (1)_presentation of this 21
Mega Projects Data Mega Projects Data
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction to Knowledge Engineering Part 1
ISS -ESG Data flows What is ESG and HowHow
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Miokarditis (Inflamasi pada Otot Jantung)
Fluorescence-microscope_Botany_detailed content
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Lecture1 pattern recognition............
.pdf is not working space design for the following data for the following dat...
Galatica Smart Energy Infrastructure Startup Pitch Deck
Database Infoormation System (DBIS).pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Business Acumen Training GuidePresentation.pptx

Geospatial Big Data: Business Cases from proDataMarket

  • 1. Geospatial Big Data Business Cases from proDataMarket Dumitru Roman dumitru.roman@sintef.no
  • 2. Geospatial Big Data Property Data and the proDataMarket project Example Business Cases Data Marketplace
  • 4. Geospatial Big Data Raster Vector Sensors Mobile It’s easier than ever to collect geospatial data, but how can we exploit these geospatial big data?
  • 5. Example: Property data One of the most valuable datasets managed by governments worldwide Extensively used in various domains by private and public organizations
  • 6. Challenges in working with property data • Difficult to access • Cross-sectors • Data is highly heterogeneous and possibly large • Data quality • Time-consuming integration • Lack of innovation • …
  • 7. How can we innovate (and make money) with property-related (Open) Data?
  • 8. proDataMarket project goals • To make property data more accessible, more usable and easier to understand • To make it easier for: • Property data providers to publish and distribute their data • Data consumers to find and access property data needed for their businesses 2.5 Years (2015-2017) €4.5M 20+ Datasets
  • 10. Example business case #1 Objective evaluation of the real estate properties Business Intelligence companies (e.g. Cerved) Automation and cost-reduction in property valuations, new services Public administration Fact-driven social policy Real estate agencies Speed up evaluation of properties, more objective estimation of properties Property buyers/sellers Eliminate intermediaries
  • 11. Example business case #1 (cont’) Objective evaluation of the real estate properties in Italy, by Istat Census Snapshot of Italy, socio- demographic data about: house (its characteristics), people of the family (personal data, education, profession, work / study place) people that live in house (guests) OpenStreetMap Point of interest of the city about transport, downtown, environment Cadastral report Property details (surface, cadastral category, quality status, age, ownership details) ~ 10M buildings The evaluation of real estate property An up-to-date, objective evaluation of the real estate properties in territories in Italy = Market price € ++
  • 12. =+ SYNTETIC INDEX ISTAT = -0.23 - (0.12 * UNEMPLOYED) + (0.2 * HISTORIC_BUILDINGS) + (0.58 * GRADUATES_ON_RESIDENTS) + (0.6 * STUDENTS_ON_RESIDENTS) SYNTETIC INDEX POI = -0.5 + (0.15 * closest_metro_station) + (0.14 * closest_railway_station) + (0.24 * n_bus_stops_within_800m) + (0.6 * n_small_green_areas_pois_within_800m) + (0.02 * n_pedestrian_paths_within_1000m) - (0.05 * closest_airport) Example business case #1 (cont’) Objective evaluation of the real estate properties in Italy, by
  • 13. Example business case #1 (cont’) Objective evaluation of the real estate properties in Italy, by Sample technical challenges Semantic data heterogeneity How to translate a point of interest into an OSM query? How to retrieve data from the whole Italy? Structural data heterogeneity How to compute indicators on different data structures? Messy data How to exclude from computation duplicated annotations of the same real-world entities?
  • 14. • Stakeholders: • Public administration (e.g. FEGA in Spain) • Farmers and land owners • Intermediaries (e.g. service providers) • Problems: • Unfair grant assignment and expenditure on audits • Incorrect grant assignments • Features defined subjectively Example business case #2 Common Agriculture Policy (CAP) funds assignments in Spain, by
  • 15. Cadaster Information Parcels and their features: surfaces, limits, slope…. EFAs & LEs Ecological focused areas and Landscape elements accurately defined using LIDAR Satellite Kind of crops, Health status, Set aside zones, Nitrogen fixing crops, CO2 fixing crops… Accurately defined CAP parameters objectively defined, Automated process to create new datasets related to CAP Funds, Less errors, Less audits and field visits… = CAP Funds ++ Fund assignment rules examples • Crop Diversification • Kind, density and surface of Ecological Focus Areas • Conditionality Example business case #2 (cont’) Common Agriculture Policy (CAP) funds assignments in Spain, by
  • 16. 4) There are patterns: Groups, lines, isolated trees, etc. 5) Trees in line, hedges Non-aligned groups, copses 6) A viewer 2) Classified points by their height 1) Raw datasets, just points 3) Points are grouped: Yellow (soil), Green (trees), Orange (bushes) Example business case #2 (cont’) Common Agriculture Policy (CAP) funds assignments in Spain, by
  • 17. Example business case #3 (cont’) Augment Reality (AR) for Property-related Data in Norway, by AR for buildings AR for underground infrastructure What’s the impact of a new building on its surroundings? Where are the underground pipes?
  • 20. • A hard copy of 314 pages and as a PDF file • 6 Person-Months • Data collection with spreadsheets • Quality assurance through e-mails and phone correspondence Pains: Time consuming, Poor data quality, Static report without live updating • Live service • Efficient sharing of data • Simplified integration with external datasets • Live updating • Reliable access • … • Risk and vulnerability analysis, e.g. buildings affected by flooding • Analysis of leasing prices Report Reporting Service 3rd party services Example business case #4 Reporting state-owned real estate properties in Norway, by
  • 22. DataGraft: Data Transformation and Knowledge Graph Publication Process • Interactive design of transformations • Repeatable transformations • Reuse/share transformations (user- based access) • Cloud-based deployment of transformations • Self-serviced process • Data and Transformation as-a-Service 22 Transform Generate RDF Ontology X Ontology X Ontology X Ontology mapping RDF Graph Raw Data Prepared Data Map Map Semantic graph database
  • 23. Geospatial Data is BIGthing Innovation with property-related data in proDataMarket