SlideShare a Scribd company logo
How to Manage Open Police
Data – Tips for Data QA/QC
and Automation
Alice Pence, City of Auburn
Robert Schultz, RCMP in BC
Cat Gracey, Safe Software
Dale Lutz, Safe Software
Who are we?
Connect Your Applications | Transform Your Data | Automate Your Workflows
Police Data Initiative
53 U.S. jurisdictions. 41 million people. 90 datasets. And growing!
www.publicsafetydataportal.org
Why Should Police Data be Shared?
● Increase transparency and build trust
● Help provide improved public services
● Fuel more intelligent policing decisions
The Ethics:
● Need to protect victim identity
● Only share when it’s lawful to do so
● Must be accurate, free of errors
Dallas Police Department Experience
(by the numbers)
12
Dallas Police Department Experience
(by the numbers)
45
Dallas Police Department Experience
(by the numbers)
67
Alice Pence
City of Auburn
Automating Crime Data
to Import into GIS
Alice Pence
GIS/Database Specialist
City of Auburn, WA
Quick Bio
• BS in Geography from University of Idaho w/Certificate in GIS
• In GIS field for 3+ years – been in Auburn, WA for over 1 year
Alice Pence
Our Crimes Database
• Stored in a Proprietary Linux system
• Raw data with only addresses
• Can’t directly dump into GIS- needs to have data transformed for public/internal viewing, QA/QC, a
spatial component to view on maps and applications, attributes renamed…
First impression…
Our Mission
How to view all this
data spatially for
analysis and reporting?
Our first idea…
A Python Script
The Good
• It worked. Can add new offenses, change and update as needed.
• Ran in windows scheduler to automate.
The Bad
• Long, cumbersome script.
• Needed multiple databases (.gbd → SDE) to clean and move data.
The Ugly
• Can’t easily change connection to Crimes Database, no direct connection to source (connected
through ArcCatalog -then built through Model Builder- then exported out as a Python Script).
• No custom alerts or in depth logging capabilities.
• Not user friendly for non-python users.
An easier
solution?
Our next idea…
FME!!
First Challenge
Python FME
This continues on
for a few more
pages…
How to rename and group 4 letter offense codes
Attribute Mapper
Transformer
• As an example: RKBK
Crime Type= Felony Crimes Against Person
Crime Name= Robbery
1 2
3
*double bonus:
I was able to
automatically import
codes by a CSV file,
AND…FME
automatically read in
all the source values
for me
The Rest of the Process
Into the GIS
Environment
FME → SDE → MXD → Web Map
Service → Online Application =
Reporting and Analysis
Wait- there’s
more…
Automating the
workbench and logging
too!
FME Server Set Up
Email notifications when Workbench
succeed or failed
Logging Capabilities (FME Jobs)
Main point: FME rocks
Comparison
Python
• Cumbersome/long script
• Not very easy to
customize alerts or logs
• Hard to edit- other users
might not know the
python language
• No friendly user interface
FME
• Easy to update codes and
field changes
• Other users can easily edit-
even w/minimal knowledge of
FME
• Share directly to FME Server
for logs and alerts
• Friendly user interface for
FME Server and easy to set up
Our Next Great
Idea…
Socrata Open Data
Portal
Open Data Portal: Crimes Dataset
Updated nightly with new data, thanks to
FME Desktop and FME Server
FME Workbench
API Download and
Export Tools
Custom Data Lenses
(aka Graphs and
Maps
Thank you!
Alice Pence
GIS/Database Specialist
City of Auburn, WA
apence@auburnwa.gov
Links to sites discussed in slides:
Socrata Open Data Portal: https://data.auburnwa.gov/
Auburn Crimes Map:
https://maps.auburnwa.gov/html5viewer/Index.html?
viewer=crimes
Q&AChat your questions into the GoToWebinar control panel
Robert Schultz
RCMP
Property Crime
Spatially Anonymized Property Crime
Property Crime – High Density
Property Crime – Low Density
Open Data in San Francisco
The new ETL job platform.
Background.
● City’s official open data portal is SF OpenData.
The Vision.
● Empower use of City data.
“Our vision is that the City’s data is understood, documented, and high quality.”
The Project: Increase number and timeliness of datasets
● Few workspace authors
● Safe place to try out workspaces
The Results.
● Improved ETL job platform, design and architecture
● 100-500 ETL jobs
● Simple ETL Jobs
● Use best-practices
View the original presentation
by Samuel Valdez and
Janine Heiser
(Presented at the FME World Tour 2016)
Open Data in NYC
GIS for public use.
Background.
● The Citywide Street Centerline Database (CSCL) is the authoritative source
for location data used by New York City's public safety agencies (NYPD,
FDNY) for 911 call-taking and dispatch.
The Vision.
Make CSCL data easily accessible for the day-to-day GIS
user.
View the original presentation
by Chris Rado
(Presented at the FME World Tour 2016)
The Results.
New version of CSCL created that is better suited for general GIS tasks
and made it available to all city agencies, to the public via
NYC Open Data and to Batman.
Q&AChat your questions into the GoToWebinar control panel
Thank you!
Live chat us at www.safe.com
Free Resources:
FME Desktop Training (June 7-8)
FME Server Training (June 14-15)
Blog post: Guide to Open Data:
Using it, Sharing it, and Creating a
Portal
Open Data Webinar
Open Data eBook

More Related Content

PDF
FME Powers CKAN Open Data Portal
PPTX
The Joys of APIs & FME - Integrating CAD documents to GIS from a Document Man...
PDF
Brewing the Ultimate Data Fusion
PDF
Rediscovering the Value of Apache Kafka® in Modern Data Architecture
PDF
Continuous Intelligence for Customer Service Using Kafka Event Streams | Simo...
PDF
Money Heist - A Stream Processing Original! | Meha Pandey and Shengze Yu, Net...
PPTX
Data Warehousing Patterns for Hadoop
PDF
How to Rebuild an End-to-End ML Pipeline with Databricks and Upwork with Than...
FME Powers CKAN Open Data Portal
The Joys of APIs & FME - Integrating CAD documents to GIS from a Document Man...
Brewing the Ultimate Data Fusion
Rediscovering the Value of Apache Kafka® in Modern Data Architecture
Continuous Intelligence for Customer Service Using Kafka Event Streams | Simo...
Money Heist - A Stream Processing Original! | Meha Pandey and Shengze Yu, Net...
Data Warehousing Patterns for Hadoop
How to Rebuild an End-to-End ML Pipeline with Databricks and Upwork with Than...

What's hot (20)

PDF
Time Series Analysis Using an Event Streaming Platform
PDF
The Lyft data platform: Now and in the future
PDF
Enterprise Metadata Integration
PDF
Spark at Airbnb
PDF
Hybrid Streaming Analytics for Apache Kafka Users | Firat Tekiner, Google
PDF
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
PDF
What does an event mean? Manage the meaning of your data! | Andreas Wombacher...
PDF
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
PDF
Field Notes from Expeditions in the Cloud-(Matt Wood, Amazon Web Services)
PDF
Event Broker (Kafka) in a Modern Data Architecture
PDF
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
PPTX
Spline: Data Lineage For Spark Structured Streaming
PPTX
The Stream is the Database - Revolutionizing Healthcare Data Architecture
PPTX
Flink Forward San Francisco 2018: Fabian Hueske & Timo Walther - "Why and how...
PDF
Shifting Data Science into High Gear
PDF
Cloud Experience: Data-driven Applications Made Simple and Fast
PDF
Empowering Real-Time Decision Making with Data Streaming
PDF
Lambda architecture for real time big data
PPTX
Analysis of data science software 2020
PDF
Multi Model Machine Learning by Maximo Gurmendez and Beth Logan
Time Series Analysis Using an Event Streaming Platform
The Lyft data platform: Now and in the future
Enterprise Metadata Integration
Spark at Airbnb
Hybrid Streaming Analytics for Apache Kafka Users | Firat Tekiner, Google
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
What does an event mean? Manage the meaning of your data! | Andreas Wombacher...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Field Notes from Expeditions in the Cloud-(Matt Wood, Amazon Web Services)
Event Broker (Kafka) in a Modern Data Architecture
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Spline: Data Lineage For Spark Structured Streaming
The Stream is the Database - Revolutionizing Healthcare Data Architecture
Flink Forward San Francisco 2018: Fabian Hueske & Timo Walther - "Why and how...
Shifting Data Science into High Gear
Cloud Experience: Data-driven Applications Made Simple and Fast
Empowering Real-Time Decision Making with Data Streaming
Lambda architecture for real time big data
Analysis of data science software 2020
Multi Model Machine Learning by Maximo Gurmendez and Beth Logan
Ad

Viewers also liked (17)

ODP
GIS based Incident Reporting System for Police Department
PDF
FME Extensive Usage Inside the Mapping Production System of Natural Resources...
PPTX
The Ultimate Geospatial Data Validation Checklist
PPTX
How to Master Raster (Faster) - Tips and Examples
PPTX
How to Load Data into Google Maps Engine
PPTX
Data Validation Victories: Tips for Better Data Quality
PDF
Transport for London - GIS
PDF
Deep Dive into FME Desktop 2017
PDF
Deep Dive into FME Server 2017.0
PPTX
Application of gis in urban traffic air quality
PPTX
Geographic information system(gis)
PDF
GIS for Urban Planning and Design
PPTX
Traffic light control using atmega16 ppt
PDF
What Is GIS?
PPT
GIS presentation
PPTX
1. STUDY ON QUALITY ASSURANCE AND QUALITY CONTROL MANAGEMENT SYSTEM IN 20 STO...
PPTX
Intelligent Traffic Light control using Embedded Systems
GIS based Incident Reporting System for Police Department
FME Extensive Usage Inside the Mapping Production System of Natural Resources...
The Ultimate Geospatial Data Validation Checklist
How to Master Raster (Faster) - Tips and Examples
How to Load Data into Google Maps Engine
Data Validation Victories: Tips for Better Data Quality
Transport for London - GIS
Deep Dive into FME Desktop 2017
Deep Dive into FME Server 2017.0
Application of gis in urban traffic air quality
Geographic information system(gis)
GIS for Urban Planning and Design
Traffic light control using atmega16 ppt
What Is GIS?
GIS presentation
1. STUDY ON QUALITY ASSURANCE AND QUALITY CONTROL MANAGEMENT SYSTEM IN 20 STO...
Intelligent Traffic Light control using Embedded Systems
Ad

Similar to How to Manage Open Police Data - Tips for Data QA/QC and Automation (20)

PPTX
Automating Crime Data to Import into GIS
PPTX
Fundamentalsof Crime Mapping 6
PDF
Analytic Information Data Exchange
PDF
Open Data in Trinidad and Tobago : presentation to civil society
PDF
Data-Driven Public Safety: Reliable Data When Every Second Counts
PDF
Achieve Your State Missions with Better Data
PDF
Getting Open Data Used
PDF
MADWD - Opendata in crime and justice
PPT
NTEN Webinar - Data Cleaning and Visualization Tools for Nonprofits
PPTX
Australian open data presentation v2.0
PPTX
The Real-time Police Force: Publishing Analytic Information to the Field with...
PDF
IRJET- Online Crime Reporting and Management System using Data Mining
KEY
Open Data - Past, Present and Future
PPTX
Open data presentation on tools and automation
PDF
IRJET - Reporting and Management System for Online Crime
PDF
Open Data Conference - Stuart Harrison - Practical examples of use of Open Data
PDF
Umhoefer: Data-driven enterprise - handout
PDF
Workshop: Open Data - What's the Point?
PDF
Stuart Harrison Open data - Under the hood
PPTX
Open data presentation 2014 v1.3 - Nov 2014
Automating Crime Data to Import into GIS
Fundamentalsof Crime Mapping 6
Analytic Information Data Exchange
Open Data in Trinidad and Tobago : presentation to civil society
Data-Driven Public Safety: Reliable Data When Every Second Counts
Achieve Your State Missions with Better Data
Getting Open Data Used
MADWD - Opendata in crime and justice
NTEN Webinar - Data Cleaning and Visualization Tools for Nonprofits
Australian open data presentation v2.0
The Real-time Police Force: Publishing Analytic Information to the Field with...
IRJET- Online Crime Reporting and Management System using Data Mining
Open Data - Past, Present and Future
Open data presentation on tools and automation
IRJET - Reporting and Management System for Online Crime
Open Data Conference - Stuart Harrison - Practical examples of use of Open Data
Umhoefer: Data-driven enterprise - handout
Workshop: Open Data - What's the Point?
Stuart Harrison Open data - Under the hood
Open data presentation 2014 v1.3 - Nov 2014

More from Safe Software (20)

PDF
Getting Started with Data Integration: FME Form 101
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
PDF
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
PDF
Notification System for Construction Logistics Application
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
PDF
FME in Overdrive - Peak of Data & AI 2025
PDF
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
PDF
Pipeline Industry IoT - Real Time Data Monitoring
PDF
FME in Overdrive: Unleashing the Power of Parallel Processing
PDF
Fiber to the People! By Deutsche Telekom
PDF
Governing Geospatial Data at Scale: Optimizing ArcGIS Online with FME in Envi...
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
PDF
Introducing and Operating FME Flow for Kubernetes in a Large Enterprise: Expe...
PDF
5 Things to Consider When Deploying AI in Your Enterprise
Getting Started with Data Integration: FME Form 101
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
Notification System for Construction Logistics Application
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Transforming Utility Networks: Large-scale Data Migrations with FME
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
FME in Overdrive - Peak of Data & AI 2025
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Pipeline Industry IoT - Real Time Data Monitoring
FME in Overdrive: Unleashing the Power of Parallel Processing
Fiber to the People! By Deutsche Telekom
Governing Geospatial Data at Scale: Optimizing ArcGIS Online with FME in Envi...
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Introducing and Operating FME Flow for Kubernetes in a Large Enterprise: Expe...
5 Things to Consider When Deploying AI in Your Enterprise

Recently uploaded (20)

PDF
Digital Strategies for Manufacturing Companies
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
medical staffing services at VALiNTRY
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
Designing Intelligence for the Shop Floor.pdf
PPTX
history of c programming in notes for students .pptx
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PPTX
ai tools demonstartion for schools and inter college
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PPTX
Introduction to Artificial Intelligence
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Digital Strategies for Manufacturing Companies
How to Migrate SBCGlobal Email to Yahoo Easily
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Reimagine Home Health with the Power of Agentic AI​
medical staffing services at VALiNTRY
Navsoft: AI-Powered Business Solutions & Custom Software Development
Internet Downloader Manager (IDM) Crack 6.42 Build 41
VVF-Customer-Presentation2025-Ver1.9.pptx
Designing Intelligence for the Shop Floor.pdf
history of c programming in notes for students .pptx
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
ai tools demonstartion for schools and inter college
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Introduction to Artificial Intelligence
Softaken Excel to vCard Converter Software.pdf
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
How to Choose the Right IT Partner for Your Business in Malaysia
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...

How to Manage Open Police Data - Tips for Data QA/QC and Automation

  • 1. How to Manage Open Police Data – Tips for Data QA/QC and Automation Alice Pence, City of Auburn Robert Schultz, RCMP in BC Cat Gracey, Safe Software Dale Lutz, Safe Software
  • 2. Who are we? Connect Your Applications | Transform Your Data | Automate Your Workflows
  • 3. Police Data Initiative 53 U.S. jurisdictions. 41 million people. 90 datasets. And growing! www.publicsafetydataportal.org
  • 4. Why Should Police Data be Shared? ● Increase transparency and build trust ● Help provide improved public services ● Fuel more intelligent policing decisions The Ethics: ● Need to protect victim identity ● Only share when it’s lawful to do so ● Must be accurate, free of errors
  • 5. Dallas Police Department Experience (by the numbers) 12
  • 6. Dallas Police Department Experience (by the numbers) 45
  • 7. Dallas Police Department Experience (by the numbers) 67
  • 9. Automating Crime Data to Import into GIS Alice Pence GIS/Database Specialist City of Auburn, WA
  • 10. Quick Bio • BS in Geography from University of Idaho w/Certificate in GIS • In GIS field for 3+ years – been in Auburn, WA for over 1 year Alice Pence
  • 11. Our Crimes Database • Stored in a Proprietary Linux system • Raw data with only addresses • Can’t directly dump into GIS- needs to have data transformed for public/internal viewing, QA/QC, a spatial component to view on maps and applications, attributes renamed…
  • 13. Our Mission How to view all this data spatially for analysis and reporting? Our first idea…
  • 14. A Python Script The Good • It worked. Can add new offenses, change and update as needed. • Ran in windows scheduler to automate. The Bad • Long, cumbersome script. • Needed multiple databases (.gbd → SDE) to clean and move data. The Ugly • Can’t easily change connection to Crimes Database, no direct connection to source (connected through ArcCatalog -then built through Model Builder- then exported out as a Python Script). • No custom alerts or in depth logging capabilities. • Not user friendly for non-python users.
  • 16. FME!!
  • 17. First Challenge Python FME This continues on for a few more pages… How to rename and group 4 letter offense codes
  • 18. Attribute Mapper Transformer • As an example: RKBK Crime Type= Felony Crimes Against Person Crime Name= Robbery 1 2 3 *double bonus: I was able to automatically import codes by a CSV file, AND…FME automatically read in all the source values for me
  • 19. The Rest of the Process
  • 20. Into the GIS Environment FME → SDE → MXD → Web Map Service → Online Application = Reporting and Analysis
  • 22. FME Server Set Up Email notifications when Workbench succeed or failed
  • 25. Comparison Python • Cumbersome/long script • Not very easy to customize alerts or logs • Hard to edit- other users might not know the python language • No friendly user interface FME • Easy to update codes and field changes • Other users can easily edit- even w/minimal knowledge of FME • Share directly to FME Server for logs and alerts • Friendly user interface for FME Server and easy to set up
  • 26. Our Next Great Idea… Socrata Open Data Portal
  • 27. Open Data Portal: Crimes Dataset Updated nightly with new data, thanks to FME Desktop and FME Server
  • 29. API Download and Export Tools Custom Data Lenses (aka Graphs and Maps
  • 30. Thank you! Alice Pence GIS/Database Specialist City of Auburn, WA apence@auburnwa.gov Links to sites discussed in slides: Socrata Open Data Portal: https://data.auburnwa.gov/ Auburn Crimes Map: https://maps.auburnwa.gov/html5viewer/Index.html? viewer=crimes
  • 31. Q&AChat your questions into the GoToWebinar control panel
  • 35. Property Crime – High Density
  • 36. Property Crime – Low Density
  • 37. Open Data in San Francisco The new ETL job platform.
  • 38. Background. ● City’s official open data portal is SF OpenData. The Vision. ● Empower use of City data. “Our vision is that the City’s data is understood, documented, and high quality.”
  • 39. The Project: Increase number and timeliness of datasets ● Few workspace authors ● Safe place to try out workspaces The Results. ● Improved ETL job platform, design and architecture ● 100-500 ETL jobs ● Simple ETL Jobs ● Use best-practices View the original presentation by Samuel Valdez and Janine Heiser (Presented at the FME World Tour 2016)
  • 40. Open Data in NYC GIS for public use.
  • 41. Background. ● The Citywide Street Centerline Database (CSCL) is the authoritative source for location data used by New York City's public safety agencies (NYPD, FDNY) for 911 call-taking and dispatch. The Vision. Make CSCL data easily accessible for the day-to-day GIS user. View the original presentation by Chris Rado (Presented at the FME World Tour 2016)
  • 42. The Results. New version of CSCL created that is better suited for general GIS tasks and made it available to all city agencies, to the public via NYC Open Data and to Batman.
  • 43. Q&AChat your questions into the GoToWebinar control panel
  • 44. Thank you! Live chat us at www.safe.com Free Resources: FME Desktop Training (June 7-8) FME Server Training (June 14-15) Blog post: Guide to Open Data: Using it, Sharing it, and Creating a Portal Open Data Webinar Open Data eBook