Reaktor
Mannerheimintie 2
00100, Helsinki Finland
tel: +358 9 4152 0200
www.reaktor.com
info@reaktor.com
Confidential
©2015 Reaktor
All rights reserved
Garbage in, garbage out: Data quality in a TMS world
Simo Ahava
Senior Data Advocate, Reaktor
Google Developer Expert, Google Analytics
Blogger, developer, www.simoahava.com
Twitter-er, @SimoAhava
Google+:er, +SimoAhava
Data quality isn’t fixed. Depending on the hypothesis, a single data set can shift from useless to incredibly insightful without a single datum changing shape, size, form, or function.
#1 Data is subjective
Plug-and-play Analytics
@SimoAhava from @ReaktorNow | #SPWK | 5 Feb 2016
Data quality isn’t acquired — it’s earned.
from online-behavior.com
Claim 1: Data quality is destroyed by laziness and lack of ambition.
Claim 2: A TMS empowers developers more than others.
The root of all evil
The "project"
Your organization is creating absurd amounts of data with every passing second, and it’s very difficult to adapt to the fluctuations without an agile, process-driven mindset.
#2 Data is a process
The project is often a series of handovers, breeding non-involvement.
Specification Implementation Analysis Results
Business owner
Marketing
Developer
This necessarily leads to silos, which have entry and exit conditions.
Implementation Analysis Results
Business owner
Marketing
Developer
Specification
Silos, so what?
As long as the work gets done, right?
Data is the lifeblood of the organization. It flows through all departments, across job titles, permeating the very fabric of the organization, reinforcing its foundations for growth.
#3 Data abhors silos
Do these sound familiar:
Monthly reports that lack relevance, are rife with generic suggestions that lack research in the context of your business, and reiterate the previous month’s points even when there are solid reasons why they weren’t addressed.
Ridiculously ugly and ineffective JavaScript hacks for measurement points which should be tackled in the Data Layer.
Hiding behind data, and passing blame to other silos.
"Could someone fix the Bounce Rate metric on our site?"
Analytics feature requests are deprioritized, and deployed extremely infrequently.
"Fix transactionRevenue to show revenue, not customer weight."
Communication is difficult due to the overhead of meeting face-to-face, project plans are set in stone during sales, and it’s difficult to change existing project goals or set new ones due to consultants being hired as "extra pairs of hands" rather than advisors.
These are symptoms of data being treated as a project outcome.
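To make the "JavaScript hacks" symptom concrete, here is a minimal sketch contrasting a scraping hack with a structured push. Only the key name transactionRevenue comes from the slides; the order object, the rendered text, the event name, and both helper functions are hypothetical, and a plain array stands in for the page's dataLayer so the sketch runs outside a browser.

```javascript
// Hypothetical order, as the application itself already knows it.
const order = { id: 'T-1001', revenue: 1299.9, currency: 'EUR' };

// Stand-in for the page's global dataLayer array (assumption for this sketch).
const dataLayer = [];

// The hack: recover revenue by parsing text that was rendered for humans.
function scrapeRevenue(renderedText) {
  const match = renderedText.match(/([\d.,]+)/);
  return match ? parseFloat(match[1].replace(',', '.')) : undefined;
}

// The alternative: the application pushes the value it already holds.
function pushPurchase(layer, o) {
  layer.push({
    event: 'purchase',                          // hypothetical event name
    transactionId: o.id,
    transactionRevenue: o.revenue.toFixed(2),   // uniform two-decimal string
    currency: o.currency
  });
}

// The regex stops at the thousands separator, so this yields 1, not 1299.9.
const scraped = scrapeRevenue('Total: 1 299,90 €');

pushPurchase(dataLayer, order);
```

The scraped value depends on locale formatting and markup that nobody promised to keep stable, whereas the pushed object depends only on a key vocabulary that developers and analysts have agreed on.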
Cure I: The Data Layer
Using technology to solve communication problems
Typically, there are three definitions of Data Layer that we use in the digital world.
SuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS World
1. Set of business requirements for tracking digital assets, visits, and visitors.
2. Encoded, global data structure, accessed and modified by connected platforms.
3. Data model of a connected platform, which copies or digests information in the global structure.
// The global data structure
dataLayer.push({
  'pageType' : 'home'
});
// The data model of a connected platform (here, Google Tag Manager)
google_tag_manager['GTM-123'].dataLayer.set('pageType', 'home');
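The difference between the global structure and a platform's own data model can be sketched with a toy stand-in. This is not Google Tag Manager's implementation; createPlatformModel and its digest, get, and set methods are hypothetical names chosen for illustration, and a plain array plays the role of the global dataLayer.

```javascript
// The global structure: a plain array that any platform may read or push to.
const dataLayer = [];

// Toy stand-in for a connected platform's internal data model (hypothetical).
function createPlatformModel(globalLayer) {
  const model = {};
  let cursor = 0; // index of the next message not yet digested

  return {
    // Copy every unread message from the global structure into the model.
    digest() {
      for (; cursor < globalLayer.length; cursor++) {
        Object.assign(model, globalLayer[cursor]);
      }
    },
    get(key) {
      return model[key];
    },
    // Changes the platform's copy only; the global structure is untouched.
    set(key, value) {
      model[key] = value;
    }
  };
}

dataLayer.push({ pageType: 'home' });

const platform = createPlatformModel(dataLayer);
platform.digest();
platform.set('pageType', 'landing');
```

After the last call the platform's model and the global structure disagree about pageType, which is why it matters which of the two a team means when it says "Data Layer".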
Across all three definitions, the purpose of a Data Layer is simple:
[Diagram labels: DMP / DWH / TMS / etc.; Actions; Presentation; Data Layer]
The purpose of a Data Layer is to provide a bilateral layer on the digital
asset, which decouples, normalises, and uniformly encodes semantic
information passed through and stored within.
The Data Layer is a joint venture, where people and systems communicate across silos.
DMP / DWH / TMS / etc.
let tracker = GANTracker.sharedTracker()
tracker.trackEvent("revenue", action:"Q1", value:"15678000")
tracker.trackEvent("revenue", action:"Q2", value:"16888000")
tracker.trackEvent("revenue", action:"Q3", value:"15991000")
tracker.trackEvent("revenue", action:"Q4", value:"19133000")

rq12014,rq22014,rq32014,rq42015
15677998,16887988,15990988,19133400

analytics.collect({
  'revenueQ1' : '15677998.00',
  'revenueQ2' : '16887988.00',
  'revenueQ3' : '15990988.00',
  'revenueQ4' : '19133400.00'
})
DMP / DWH / TMS / etc.
let dataLayer = new Array()
dataLayer.push({
  "revenue_Q1_2014" : "15677998.00",
  "revenue_Q2_2014" : "16887988.00",
  "revenue_Q3_2014" : "15990988.00",
  "revenue_Q4_2014" : "19133400.00"
})
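One way to get from a platform-specific payload to the uniform encoding above is a small normalising step. The sketch below assumes the payload looks like the analytics.collect() call in the slides and that the year is supplied separately; normaliseRevenue is a hypothetical helper, not part of any library.

```javascript
// Payload shaped like the analytics.collect() call in the slides.
const collected = {
  revenueQ1: '15677998.00',
  revenueQ2: '16887988.00',
  revenueQ3: '15990988.00',
  revenueQ4: '19133400.00'
};

// Hypothetical helper: re-encode keys as revenue_Q<n>_<year> and
// values as two-decimal strings.
function normaliseRevenue(payload, year) {
  const normalised = {};
  for (const [key, value] of Object.entries(payload)) {
    const match = /^revenueQ([1-4])$/.exec(key);
    if (!match) continue; // skip keys outside the agreed vocabulary
    normalised[`revenue_Q${match[1]}_${year}`] = Number(value).toFixed(2);
  }
  return normalised;
}

const dataLayer = [];
dataLayer.push(normaliseRevenue(collected, 2014));
```

Normalising fixes the encoding only. If a source reports a wrong figure or a mislabelled period, the Data Layer carries that error forward in a tidier format, so the values still have to be agreed on across silos.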
Cure II: The Process
Involve, involve, involve
An iterative, agile process is necessary for optimal utilization of a TMS.
Definition of Done
Sprint: Developed features do not impede measurement. Developed features are trackable.
Feature: If necessary, feature is encoded with tracking attributes. If necessary, feature is linked to a Data Layer object.
Task: Attribute syntax is correct for tracking. Data Layer object syntax is correct.
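The task-level check that "Data Layer object syntax is correct" can be automated, for instance as part of a test suite. The following is a minimal sketch under stated assumptions: the schema, its two keys, and validateDataLayerObject are all hypothetical, and a real project would derive the rules from its own tracking specification.

```javascript
// Hypothetical schema: each required key maps to a predicate for its value.
const schema = {
  pageType: (v) => typeof v === 'string' && v.length > 0,
  transactionRevenue: (v) => typeof v === 'string' && /^\d+\.\d{2}$/.test(v)
};

// Return a list of human-readable problems; an empty list means the object passes.
function validateDataLayerObject(obj, rules) {
  const errors = [];
  for (const [key, isValid] of Object.entries(rules)) {
    if (!(key in obj)) {
      errors.push(`missing key: ${key}`);
    } else if (!isValid(obj[key])) {
      errors.push(`invalid value for ${key}`);
    }
  }
  return errors;
}

const good = validateDataLayerObject(
  { pageType: 'home', transactionRevenue: '129.90' },
  schema
);
const bad = validateDataLayerObject(
  { pageType: 'home', transactionRevenue: 72.5 }, // a number, not a price string
  schema
);
```

Here good is an empty list and bad contains a single entry for transactionRevenue. Running a check like this in the sprint's test suite turns the Definition of Done from a promise into something a build can fail on.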
Constant participation
Transparency
Cure III: Empowerment
We are all hybrid beings
The entire life cycle of a single data point, from collection to reports, requires knowledge and expertise to manage.
#4 Data is difficult
Developer facilitation is crucial to data quality and optimized data collection.
1: Education
1. JavaScript: www.codecademy.com, www.codeschool.com, Professional JavaScript for Web Developers, DOM Enlightenment…
2. Digital analytics: www.kaushik.net, www.simoahava.com, Successful Analytics, Practical Google Analytics and Google Tag Manager for Developers…
3. Training, courses, certifications: Digital Analytics Association, Digital Analytics Fundamentals (Google), Market Motive…
4. Conferences: MeasureCamp, SMX, eMetrics, Digital Analytics Hub, ConversionXL, Superweek, All Things Data…
2: Hybrid skills
"Business owner"
- No operational skills
+ Strategic

"Developer"
- Uncooperative
+ Methodical

"Marketer"
- Bully
+ Consultative

+ Passionate, actively interested
+ Understands ever-changing requirements
+ Good grasp of digital tech
+ Statistical mindset
+ Knows the product / service inside and out
+ Critical about the present, curious about the future
3: Passionate interest
+ Dedicated sandbox
+ Website or blog to test new ideas on
+ Test and debug setups in Google Analytics and Google Tag Manager
+ Utilization of GTM environments
Hire to educate, not to delegate
PO
Developer Analyst
Data is difficult
Data quality is earned, not acquired
Thank you!
simo.ahava@reaktor.com
www.simoahava.com
Twitter: @SimoAhava
Google+: +SimoAhava
Further reading:
Data is difficult - http://goo.gl/53aFUU
The Schema Conspiracy - http://goo.gl/o2Pwys
10 Truths About Data - http://goo.gl/EpesEj
