SlideShare a Scribd company logo
@MicahHerstand
INNOVATION IN MARKETING:
DATA SCIENCE & ENGINEERING
@MicahHerstand
Software Engineer, User Advocate,
Writer, Actor, Singer-Songwriter
@MICAHHERSTAND
ˈmikə
@MicahHerstand
“Marketing has become a
technology-powered discipline,
and therefore, 

marketing organizations must 

infuse technical capabilities 

into their DNA.”
~Scott Brinker, MarTech Conference Program Chair
@MicahHerstand
LESSON OBJECTIVES: Theory
Discover how data science enables marketing innovation
Measure, Metric, CSF, KPI
Customer segmentation
Big Data, Open Data, Linked Data
Growth Hacking
Ensure your org’s data engineering empowers marketers
Database
SQL (Relational), NoSQL (Document, Graph)
Data warehouse
@MicahHerstand
LESSON OBJECTIVES: Setup
Install a database manager
Sequel Pro for Mac
MySQL Workbench for Windows & Linux
Connect to your org’s database
Standard, SSH, SSL
Bookmark SQL helpers
SQLZoo (run SQL online!), Tutorials Point, Khan Academy
Google queries: “site:docs.oracle.com UNKNOWN TERM”
@MicahHerstand
LESSON OBJECTIVES: Practice SQL
Create a mental model for what it’s like to query SQL using English first
Acquire the vocabulary to understand a SQL query
Encounter example SQL queries and see their results
Practice your knowledge through exercises
@MicahHerstand
DATA SCIENCE
Measure, Metric, CSF, KPI
Customer segmentation
Big Data, Open Data, Linked Data
Growth Hacking
@MicahHerstand
DATA SCIENCE
@MicahHerstand
DATA SCIENCE: Measure, Metric, CSF, KPI
It’s the metrics, stupid!
“The price of light is less than the cost of darkness.”
~Arthur C. Nielsen, namesake of Nielsen TV ratings
“What gets measured, gets managed.”
“There is nothing so useless as doing efficiently that which should not be
done at all.”
"Management is doing things right; leadership is doing the right things."
~Peter Drucker, the founder of modern management
@MicahHerstand
DATA SCIENCE: Measure
Definition: Anything that can be measured
Caveat: Must be a single variable measure
E.g. № of users
E.g. № of active users
Challenges:
Definition of terms
E.g. Does account creation make someone a customer?
Measurement process
E.g. How frequently should data be collected?
@MicahHerstand
DATA SCIENCE: Metric
Definition: Value derived from 2+ measures
Metric selection: Efficiency vs effectiveness
E.g. cost of customer acquisition vs customer lifetime value
Analysis: Information vs insights
E.g. customer value vs value of customers acquired through LinkedIn
Optimization: Source vs campaign
E.g. customers w/ expired CC vs customer bounce rate when CC expired
Caution: Vanity, engagement, and benchmark metrics
E.g. Facebook Likes, Time on Page, DVD sales
@MicahHerstand
DATA SCIENCE: CSF (Critical Success Factor)
Definition: What is required to achieve business objectives.
E.g. acquire new customers
Prerequisites: Business objectives
E.g. to obtain 10% market share (BO), must acquire new customers (CSF)
More on CSFs: bit.ly/sidata-csf
@MicahHerstand
DATA SCIENCE: KPI (Key Performance Indicator)
Definition: A measurable value that demonstrates how effectively a
company is achieving key business objectives
E.g. cost per lead, customer lifetime value, traffic-to-lead ratio, retweets of
last ten tweets, landing page conversion rates
Prerequisites: Critical Success Factors
E.g. to acquire new customers (CSF), track those acquired per week (KPI)
Requisites: SMART (Specific, Measurable, Achievable, Relevant, Time)
E.g. weekly rate of customer acquisition
Caution: Perverse incentives and unintended consequences
E.g. referral programs to increase customer acquisition
More on KPIs: bit.ly/sidata-kpi
@MicahHerstand
DATA SCIENCE: CSFs vs KPIs
Graphic origin: bit.ly/sidata-kpi-vs-csf
@MicahHerstand
DATA SCIENCE: Prioritization
“Never confuse motion with action.” ~Benjamin Franklin
Graphic Origin: bit.ly/sidata-metrics-graphic
@MicahHerstand
DATA SCIENCE
Measure, Metric, CSF, KPI
Customer segmentation
Big Data, Open Data, Linked Data
Growth Hacking
@MicahHerstand
DATA SCIENCE: Customer Segmentation
@MicahHerstand
DATA SCIENCE: Customer Segmentation, 2.0
@MicahHerstand
DATA SCIENCE: Customer Segmentation, 3.0
@MicahHerstand
DATA SCIENCE: Customer Segmentation
Definition: the practice of dividing a customer base into groups of individuals
that are similar in specific ways relevant to marketing
E.g. SI grads, New Yorkers, users who have yet to purchase
Utility: One size does not fit all. Allows for novel KPIs.
Prerequisites: Business Objectives, Metrics
E.g. Want to gain 10% salon market (Biz Objective), while 25% of total
customers are men (metric), target men as it’s an under-saturated market
Types: A priori, Needs-based, and Value-based
Caution: Don’t break the law by targeting protected classes
E.g. AirBnb cannot offer Iranian-Americans discounts for Nowruz
More on KPIs: bit.ly/sidata-kpi
@MicahHerstand
DATA SCIENCE
Measure, Metric, CSF, KPI
Customer segmentation
Big Data, Open Data, Linked Data
Growth Hacking
@MicahHerstand
DATA SCIENCE: Big Data, Open Data, Linked Data
@MicahHerstand
DATA SCIENCE: Big Data, Open Data, Linked Data
"Big Data will spell the death of customer segmentation and force the
marketer to understand each customer as an individual.”
~Ginni Rometty, CEO, IBM
"Google only gives you answers for questions people have asked before.”
“A mark of a good site is realizing you're not the only site in the world.”
~Tim Berners-Lee, inventor of the World Wide Web
@MicahHerstand
DATA SCIENCE: Big Data
@MicahHerstand
DATA SCIENCE: Big Data
Definition: Data sets that are so large or complex that traditional data
processing applications are inadequate to deal with them.
Technical Challenges:
Volume (amount of data)
Velocity (speed of data in and out)
Variety (range of data types and sources)
Human Challenges:
No magic bullets, easy to overstate current capabilities
Novel Opportunities:
Real-time pricing, Sentiment analysis, Optimized offers
@MicahHerstand Designed by Forrester Research, accessed at bit.ly/sidata-bigdata
@MicahHerstand
Designed by Forrester Research, accessed at bit.ly/sidata-bigdata
@MicahHerstand
Designed by Forrester Research, accessed at bit.ly/sidata-bigdata
@MicahHerstand
DATA SCIENCE: Open Data
Definition: Data should be freely available to everyone to use and republish
as they wish, without restrictions from copyright, patents or other
mechanisms of control. “Free as in speech, not beer.”
E.g. data.gov, census.gov
Alternate Definition: Public or private data stores available for integration
into one’s own data system.
E.g. developer.nytimes.com, Thomson Reuters
Challenges:
Low cost, high quality, and large quantity—pick two
Data normalization (e.g. gender and sex, China bowls vs China country)
@MicahHerstand
DATA SCIENCE: Linked Data
Definition: A method of publishing structured data so that it can be
interlinked and become more useful through semantic queries.
E.g. Facebook’s Open Graph, Google Rich Snippets, Twitter Cards
Novelty: Data sources share schema so no middleware necessary
Challenges:
Comparatively few data sources
Data analysis tools less mature
Fewer trained developers
"Marketing department might want to dominate the Linked Data web.”
~Ralph Swick, COO of the W3C, organization responsible for World Wide Web standards
@MicahHerstand
DATA SCIENCE: Linked Data
"When companies post
data as Linked Data
they can be held
accountable. Regex
has [fuzzy]
responsibility.”
~Ralph Swick, COO of the W3C,
organization responsible for
World Wide Web’s technology
standards
Accessed March 8th, 2017
@MicahHerstand
DATA SCIENCE
Measure, Metric, CSF, KPI
Customer segmentation
Big Data, Open Data, Linked Data
Growth Hacking
@MicahHerstand
DATA SCIENCE: Growth Hacking
Graphic origin: bit.ly/sidata-gh-cartoon-2
@MicahHerstand
DATA SCIENCE: Growth Hacking
Graphic origin: bit.ly/sidata-gh-cartoon-3
@MicahHerstand
DATA SCIENCE: Growth Hacking
“Growth hackers are a hybrid of marketer and coder.”
“[Growth hacking] requires a blurring of lines between marketing, product,
and engineering, so that they work together to make the product market
itself.”
~Andrew Chen, Head of Rider Growth at Uber
“The true unicorns are those who can go end-to-end designing, building,
measuring, analyzing, and iterating with a combination of user intuition and
deep analytics.”
~Matt Humphrey, Sold his startup HomeRun for $100M+ after 18 months
@MicahHerstand
DATA SCIENCE: Growth Hacking
Definition: A process of rapid experimentation across marketing
channels and product development to identify the most effective, efficient
ways to grow a business.
E.g. Airbnb cross-listing on Craigslist
Novelty: Interdisciplinary skills and knowledge
Prerequisites: Interdisciplinary teams, acceptance of failure, outside-the-box
thinking
Requisites: Measurable, metric-based
@MicahHerstand
DATA SCIENCE: Growth Hacking Example
@MicahHerstand
DATA SCIENCE: Growth Hacking Example
@MicahHerstand
DATA ENGINEERING
Database
SQL (Relational)
NoSQL (Document, Graph)
Data warehouse
@MicahHerstand
DATA ENGINEERING: Database
@MicahHerstand
DATA ENGINEERING: Database
Definition: A collection of structured data, organized for rapid search by
an automated computer program.
Novelty: List or calculate data from various sources
E.g. How much revenue has been made by sales from customers whose
first visit was referred by a Facebook ad?
E.g. How many customers (who have made at least $100 in purchases
total) have used our referral program?
@MicahHerstand
DATA ENGINEERING: Database
Structured data
Primary
key
@MicahHerstand
DATA ENGINEERING: Levels of structure
Graphic Origin: http://5stardata.info/
@MicahHerstand
DATA ENGINEERING: Database Keys
@MicahHerstand
DATA ENGINEERING: Database Security
@MicahHerstand
DATA ENGINEERING
Database
SQL (Relational)
NoSQL (Document, Graph)
Data warehouse
@MicahHerstand
DATA ENGINEERING: Relational Database
Definition: A type of database that organizes data into tables (think
spreadsheet) and creates clearly defined relationships between those tables.
E.g. SQL (MySQL, PostgreSQL, SQLite, Oracle Database, MS SQL)
SQL is a programming language that lets people setup relational
database as well as add, update, delete, and lookup data within them.
Novelty: Up-front schema, data integrity checks, transactions.
E.g. ensure a movie cannot be added without an associated director
Challenges: Large datasets and an evolving schema are difficult to manage.
E.g. you want to track customers’ age, then decide not to, then decide to
track gender as a binary, then decide to make gender a free-text option…
bit.ly/sidata-sql-vs-nosql
@MicahHerstand
DATA ENGINEERING: Relational Database
Foreign
key
Movies
@MicahHerstand
DATA ENGINEERING: Relational Database
Movies
Directors
@MicahHerstand
DATA ENGINEERING
Database
SQL (Relational)
NoSQL (Document, Graph)
Data warehouse
@MicahHerstand
DATA ENGINEERING: NoSQL Databases
Definition: A database that is not a relational database. (NoSQL is colloquial
jargon, not a standard)
E.g. MongoDB, Redis, Couchbase, neo4j
Novelty: No schema required to store data. Easily scalable. Super fast
lookups.
E.g. easy to track customers’ age, then decide not to, then decide to track
gender as a binary, then decide to make gender a free-text option…
Challenges: Data integrity, stable transactions.
E.g. cannot ensure a director is always included when adding a movie
bit.ly/sidata-sql-vs-nosql
@MicahHerstand
DATA ENGINEERING
Database
SQL (Relational)
NoSQL (Document, Graph)
Data warehouse
@MicahHerstand
DATA ENGINEERING: Data warehouse
@MicahHerstand
DATA ENGINEERING: Data warehouse
Definition: a computer system optimized for analytical and informational
processing that is filled with data copied from both inside and outside the
enterprise
E.g. a database with both a sales table and a google analytics table and a
census table.
Novelty: analyze business data without affecting day-to-day operations
E.g. you want to see employee clock-in times without preventing them
from simultaneously clocking out.
Challenges: large overhead and maintenance costs without being necessary
@MicahHerstand
DATABASE SETUP
Database manager application
Database Connections
SQL Helpers to Bookmark
@MicahHerstand
DATABASE SETUP: DB Manager Application
Definition: A graphical user interface that simplifies database
interactions for developers
Examples:
Sequel Pro for Mac: bit.ly/sidata-mac
MySQL Workbench for Windows & Linux: bit.ly/sidata-not-mac
PHPMyAdmin for web access
@MicahHerstand
DATABASE SETUP
Database manager application
Database Connections
SQL Helpers to Bookmark
@MicahHerstand
DATABASE SETUP: Database connections
Unsecured Connections are often called “standard” and require no setup
besides the application you just downloaded
Secured Connections can use SSH or SSL and require additional
encryption technology to be installed on your computer.
Your company should have documentation on how to use these.
@MicahHerstand
DATABASE SETUP: DB Connection Info
Server: www.herstand.com
User: sistudents
Password: Hf68S9CpK67RUDV3
Database: simovies
Port: 3306 (default MySQL port)
@MicahHerstand
DATABASE SETUP
Database manager application
Database Connections
SQL Helpers to Bookmark
@MicahHerstand
DATABASE SETUP: SQL Helpers to Bookmark
Learn: TutorialsPoint.com, KhanAcademy.com
Play: SQLZoo.net (run SQL online!)
Cheatsheet: bit.ly/sidata-sql-cheat-sheet
Cheatsheet with examples: bit.ly/sidata-cheat-with-examples
RTFM: bit.ly/sidata-mysql-rtfm
@MicahHerstand
PRACTICE SQL
English queries
Vocabulary
Stock SQL queries
Exercises
@MicahHerstand
PRACTICE SQL: English queries
Questions SQL can answer: Who, What, Which, Where, When, How Many
E.g. Who directed the film Get Out?
E.g. Who acted in the film Get Out?
E.g. What films were released before Jan 1, 2000?
E.g. Where did the director of Get Out go to college?
E.g. Which colleges had the most graduates direct films since Jan 1, 2000.
E.g. When was Get Out released?
E.g. How many actors were in both Get Out and The West Wing?
@MicahHerstand
PRACTICE SQL
English queries
Vocabulary
Stock SQL queries
Exercises
@MicahHerstand
PRACTICE SQL: Vocabulary
Syntax
,
.
;
( )
“ ”
*
Verbs
SELECT
INSERT
UPDATE
DELETE
Query Parts
AS
FROM
WHERE
HAVING
ORDER BY
GROUP BY
Filters
LIKE
NOT
>
<
=
!=
>=
<=
AND
OR
IN
%
Sort
ASC
DESC
Aggregate
Functions
MIN,
MAX,
SUM,
AVG,
COUNT
Advanced
Functions
INNER JOIN
OUTER JOIN
REGEXP
@MicahHerstand
PRACTICE SQL
English queries
Vocabulary
Stock SQL queries
Exercises
@MicahHerstand
PRACTICE SQL: Anatomy of a Query
SELECT * FROM movies;
Result
@MicahHerstand
PRACTICE SQL: Anatomy of a Query
SELECT FROM movies WHERE ;
title AND release_date
title
COUNT(title) AS num_of_titles
title AND MIN(release_date)
title = “%Star Wars%”
release_date > ‘2000-1-1’
release_date > ‘2000-1-1’ AND
title = “%Star Wars%”
title = “Get Out”*
Result
@MicahHerstand
PRACTICE SQL: Anatomy of a Query
SELECT * FROM movies
GROUP BY release_date
titleORDER BY ASC
DESC
Result
;
@MicahHerstand
PRACTICE SQL
English queries
Vocabulary
Stock SQL queries
Exercises
@MicahHerstand
PRACTICE SQL: Exercises
List all movies and their average rating, the average column should
be called 'average'
List only the top-rated movie
List only the bottom-rated movie
Which user gives the highest rating on average?

More Related Content

PPTX
Social media analytics powered by data science
PDF
Dmc white paper data economics
PDF
TechConnectr's Big Data Connection. Digital Marketing KPIs, Targeting, Analy...
PDF
Where Data and Story Meet - Building the Data Storytelling Capability
PPTX
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
PDF
BigData & Supply Chain: A "Small" Introduction
PDF
IT and Procurement: Opportunities and Implementation of New Analytics Technol...
PDF
Evolution of Data Analytics: the past, the present and the future
Social media analytics powered by data science
Dmc white paper data economics
TechConnectr's Big Data Connection. Digital Marketing KPIs, Targeting, Analy...
Where Data and Story Meet - Building the Data Storytelling Capability
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
BigData & Supply Chain: A "Small" Introduction
IT and Procurement: Opportunities and Implementation of New Analytics Technol...
Evolution of Data Analytics: the past, the present and the future

What's hot (19)

PDF
Big Data: The Force That’s Good for Consumers and Society
PDF
[Webinar] High Speed Retail Analytics
PDF
Ecommerce website development company
PDF
Big Data - How Marketing Has Revolutionised - by Sean Singleton
PDF
Big Data for the Retail Business I Swan Insights I Solvay Business School
PDF
MIT report: How data analytics and machine learning reap competitive advantage.
PDF
Unlocking the Value of Big Data (Innovation Summit 2014)
PPTX
Securing the Right Metadata and Making it Work for You
PDF
Large Scale Data Analytics
PDF
Deriving Business Value from Big Data using Sentiment analysis
PPTX
What is big data
PPTX
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
PDF
Media futures launch august 16 2013
PDF
OSSC x CH Short
PDF
M2828_Marketing Analytics Brochure_5-26-2016.pdf
PDF
Data science market insights usa
PDF
Sales Growth: Find Big Growth in Big Data - Lattice Engines & McKinsey
PDF
Improve Customer Experiences With Big Data #DataTalk
PPTX
Teradata Integrated Web Intelligence
Big Data: The Force That’s Good for Consumers and Society
[Webinar] High Speed Retail Analytics
Ecommerce website development company
Big Data - How Marketing Has Revolutionised - by Sean Singleton
Big Data for the Retail Business I Swan Insights I Solvay Business School
MIT report: How data analytics and machine learning reap competitive advantage.
Unlocking the Value of Big Data (Innovation Summit 2014)
Securing the Right Metadata and Making it Work for You
Large Scale Data Analytics
Deriving Business Value from Big Data using Sentiment analysis
What is big data
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Media futures launch august 16 2013
OSSC x CH Short
M2828_Marketing Analytics Brochure_5-26-2016.pdf
Data science market insights usa
Sales Growth: Find Big Growth in Big Data - Lattice Engines & McKinsey
Improve Customer Experiences With Big Data #DataTalk
Teradata Integrated Web Intelligence
Ad

Viewers also liked (20)

KEY
The Next Decade in Web Design
PDF
Андрей Циликов, директор по развитию Sendsay
PPT
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
PPTX
Мастер-класс: Системное мышление
PDF
Юрий Войнлилов, Алена Нефедова. Личные роботы и генная инженерия: к каким инн...
PDF
Алеш Живкович. Университет Иннополис. "Оптимизация затрат на ИТ с помощью фре...
PPTX
Практики жизненного цикла систем машинного обучения
PDF
Progression art direction
PPTX
Конкуренция городов среди ИТ-специалистов
PDF
project presentation
PPT
Principles for knowledge engineering on the Web
PPT
The artof of knowledge engineering, or: knowledge engineering of art
PDF
Мониторинг рынка труда IT-специалистов 2016 от Яндекс
PDF
Презентация ИТ Кластер Сколково
PPT
Artificial Intelligence: Knowledge Engineering
PPT
Image quality
PDF
Лучшие мировые практики умных городов и их внедрение в российскую среду
PPTX
Data Engineering @ Patistic Innovations
PDF
Introduction on Data Science
PDF
7 key recipes for data engineering
The Next Decade in Web Design
Андрей Циликов, директор по развитию Sendsay
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Мастер-класс: Системное мышление
Юрий Войнлилов, Алена Нефедова. Личные роботы и генная инженерия: к каким инн...
Алеш Живкович. Университет Иннополис. "Оптимизация затрат на ИТ с помощью фре...
Практики жизненного цикла систем машинного обучения
Progression art direction
Конкуренция городов среди ИТ-специалистов
project presentation
Principles for knowledge engineering on the Web
The artof of knowledge engineering, or: knowledge engineering of art
Мониторинг рынка труда IT-специалистов 2016 от Яндекс
Презентация ИТ Кластер Сколково
Artificial Intelligence: Knowledge Engineering
Image quality
Лучшие мировые практики умных городов и их внедрение в российскую среду
Data Engineering @ Patistic Innovations
Introduction on Data Science
7 key recipes for data engineering
Ad

Similar to Data Science and Engineering for Marketers (20)

PPTX
What MBA Students Need to Know about CX, Data Science and Surveys
PPTX
Taming the Data Lake with Scalable Metrics Model Framework
PPTX
Big data for sales and marketing people
PDF
Transitioning to-lean-at-infochimps
PDF
Big Data, Analytics and Data Science
DOCX
Monitoring Analytics To Create Customer Value And Experience
PPTX
Taming data lake - scalable metrics model
PPSX
Intro to Data Science Big Data
PPTX
CMO Dashboard
PPTX
Big data and Marketing by Edward Chenard
PPTX
Big Data Marketing Analytics
PPTX
ETE 2013: Going Big with Big Data...one step at a time
PDF
360i Report: Big Data
PDF
Data Science Powerpoint Presentation Slides
PPTX
From Architecture to Analytics: A look at Simply Business’s data strategy
PDF
Big data unit i
PDF
Big Data and Internet of Things for Managers
PPTX
Big Data Meetup by Chad Richeson
PDF
Analytics 101 - How to build a data-driven organisation? - Rafał Małanij, Get...
PDF
Digi-Tech Marketing Data Strategy
What MBA Students Need to Know about CX, Data Science and Surveys
Taming the Data Lake with Scalable Metrics Model Framework
Big data for sales and marketing people
Transitioning to-lean-at-infochimps
Big Data, Analytics and Data Science
Monitoring Analytics To Create Customer Value And Experience
Taming data lake - scalable metrics model
Intro to Data Science Big Data
CMO Dashboard
Big data and Marketing by Edward Chenard
Big Data Marketing Analytics
ETE 2013: Going Big with Big Data...one step at a time
360i Report: Big Data
Data Science Powerpoint Presentation Slides
From Architecture to Analytics: A look at Simply Business’s data strategy
Big data unit i
Big Data and Internet of Things for Managers
Big Data Meetup by Chad Richeson
Analytics 101 - How to build a data-driven organisation? - Rafał Małanij, Get...
Digi-Tech Marketing Data Strategy

Recently uploaded (20)

PPTX
Fixing-AI-Hallucinations-The-NeuroRanktm-Approach.pptx
PDF
exceptionalinsights.group visitor traffic statistics 08-08-25
PPTX
Ranking a Webpage with SEO (And Tracking It with the Right Attribution Type a...
PDF
UNIT 2 - 5 DISTRIBUTION IN RURAL MARKETS.pdf
PDF
UNIT 1 -3 Factors Influencing RURAL CONSUMER BEHAVIOUR.pdf
PDF
How a Travel Company Can Implement Content Marketing
PDF
Hidden gems in Microsoft ads with Navah Hopkins
PPTX
Tea and different types of tea in India
PDF
Proven AI Visibility: From SEO Strategy To GEO Tactics
PDF
Digital Marketing Agency in Thrissur with Proven Strategies for Local Growth
PDF
Master Fullstack Development Course in Chennai – Enroll Now!
PDF
Wondershare Filmora Crack Free Download 2025
PDF
UNIT 1 -4 Profile of Rural Consumers (1).pdf
PPTX
Best Digital marketing service provider in Chandigarh.pptx
PDF
You Need SEO for Your Business. Here’s Why..pdf
PDF
5 free to use google tools to understand your customers online behavior in 20...
PPTX
UNIT 3 - 5 INDUSTRIAL PRICING.ppt x
PDF
Best digital marketing company in Mumbai
PDF
Digital Marketing - clear pictire of marketing
PDF
Mastering Bulk Email Campaign Optimization for 2025
Fixing-AI-Hallucinations-The-NeuroRanktm-Approach.pptx
exceptionalinsights.group visitor traffic statistics 08-08-25
Ranking a Webpage with SEO (And Tracking It with the Right Attribution Type a...
UNIT 2 - 5 DISTRIBUTION IN RURAL MARKETS.pdf
UNIT 1 -3 Factors Influencing RURAL CONSUMER BEHAVIOUR.pdf
How a Travel Company Can Implement Content Marketing
Hidden gems in Microsoft ads with Navah Hopkins
Tea and different types of tea in India
Proven AI Visibility: From SEO Strategy To GEO Tactics
Digital Marketing Agency in Thrissur with Proven Strategies for Local Growth
Master Fullstack Development Course in Chennai – Enroll Now!
Wondershare Filmora Crack Free Download 2025
UNIT 1 -4 Profile of Rural Consumers (1).pdf
Best Digital marketing service provider in Chandigarh.pptx
You Need SEO for Your Business. Here’s Why..pdf
5 free to use google tools to understand your customers online behavior in 20...
UNIT 3 - 5 INDUSTRIAL PRICING.ppt x
Best digital marketing company in Mumbai
Digital Marketing - clear pictire of marketing
Mastering Bulk Email Campaign Optimization for 2025

Data Science and Engineering for Marketers

  • 2. @MicahHerstand Software Engineer, User Advocate, Writer, Actor, Singer-Songwriter @MICAHHERSTAND ˈmikə
  • 3. @MicahHerstand “Marketing has become a technology-powered discipline, and therefore, 
 marketing organizations must 
 infuse technical capabilities 
 into their DNA.” ~Scott Brinker, MarTech Conference Program Chair
  • 4. @MicahHerstand LESSON OBJECTIVES: Theory Discover how data science enables marketing innovation Measure, Metric, CSF, KPI Customer segmentation Big Data, Open Data, Linked Data Growth Hacking Ensure your org’s data engineering empowers marketers Database SQL (Relational), NoSQL (Document, Graph) Data warehouse
  • 5. @MicahHerstand LESSON OBJECTIVES: Setup Install a database manager Sequel Pro for Mac MySQL Workbench for Windows & Linux Connect to your org’s database Standard, SSH, SSL Bookmark SQL helpers SQLZoo (run SQL online!), Tutorials Point, Khan Academy Google queries: “site:docs.oracle.com UNKNOWN TERM”
  • 6. @MicahHerstand LESSON OBJECTIVES: Practice SQL Create a mental model for what it’s like to query SQL using English first Acquire the vocabulary to understand a SQL query Encounter example SQL queries and see their results Practice your knowledge through exercises
  • 7. @MicahHerstand DATA SCIENCE Measure, Metric, CSF, KPI Customer segmentation Big Data, Open Data, Linked Data Growth Hacking
  • 9. @MicahHerstand DATA SCIENCE: Measure, Metric, CSF, KPI It’s the metrics, stupid! “The price of light is less than the cost of darkness.” ~Arthur C. Nielsen, namesake of Nielsen TV ratings “What gets measured, gets managed.” “There is nothing so useless as doing efficiently that which should not be done at all.” "Management is doing things right; leadership is doing the right things." ~Peter Drucker, the founder of modern management
  • 10. @MicahHerstand DATA SCIENCE: Measure Definition: Anything that can be measured Caveat: Must be a single variable measure E.g. № of users E.g. № of active users Challenges: Definition of terms E.g. Does account creation make someone a customer? Measurement process E.g. How frequently should data be collected?
  • 11. @MicahHerstand DATA SCIENCE: Metric Definition: Value derived from 2+ measures Metric selection: Efficiency vs effectiveness E.g. cost of customer acquisition vs customer lifetime value Analysis: Information vs insights E.g. customer value vs value of customers acquired through LinkedIn Optimization: Source vs campaign E.g. customers w/ expired CC vs customer bounce rate when CC expired Caution: Vanity, engagement, and benchmark metrics E.g. Facebook Likes, Time on Page, DVD sales
  • 12. @MicahHerstand DATA SCIENCE: CSF (Critical Success Factor) Definition: What is required to achieve business objectives. E.g. acquire new customers Prerequisites: Business objectives E.g. to obtain 10% market share (BO), must acquire new customers (CSF) More on CSFs: bit.ly/sidata-csf
  • 13. @MicahHerstand DATA SCIENCE: KPI (Key Performance Indicator) Definition: A measurable value that demonstrates how effectively a company is achieving key business objectives E.g. cost per lead, customer lifetime value, traffic-to-lead ratio, retweets of last ten tweets, landing page conversion rates Prerequisites: Critical Success Factors E.g. to acquire new customers (CSF), track those acquired per week (KPI) Requisites: SMART (Specific, Measurable, Achievable, Relevant, Time) E.g. weekly rate of customer acquisition Caution: Perverse incentives and unintended consequences E.g. referral programs to increase customer acquisition More on KPIs: bit.ly/sidata-kpi
  • 14. @MicahHerstand DATA SCIENCE: CSFs vs KPIs Graphic origin: bit.ly/sidata-kpi-vs-csf
  • 15. @MicahHerstand DATA SCIENCE: Prioritization “Never confuse motion with action.” ~Benjamin Franklin Graphic Origin: bit.ly/sidata-metrics-graphic
  • 16. @MicahHerstand DATA SCIENCE Measure, Metric, CSF, KPI Customer segmentation Big Data, Open Data, Linked Data Growth Hacking
  • 20. @MicahHerstand DATA SCIENCE: Customer Segmentation Definition: the practice of dividing a customer base into groups of individuals that are similar in specific ways relevant to marketing E.g. SI grads, New Yorkers, users who have yet to purchase Utility: One size does not fit all. Allows for novel KPIs. Prerequisites: Business Objectives, Metrics E.g. Want to gain 10% salon market (Biz Objective), while 25% of total customers are men (metric), target men as it’s an under-saturated market Types: A priori, Needs-based, and Value-based Caution: Don’t break the law by targeting protected classes E.g. AirBnb cannot offer Iranian-Americans discounts for Nowruz More on KPIs: bit.ly/sidata-kpi
  • 21. @MicahHerstand DATA SCIENCE Measure, Metric, CSF, KPI Customer segmentation Big Data, Open Data, Linked Data Growth Hacking
  • 22. @MicahHerstand DATA SCIENCE: Big Data, Open Data, Linked Data
  • 23. @MicahHerstand DATA SCIENCE: Big Data, Open Data, Linked Data "Big Data will spell the death of customer segmentation and force the marketer to understand each customer as an individual.” ~Ginni Rometty, CEO, IBM "Google only gives you answers for questions people have asked before.” “A mark of a good site is realizing you're not the only site in the world.” ~Tim Berners-Lee, inventor of the World Wide Web
  • 25. @MicahHerstand DATA SCIENCE: Big Data Definition: Data sets that are so large or complex that traditional data processing applications are inadequate to deal with them. Technical Challenges: Volume (amount of data) Velocity (speed of data in and out) Variety (range of data types and sources) Human Challenges: No magic bullets, easy to overstate current capabilities Novel Opportunities: Real-time pricing, Sentiment analysis, Optimized offers
  • 26. @MicahHerstand Designed by Forrester Research, accessed at bit.ly/sidata-bigdata
  • 27. @MicahHerstand Designed by Forrester Research, accessed at bit.ly/sidata-bigdata
  • 28. @MicahHerstand Designed by Forrester Research, accessed at bit.ly/sidata-bigdata
  • 29. @MicahHerstand DATA SCIENCE: Open Data Definition: Data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other mechanisms of control. “Free as in speech, not beer.” E.g. data.gov, census.gov Alternate Definition: Public or private data stores available for integration into one’s own data system. E.g. developer.nytimes.com, Thomson Reuters Challenges: Low cost, high quality, and large quantity—pick two Data normalization (e.g. gender and sex, China bowls vs China country)
  • 30. @MicahHerstand DATA SCIENCE: Linked Data Definition: A method of publishing structured data so that it can be interlinked and become more useful through semantic queries. E.g. Facebook’s Open Graph, Google Rich Snippets, Twitter Cards Novelty: Data sources share schema so no middleware necessary Challenges: Comparatively few data sources Data analysis tools less mature Fewer trained developers "Marketing department might want to dominate the Linked Data web.” ~Ralph Swick, COO of the W3C, organization responsible for World Wide Web standards
  • 31. @MicahHerstand DATA SCIENCE: Linked Data "When companies post data as Linked Data they can be held accountable. Regex has [fuzzy] responsibility.” ~Ralph Swick, COO of the W3C, organization responsible for World Wide Web’s technology standards Accessed March 8th, 2017
  • 32. @MicahHerstand DATA SCIENCE Measure, Metric, CSF, KPI Customer segmentation Big Data, Open Data, Linked Data Growth Hacking
  • 33. @MicahHerstand DATA SCIENCE: Growth Hacking Graphic origin: bit.ly/sidata-gh-cartoon-2
  • 34. @MicahHerstand DATA SCIENCE: Growth Hacking Graphic origin: bit.ly/sidata-gh-cartoon-3
  • 35. @MicahHerstand DATA SCIENCE: Growth Hacking “Growth hackers are a hybrid of marketer and coder.” “[Growth hacking] requires a blurring of lines between marketing, product, and engineering, so that they work together to make the product market itself.” ~Andrew Chen, Head of Rider Growth at Uber “The true unicorns are those who can go end-to-end designing, building, measuring, analyzing, and iterating with a combination of user intuition and deep analytics.” ~Matt Humphrey, Sold his startup HomeRun for $100M+ after 18 months
  • 36. @MicahHerstand DATA SCIENCE: Growth Hacking Definition: A process of rapid experimentation across marketing channels and product development to identify the most effective, efficient ways to grow a business. E.g. Airbnb cross-listing on Craigslist Novelty: Interdisciplinary skills and knowledge Prerequisites: Interdisciplinary teams, acceptance of failure, outside-the-box thinking Requisites: Measurable, metric-based
  • 41. @MicahHerstand DATA ENGINEERING: Database Definition: A collection of structured data, organized for rapid search by an automated computer program. Novelty: List or calculate data from various sources E.g. How much revenue has been made by sales from customers whose first visit was referred by a Facebook ad? E.g. How many customers (who have made at least $100 in purchases total) have used our referral program?
  • 43. @MicahHerstand DATA ENGINEERING: Levels of structure Graphic Origin: http://5stardata.info/
  • 47. @MicahHerstand DATA ENGINEERING: Relational Database Definition: A type of database that organizes data into tables (think spreadsheet) and creates clearly defined relationships between those tables. E.g. SQL (MySQL, PostgreSQL, SQLite, Oracle Database, MS SQL) SQL is a programming language that lets people setup relational database as well as add, update, delete, and lookup data within them. Novelty: Up-front schema, data integrity checks, transactions. E.g. ensure a movie cannot be added without an associated director Challenges: Large datasets and an evolving schema are difficult to manage. E.g. you want to track customers’ age, then decide not to, then decide to track gender as a binary, then decide to make gender a free-text option… bit.ly/sidata-sql-vs-nosql
  • 48. @MicahHerstand DATA ENGINEERING: Relational Database Foreign key Movies
  • 49. @MicahHerstand DATA ENGINEERING: Relational Database Movies Directors
  • 51. @MicahHerstand DATA ENGINEERING: NoSQL Databases Definition: A database that is not a relational database. (NoSQL is colloquial jargon, not a standard) E.g. MongoDB, Redis, Couchbase, neo4j Novelty: No schema required to store data. Easily scalable. Super fast lookups. E.g. easy to track customers’ age, then decide not to, then decide to track gender as a binary, then decide to make gender a free-text option… Challenges: Data integrity, stable transactions. E.g. cannot ensure a director is always included when adding a movie bit.ly/sidata-sql-vs-nosql
  • 54. @MicahHerstand DATA ENGINEERING: Data warehouse Definition: a computer system optimized for analytical and informational processing that is filled with data copied from both inside and outside the enterprise E.g. a database with both a sales table and a google analytics table and a census table. Novelty: analyze business data without affecting day-to-day operations E.g. you want to see employee clock-in times without preventing them from simultaneously clocking out. Challenges: large overhead and maintenance costs without being necessary
  • 55. @MicahHerstand DATABASE SETUP Database manager application Database Connections SQL Helpers to Bookmark
  • 56. @MicahHerstand DATABASE SETUP: DB Manager Application Definition: A graphical user interface that simplifies database interactions for developers Examples: Sequel Pro for Mac: bit.ly/sidata-mac MySQL Workbench for Windows & Linux: bit.ly/sidata-not-mac PHPMyAdmin for web access
  • 57. @MicahHerstand DATABASE SETUP Database manager application Database Connections SQL Helpers to Bookmark
  • 58. @MicahHerstand DATABASE SETUP: Database connections Unsecured Connections are often called “standard” and require no setup besides the application you just downloaded Secured Connections can use SSH or SSL and require additional encryption technology to be installed on your computer. Your company should have documentation on how to use these.
  • 59. @MicahHerstand DATABASE SETUP: DB Connection Info Server: www.herstand.com User: sistudents Password: Hf68S9CpK67RUDV3 Database: simovies Port: 3306 (default MySQL port)
  • 60. @MicahHerstand DATABASE SETUP Database manager application Database Connections SQL Helpers to Bookmark
  • 61. @MicahHerstand DATABASE SETUP: SQL Helpers to Bookmark Learn: TutorialsPoint.com, KhanAcademy.com Play: SQLZoo.net (run SQL online!) Cheatsheet: bit.ly/sidata-sql-cheat-sheet Cheatsheet with examples: bit.ly/sidata-cheat-with-examples RTFM: bit.ly/sidata-mysql-rtfm
  • 63. @MicahHerstand PRACTICE SQL: English queries Questions SQL can answer: Who, What, Which, Where, When, How Many E.g. Who directed the film Get Out? E.g. Who acted in the film Get Out? E.g. What films were released before Jan 1, 2000? E.g. Where did the director of Get Out go to college? E.g. Which colleges had the most graduates direct films since Jan 1, 2000. E.g. When was Get Out released? E.g. How many actors were in both Get Out and The West Wing?
  • 65. @MicahHerstand PRACTICE SQL: Vocabulary Syntax , . ; ( ) “ ” * Verbs SELECT INSERT UPDATE DELETE Query Parts AS FROM WHERE HAVING ORDER BY GROUP BY Filters LIKE NOT > < = != >= <= AND OR IN % Sort ASC DESC Aggregate Functions MIN, MAX, SUM, AVG, COUNT Advanced Functions INNER JOIN OUTER JOIN REGEXP
  • 67. @MicahHerstand PRACTICE SQL: Anatomy of a Query SELECT * FROM movies; Result
  • 68. @MicahHerstand PRACTICE SQL: Anatomy of a Query SELECT FROM movies WHERE ; title AND release_date title COUNT(title) AS num_of_titles title AND MIN(release_date) title = “%Star Wars%” release_date > ‘2000-1-1’ release_date > ‘2000-1-1’ AND title = “%Star Wars%” title = “Get Out”* Result
  • 69. @MicahHerstand PRACTICE SQL: Anatomy of a Query SELECT * FROM movies GROUP BY release_date titleORDER BY ASC DESC Result ;
  • 71. @MicahHerstand PRACTICE SQL: Exercises List all movies and their average rating, the average column should be called 'average' List only the top-rated movie List only the bottom-rated movie Which user gives the highest rating on average?