Paul Shapiro | @fighto
SEO Automation to Increase
Efficiency and Effectiveness
Working Smarter
Why Automate?
1. Work faster and free up time for the important stuff
2. Look at more data
3. Improve consistency and procedure
What Can You Automate?
• If you’re doing something on a routine basis, it can probably be automated (but that doesn’t mean it should be).
• Any procedure that can be broken down into smaller micro-tasks that can be handled by a computer.
• Machine learning can help with more complex decision making (think of the power of AlphaGo).
How Do You Automate?
How to Conceptualize Automation:
Break into Micro-tasks
How to Work with APIs
API Endpoint:
http://api.grepwords.com/lookup?apikey=random_string&q=keyword
Simple API key authentication via GET request
• apikey=random_string: string is unique to you (authentication)
• q=keyword: variable, changes and is often looped
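The looping idea can be sketched in a few lines of Python. This is only an illustration: the endpoint comes from the slide, while `API_KEY`, `build_lookup_url` and the keyword list are placeholder names and values, not part of the original deck.

```python
from urllib.parse import urlencode

# Endpoint taken from the slide; the key below is a placeholder.
ENDPOINT = "http://api.grepwords.com/lookup"
API_KEY = "random_string"  # hypothetical: replace with your own key


def build_lookup_url(keyword, api_key=API_KEY):
    """Return the GET URL for a single keyword lookup."""
    return ENDPOINT + "?" + urlencode({"apikey": api_key, "q": keyword})


# Illustrative keywords: the apikey stays fixed while q changes on every pass.
keywords = ["board games", "card games", "dice games"]
urls = [build_lookup_url(keyword) for keyword in keywords]
for url in urls:
    print(url)
```

Each printed URL could then be requested in turn and the responses parsed one by one.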
How to Work with APIs
Simple API key authentication via GET request
http://api.grepwords.com/lookup?apikey=secret&q=board+games
Output (JSON):
[{"keyword":"board games",
  "updated_cpc":"2018-04-30","updated_cmp":"2018-04-30",
  "updated_lms":"2018-04-30","updated_history":"2018-04-30",
  "lms":246000,"ams":246000,"gms":246000,
  "competition":0.86204091185173,"competetion":0.86204091185173,
  "cmp":0.86204091185173,"cpc":0.5,
  "m1":201000,"m1_month":"2018-02","m2":246000,"m2_month":"2018-01",
  "m3":450000,"m3_month":"2017-12","m4":368000,"m4_month":"2017-11",
  "m5":201000,"m5_month":"2017-10","m6":201000,"m6_month":"2017-09",
  "m7":201000,"m7_month":"2017-08","m8":201000,"m8_month":"2017-07",
  "m9":201000,"m9_month":"2017-06","m10":201000,"m10_month":"2017-05",
  "m11":201000,"m11_month":"2017-04","m12":201000,"m12_month":"2017-03"}]
How to Work with APIs
Most API Outputs:
1. JSON
2. XML
3. CSV
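As a rough illustration of handling all three formats, the sketch below parses the same record from JSON, XML and CSV using only the Python standard library. The sample strings and the function names (`from_json`, `from_xml`, `from_csv`) are made up for this example and do not represent real API output.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# The same illustrative record in three formats (sample data only).
json_text = '[{"keyword": "board games", "gms": 246000}]'
xml_text = (
    "<results><result><keyword>board games</keyword>"
    "<gms>246000</gms></result></results>"
)
csv_text = "keyword,gms\nboard games,246000\n"


def from_json(text):
    """Parse a JSON array of objects."""
    return [{"keyword": r["keyword"], "gms": int(r["gms"])} for r in json.loads(text)]


def from_xml(text):
    """Parse <result> elements that contain <keyword> and <gms> children."""
    root = ET.fromstring(text)
    return [
        {"keyword": node.findtext("keyword"), "gms": int(node.findtext("gms"))}
        for node in root.iter("result")
    ]


def from_csv(text):
    """Parse CSV text with a header row."""
    reader = csv.DictReader(io.StringIO(text))
    return [{"keyword": r["keyword"], "gms": int(r["gms"])} for r in reader]


rows = from_json(json_text)
print(rows == from_xml(xml_text) == from_csv(csv_text))
```

Whatever the format, the goal is the same: turn the response into plain lists and dictionaries that the rest of the script can work with.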
How to Work with APIs
Last Step:
Parse it!
How to Work with APIs
Parsing Example Using Python:
import json
# 1. The raw API response, stored as a string
json_string = (
    '[{"keyword":"board games",'
    '"updated_cpc":"2018-04-30","updated_cmp":"2018-04-30",'
    '"updated_lms":"2018-04-30","updated_history":"2018-04-30",'
    '"lms":246000,"ams":246000,"gms":246000,'
    '"competition":0.86204091185173,"competetion":0.86204091185173,'
    '"cmp":0.86204091185173,"cpc":0.5,'
    '"m1":201000,"m1_month":"2018-02","m2":246000,"m2_month":"2018-01",'
    '"m3":450000,"m3_month":"2017-12","m4":368000,"m4_month":"2017-11",'
    '"m5":201000,"m5_month":"2017-10","m6":201000,"m6_month":"2017-09",'
    '"m7":201000,"m7_month":"2017-08","m8":201000,"m8_month":"2017-07",'
    '"m9":201000,"m9_month":"2017-06","m10":201000,"m10_month":"2017-05",'
    '"m11":201000,"m11_month":"2017-04","m12":201000,"m12_month":"2017-03"}]'
)
# 2. Convert the JSON string into Python lists and dictionaries
parsed_json = json.loads(json_string)
# 3. Read the "gms" value from the first result
print(parsed_json[0]['gms'])
How to Work with APIs
Full Python Script:
import requests
import json
# Request the keyword data (replace secretapikey with your own key)
r = requests.get('http://api.grepwords.com/lookup?apikey=secretapikey&q=board+games')
# Parse the JSON response and print the "gms" value of the first result
parsed_json = json.loads(r.text)
print(parsed_json[0]['gms'])
How to Work with APIs
Output
Tools for Automation
https://www.knime.org
Why KNIME?
• Fast way to put together complex
analyses
• Great for prototyping
• Large library of built-in “nodes”
• Free/Open Source
• Run on Windows/Mac/Linux
• Very expandable – even compatible
with R, Python, Java, JavaScript
• Easy enough for non-technical staff
to grasp
Data Agility
Other Options
• Scripting Languages
• Python
• Ruby
• Node.js
• Go
• R
• Excel with VBA
• Google Sheets
Cron & Windows Task Scheduler
are Your Friend
What is Cron and Why?
• *NIX system daemon used to schedule tasks and scripts.
• Windows Task Scheduler is the Windows equivalent of Cron.
• This way we can schedule scripts and programs that perform automated tasks on a recurring basis.
Quick How To
* * * * * command /to/execute
Fields, left to right:
1. Minute (0-59)
2. Hour (0-23)
3. Day of Month (1-31)
4. Month (1-12)
5. Day of Week (0-6) (Sunday = 0)
Run at Midnight on the First of Every Month
0 0 1 * * python datacollector.py
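To make the field order concrete, here is a small Python sketch that splits a crontab line into its five schedule fields and the command. The helper name `split_cron_line` and the field labels are illustrative choices, not part of cron itself.

```python
FIELD_NAMES = ["minute", "hour", "day_of_month", "month", "day_of_week"]


def split_cron_line(line):
    """Split a crontab line into its five schedule fields and the command."""
    parts = line.split(None, 5)  # at most five splits, so the command stays whole
    if len(parts) < 6:
        raise ValueError("expected five schedule fields followed by a command")
    schedule = dict(zip(FIELD_NAMES, parts[:5]))
    return schedule, parts[5]


schedule, command = split_cron_line("0 0 1 * * python datacollector.py")
print(schedule)
print(command)
```

Read this way, the example line means minute 0, hour 0, on day 1 of the month, in any month and on any day of the week.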
The Basics of KNIME
What is a Node?
• Nodes are prebuilt, drag-and-drop modules designed to perform a single task
• Nodes are strung together like a chain to accomplish larger, more complex tasks
• Nodes can be grouped together into “meta-nodes”, which can be configured in unison
How Do You Add Nodes &
How Do They Connect?
How do you add nodes to your “workflow”?
How do you string nodes together?
How Do You Configure & Run Nodes?
Configuring Nodes
Running Workflows
Keyword Research
Most Keyword Research Looks Like This
Typical Time Investment for Keyword Research
Hours to Complete Keyword Research by Site Size

Site Size                  Average (Low End)   Average (High End)
Micro (0-49 pages)         5                   6
Small (50-99 pages)        8                   10
Medium (100-249 pages)     12                  18
Large (250-499 pages)      21                  22
Extra Large (>500 pages)   21                  28
Size of the Data Set
vs.
Seed Keywords
- List
- GWMT
- SEMRush Comp. KWs
- SQR Keywords
Keyword Planner Suggestions (via GrepWords) + Google Autocomplete
Semantic Keyword Recommendations (via MarketMuse) + Google Autocomplete
SEMRush Domain vs. Domain Keywords + Google Autocomplete
One Big Keyword List
Filtering + Data Manipulations
Data Manipulations / Calculations
Organic Competition
• Get top 10 results from a rank checking API (e.g., GetSTAT)
• Use Moz API nodes and find average PA to assess competitiveness.
• Optionally, use SEMRush’s Keyword Difficulty API
Search Volume
• Get search volumes via SEMRush API or via GrepWords API
Keyword Trends
• Use 2 years of Google Trends data to calculate slope and determine growing/declining keywords
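The slope calculation for keyword trends can be sketched with an ordinary least-squares fit against the time index. This is a minimal illustration with made-up interest values; the function names and the `tolerance` parameter are assumptions for this example, and the deck itself does this step inside KNIME.

```python
def trend_slope(values):
    """Least-squares slope of values against their index (0, 1, 2, ...)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two data points")
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(values))
    denominator = sum((x - mean_x) ** 2 for x in range(n))
    return numerator / denominator


def classify_trend(values, tolerance=0.0):
    """Label a series as growing, declining or flat based on its slope."""
    slope = trend_slope(values)
    if slope > tolerance:
        return "growing"
    if slope < -tolerance:
        return "declining"
    return "flat"


# Illustrative interest values, oldest first (not real Google Trends data).
print(classify_trend([40, 42, 45, 50, 55, 61]))
print(classify_trend([80, 74, 70, 63, 60, 52]))
```

A positive slope marks a growing keyword and a negative slope a declining one; a small non-zero `tolerance` would treat nearly flat series as flat.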
String ‘em All Together and then…
Visualize
This top-right quadrant contains
keywords with:
• Low competition
• Good growth
Larger bubbles show higher
search volumes.
You can alternatively use
current rank on the x-axis to
signal organic market share like
a traditional growth-share
matrix.
Option: Tableau
Option: Python + Bokeh Charts in KNIME
Sample Python Code:
http://pshapi.ro/SLBokeh
Learn More…
SearchLove Presentation about Automating Keyword Research:
http://pshapi.ro/2idcqYF
BrightonSEO Presentation about Semantic Keyword Research:
https://searchwilderness.com/semantic-keyword-research/
What to Automate
for SEO?
Reporting
Data Collection
Search Console
Schedule to run monthly with Cron and back up to a SQL database:
https://searchwilderness.com/gwmt-data-python/
JR Oakes’ BigQuery vision:
http://pshapi.ro/2vmjDe8
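The backup step might look roughly like the sketch below, which uses SQLite from the Python standard library as a stand-in for whatever SQL database you run. The table name, column layout and sample row are assumptions for illustration; this is not the linked script.

```python
import sqlite3


def save_rows(conn, month, rows):
    """Store one month of query/page rows; re-running a month overwrites it."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS search_console (
               month TEXT, query TEXT, page TEXT,
               clicks INTEGER, impressions INTEGER, ctr REAL, position REAL,
               PRIMARY KEY (month, query, page))"""
    )
    conn.executemany(
        "INSERT OR REPLACE INTO search_console VALUES (?, ?, ?, ?, ?, ?, ?)",
        [
            (month, r["query"], r["page"], r["clicks"],
             r["impressions"], r["ctr"], r["position"])
            for r in rows
        ],
    )
    conn.commit()


# Illustrative row; in practice the rows come from your Search Console export.
sample_rows = [
    {"query": "board games", "page": "https://example.com/games",
     "clicks": 12, "impressions": 340, "ctr": 0.035, "position": 4.2},
]

conn = sqlite3.connect(":memory:")  # use a file path to keep the backup
save_rows(conn, "2018-04", sample_rows)
save_rows(conn, "2018-04", sample_rows)  # a second run does not duplicate rows
row_count = conn.execute("SELECT COUNT(*) FROM search_console").fetchone()[0]
print(row_count)
```

Keying the table on month, query and page means a cron job that accidentally runs twice leaves the data unchanged.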
301 Redirect Mapping
from Old URLs
1. Current site: Crawl Current Site, Download Rendered Pages, Extract Main Content (BoilerPipe), Convert to Bitvector
2. Old URLs: Get Historic URLs from Wayback Machine API, Filter Out URLs found on Current Site, Grab Rendered Page from Wayback Machine, Extract Main Content (BoilerPipe), Convert to Bitvector
3. Match: Cosine Similarity, Generate .htaccess strings
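The matching step can be sketched in plain Python, assuming each page has already been reduced to a bit vector of equal length. The paths, vectors, `threshold` value and function names below are illustrative assumptions; the deck builds this workflow in KNIME.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def redirect_lines(old_pages, current_pages, threshold=0.5):
    """Map each old path to its most similar current path as an .htaccess line."""
    lines = []
    for old_path, old_vector in old_pages.items():
        best_path, best_score = None, 0.0
        for new_path, new_vector in current_pages.items():
            score = cosine_similarity(old_vector, new_vector)
            if score > best_score:
                best_path, best_score = new_path, score
        # Skip old pages with no sufficiently similar match (hypothetical cutoff).
        if best_path is not None and best_score >= threshold:
            lines.append(f"Redirect 301 {old_path} {best_path}")
    return lines


# Illustrative bit vectors: 1 means the term is present in the page's main content.
old_pages = {"/old-chess": [1, 1, 0, 0], "/old-misc": [0, 0, 0, 1]}
current_pages = {"/games/chess": [1, 1, 1, 0], "/games/dice": [0, 0, 1, 0]}
lines = redirect_lines(old_pages, current_pages)
print("\n".join(lines))
```

Old pages without a close enough match are left out so they can be reviewed by hand instead of being redirected somewhere irrelevant.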
KNIME Toolkit
SERP Similarity / Overlap
1. Download ranking data via STAT API
2. Compare results from 1-10 for each query against results from 1-10 for every other query.
3. Calculate percent similarity.
4. Schedule checks and examine what changed.
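Steps 2 and 3 can be sketched as a pairwise comparison of result sets. The deck does not define percent similarity precisely, so this example assumes it means shared URLs divided by the size of the larger result list; the queries and URLs are made up.

```python
from itertools import combinations


def percent_similarity(urls_a, urls_b):
    """Share of URLs that two result lists have in common, as a percentage."""
    set_a, set_b = set(urls_a), set(urls_b)
    if not set_a or not set_b:
        return 0.0
    return 100.0 * len(set_a & set_b) / max(len(set_a), len(set_b))


def pairwise_similarity(serps):
    """Compare every query's results against every other query's results."""
    return {
        (query_a, query_b): percent_similarity(serps[query_a], serps[query_b])
        for query_a, query_b in combinations(sorted(serps), 2)
    }


# Illustrative results (shortened to four URLs per query for readability).
serps = {
    "board games": ["a.com", "b.com", "c.com", "d.com"],
    "best board games": ["a.com", "b.com", "c.com", "e.com"],
    "chess rules": ["x.com", "y.com", "z.com", "w.com"],
}
similarities = pairwise_similarity(serps)
for pair, score in similarities.items():
    print(pair, score)
```

Storing these scores on each scheduled run makes it easy to see which query pairs moved closer together or further apart.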
Test JavaScript Rendering
http://pshapi.ro/puppetcrawl
Performance Testing with
Lighthouse
http://pshapi.ro/perfpony
Anomaly Detection
CTR
1. Data collection: We collect data on query, page and associated metrics via the Google Search Console Search Analytics API.
2. Round average position: We round average position to one decimal place (e.g., 1.19 is rounded to 1.2).
3. Math: We identify outliers using a combination of statistical methods (modified z-score, IQR).
4. Email: If any negative outliers are identified for a query and page combination at an average position, an email identifying all of this data is sent to each of the SEOs assigned to the account to investigate.
5. Scheduling: Set your script to run on a recurring basis.
My SEL Article: http://pshapi.ro/2Ae2LYP
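The math in step 3 can be sketched with the Python standard library (3.8 or later for `statistics.quantiles`). Several choices here are assumptions rather than the author's exact method: the 3.5 cutoff for the modified z-score is a commonly used convention, the 1.5 multiplier is the usual IQR fence, and "combination" is taken to mean flagged by both tests. The CTR values are made up.

```python
import statistics


def modified_z_scores(values):
    """Modified z-score: 0.6745 * (x - median) / median absolute deviation."""
    median = statistics.median(values)
    mad = statistics.median([abs(v - median) for v in values])
    if mad == 0:
        return [0.0 for _ in values]
    return [0.6745 * (v - median) / mad for v in values]


def iqr_bounds(values, k=1.5):
    """Lower and upper fences at k * IQR beyond the first and third quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def negative_outliers(ctrs, z_cutoff=-3.5):
    """Indexes of values flagged as low outliers by both methods."""
    lower, _ = iqr_bounds(ctrs)
    scores = modified_z_scores(ctrs)
    return [
        i for i, (value, z) in enumerate(zip(ctrs, scores))
        if z < z_cutoff and value < lower
    ]


# Illustrative CTRs for query/page pairs that share one rounded average position.
ctrs = [0.30, 0.31, 0.29, 0.32, 0.30, 0.28, 0.31, 0.05]
flagged = negative_outliers(ctrs)
print(flagged)
```

Only low outliers are reported, since the goal is to catch pages whose CTR is unusually poor for their position.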
Content Ideas
Reddit Data Mining
Reddit Data Mining: Python Script
https://searchwilderness.com/reddit-python-code/
1. Enter filename for output
2. Enter a search or series of searches
3. Choose reddit sorting method. For this purpose,
choose ‘new’
4. Choose to look at all of reddit, or isolate to
particular subreddit(s).
5. Schedule with cron to find new topic ideas on a
recurring basis.
AMP Validation
Bulk Check AMP Pages with AMPBench API
Python Script:
http://pshapi.ro/2AHlNaE
Requires:
• Python
• Requests package
Ideally AMPBench would run locally, but it can be run from the appspot demo URL.
Link Building
Link Building: Prospecting with Competitors
Scheduled use of Ahrefs API
http://apiv2.ahrefs.com/?from=backlinks_new_lost&limit=10&target=competitor.com&where=type:%22new%22,date:%222017-06-01%22&mode=domain&output=json&token=your_personal_api_key
Parsed Results + SMTP = Link Opportunities
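For a scheduled job, the request URL can be rebuilt on each run with a fresh date. The sketch below only assembles the URL from the parameters shown on the slide; the function name `new_backlinks_url` is hypothetical, and fetching, parsing and emailing the results are left out.

```python
from urllib.parse import quote, urlencode

BASE_URL = "http://apiv2.ahrefs.com/"


def new_backlinks_url(target, date, token, limit=10):
    """Build the new-backlinks request URL for one competitor domain."""
    params = {
        "from": "backlinks_new_lost",
        "limit": limit,
        "target": target,
        "where": f'type:"new",date:"{date}"',
        "mode": "domain",
        "output": "json",
        "token": token,
    }
    # Keep ":" and "," unescaped so the where clause matches the slide's format.
    return BASE_URL + "?" + urlencode(params, quote_via=quote, safe=":,")


url = new_backlinks_url("competitor.com", "2017-06-01", "your_personal_api_key")
print(url)
```

A scheduled script would pass in the current date and loop over a list of competitor domains.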
Tech Audit Related Site Changes
• Detect new 404s and other errors
• New redirects
• Changes to robots.txt
• Content-based changes over time
• Indexation changes
• New pages created
• Changes in rank or traffic (rank checking API or Search Console)
Use a cloud crawler like Botify/Deepcrawl with an API, or a custom solution
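A few of these checks reduce to comparing two crawl snapshots. The sketch below assumes each crawl has been exported as a mapping of URL to HTTP status code; the function name `diff_crawls`, the status list and the sample data are illustrative.

```python
REDIRECT_CODES = (301, 302, 307, 308)


def diff_crawls(previous, current):
    """Compare two crawl snapshots that map URL -> HTTP status code."""
    return {
        "new_pages": sorted(url for url in current if url not in previous),
        "new_404s": sorted(
            url for url, status in current.items()
            if status == 404 and previous.get(url) != 404
        ),
        "new_redirects": sorted(
            url for url, status in current.items()
            if status in REDIRECT_CODES and previous.get(url) not in REDIRECT_CODES
        ),
    }


# Illustrative snapshots from two scheduled crawls.
previous = {"/": 200, "/a": 200, "/b": 200}
current = {"/": 200, "/a": 404, "/b": 301, "/c": 200}
changes = diff_crawls(previous, current)
print(changes)
```

The resulting dictionary can feed an alert email whenever any of its lists is non-empty.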
Custom Solution
1. Run Screaming Frog in the cloud with lots of RAM:
• Amazon AWS: http://ipullrank.com/how-to-run-screaming-frog-and-url-profiler-on-amazon-web-services/
• Google Cloud: https://online.marketing/guide/screaming-frog-in-google-cloud/
2. Activate with the command line and Task Scheduler for scheduling
3. Use a macro program like RoboTask to generate reports and send them to a particular folder
4. Download via FTP or dump to a SQL database for analysis
5. Analysis produces alerts with SMTP
Meta Description Creation
Process for Semi-Automated Meta Descriptions
1. Download webpage body contents
2. Run through text summarization engine(s) to produce small snippets of important page text
3. Have a person edit to avoid truncation and improve language
https://searchengineland.com/reducing-the-time-it-takes-to-write-meta-descriptions-for-large-websites-299887
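To give a feel for step 2, here is a deliberately naive extractive summarizer: it scores sentences by word frequency and keeps the best ones that fit a length budget. It is a stand-in for a real summarization engine, not the method from the linked article, and the 155-character limit, the word-length filter and the sample text are all assumptions.

```python
import re
from collections import Counter


def summarize(text, max_chars=155):
    """Pick the highest-scoring sentences that fit within max_chars."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    words = re.findall(r"[a-z']+", text.lower())
    # Crude stop-word filter: ignore very short words when counting frequency.
    frequency = Counter(word for word in words if len(word) > 3)

    def score(sentence):
        tokens = re.findall(r"[a-z']+", sentence.lower())
        return sum(frequency[token] for token in tokens) / max(len(tokens), 1)

    chosen = []
    for sentence in sorted(sentences, key=score, reverse=True):
        if len(" ".join(chosen + [sentence])) <= max_chars:
            chosen.append(sentence)
    chosen.sort(key=sentences.index)  # restore the original sentence order
    return " ".join(chosen)


# Illustrative page text.
page_text = (
    "Board games bring people together. "
    "Our store stocks hundreds of board games for families and serious players. "
    "Shipping is free on large orders. "
    "Contact us with any questions."
)
snippet = summarize(page_text)
print(snippet)
```

The output is only a draft; the human edit in step 3 is still what makes it read well.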
Alt Attribute Creation
Recognize Images
• You can use custom machine learning options (https://www.tensorflow.org/tutorials/image_recognition), but it’s easier and more effective to use an API in this context.
• For APIs, you have options:
• Microsoft Computer Vision: https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/
• Google Cloud Vision API: https://cloud.google.com/vision/
• CloudSight: https://cloudsight.ai/
1. Download all images without an alt attribute
2. Run them through the API and get a caption (not perfect, but better than nothing)
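Finding the images for step 1 can be sketched with the standard library's HTML parser. The class name `MissingAltFinder` and the sample markup are illustrative; downloading the files and calling a captioning API are left out, since those depend on the service you pick.

```python
from html.parser import HTMLParser


class MissingAltFinder(HTMLParser):
    """Collect the src of every <img> tag that has no alt attribute."""

    def __init__(self):
        super().__init__()
        self.missing = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attributes = dict(attrs)
        if "alt" not in attributes and attributes.get("src"):
            self.missing.append(attributes["src"])


# Illustrative markup: one image with alt text and two without.
html = (
    '<p><img src="/a.png" alt="A chess board">'
    '<img src="/b.png"><img src="/c.png" /></p>'
)
finder = MissingAltFinder()
finder.feed(html)
print(finder.missing)
```

Each collected `src` can then be downloaded and sent to the captioning API of your choice.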
A Word of Caution…
https://xkcd.com/1319/
CatalystDigital.com
Paul Shapiro
https://searchwilderness.com
@fighto
Thanks!

More Related Content

PPTX
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul Shapiro
PPTX
SearchLove Boston 2016 | Paul Shapiro | How to Automate Your Keyword Research
PPTX
MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...
PPTX
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile Landscape
PPTX
Cloud Security Monitoring and Spark Analytics
PPTX
MnSearch Summit 2018 - Rob Ousbey – The Evolution of SEO: Split-Testing for S...
PDF
TechSEO Boost 2018: Programming Basics for SEOs
PPTX
Advanced Technical SEO in 2020 - Data Science
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul Shapiro
SearchLove Boston 2016 | Paul Shapiro | How to Automate Your Keyword Research
MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile Landscape
Cloud Security Monitoring and Spark Analytics
MnSearch Summit 2018 - Rob Ousbey – The Evolution of SEO: Split-Testing for S...
TechSEO Boost 2018: Programming Basics for SEOs
Advanced Technical SEO in 2020 - Data Science

What's hot (20)

PDF
NLP for SEO
PPTX
TechSEO Boost 2017: Working Smarter: SEO Automation to Increase Efficiency & ...
PDF
What I Learned Building a Toy Example to Crawl & Render like Google
PPTX
TechSEO Boost 2017: SEO Best Practices for JavaScript T-Based Websites
PDF
Tutorial on-python-programming
PPTX
TechSEO Boost 2017: Fun with Machine Learning: How Machine Learning is Shapin...
PPTX
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...
PPTX
TechSEO Boost 2017: Making the Web Fast
PPTX
Pragmatic REST: recent trends in API design
PPTX
PyCon India 2012: Rapid development of website search in python
PPTX
TechSEO Boost 2017: The State of Technical SEO
PDF
Hacking RSS: Filtering & Processing Obscene Amounts of Information (short ve...
PPTX
TechSEO Boost 2018: The Statelessness of Technical SEO
PDF
SXSW Hacking RSS: Filtering & Processing Obscene Amounts of Information
PPTX
Tackling Python: How It Can Help With Technical SEO | Pint Sized Meetup Janua...
PPTX
Redefining technical SEO & how we should be thinking about it as an industry ...
PDF
Relevancy hacks for eCommerce
PPTX
The Power of Python :: How It Can Help With Technical SEO | Bristol SEO May 2...
PDF
Elasticsearch at Automattic
PPTX
Python For Technical SEO | Women In Tech SEO Festival March 2020 | Ruth Everett
NLP for SEO
TechSEO Boost 2017: Working Smarter: SEO Automation to Increase Efficiency & ...
What I Learned Building a Toy Example to Crawl & Render like Google
TechSEO Boost 2017: SEO Best Practices for JavaScript T-Based Websites
Tutorial on-python-programming
TechSEO Boost 2017: Fun with Machine Learning: How Machine Learning is Shapin...
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...
TechSEO Boost 2017: Making the Web Fast
Pragmatic REST: recent trends in API design
PyCon India 2012: Rapid development of website search in python
TechSEO Boost 2017: The State of Technical SEO
Hacking RSS: Filtering & Processing Obscene Amounts of Information (short ve...
TechSEO Boost 2018: The Statelessness of Technical SEO
SXSW Hacking RSS: Filtering & Processing Obscene Amounts of Information
Tackling Python: How It Can Help With Technical SEO | Pint Sized Meetup Janua...
Redefining technical SEO & how we should be thinking about it as an industry ...
Relevancy hacks for eCommerce
The Power of Python :: How It Can Help With Technical SEO | Bristol SEO May 2...
Elasticsearch at Automattic
Python For Technical SEO | Women In Tech SEO Festival March 2020 | Ruth Everett
Ad

Similar to Working Smarter: SEO Automation to Increase Efficiency and Effectiveness - Paul Shapiro (20)

PDF
Yahoo! BOSS API external 20091001
PDF
Y Boss External 20091017
PPTX
Redefining Technical SEO - Paul Shapiro at MozCon 2019
PPTX
Redefining Technical SEO, #MozCon 2019 by Paul Shapiro
PPTX
Fundamentals Of Search
PDF
AI assisted testing using postman and openAI.pdf
PPTX
Global Search Strategy
PDF
Dev Dives: Leverage APIs and Gen AI to power automations for RPA and software...
PDF
Developing Brilliant and Powerful APIs in Ruby & Python
PPTX
Boss hack u-iit-madras-2012
PPTX
Tackling Python: What is it and How Can it Help with Technical SEO? | TechSEO...
PDF
Top 13 web scraping tools in 2022
PPTX
Using Python and Data Science Practices in SEO Analysis of Data
PDF
Advanced data-driven technical SEO - SMX London 2019
PPTX
Measuring Search Engine Quality using Spark and Python
PDF
Selecting the right process for robotic process automation (rpa)
PDF
Productionizing Data Science at Experience
PDF
Serving ML easily with FastAPI - meme version
PDF
INTERFACE by apidays 2023 - APIs with bounded contexts, Jose Haro Peralta, mi...
PPTX
Bill Hartzer - Advanced SEO Audits - Pubcon Las Vegas 2018
Yahoo! BOSS API external 20091001
Y Boss External 20091017
Redefining Technical SEO - Paul Shapiro at MozCon 2019
Redefining Technical SEO, #MozCon 2019 by Paul Shapiro
Fundamentals Of Search
AI assisted testing using postman and openAI.pdf
Global Search Strategy
Dev Dives: Leverage APIs and Gen AI to power automations for RPA and software...
Developing Brilliant and Powerful APIs in Ruby & Python
Boss hack u-iit-madras-2012
Tackling Python: What is it and How Can it Help with Technical SEO? | TechSEO...
Top 13 web scraping tools in 2022
Using Python and Data Science Practices in SEO Analysis of Data
Advanced data-driven technical SEO - SMX London 2019
Measuring Search Engine Quality using Spark and Python
Selecting the right process for robotic process automation (rpa)
Productionizing Data Science at Experience
Serving ML easily with FastAPI - meme version
INTERFACE by apidays 2023 - APIs with bounded contexts, Jose Haro Peralta, mi...
Bill Hartzer - Advanced SEO Audits - Pubcon Las Vegas 2018
Ad

More from State of Search Conference (20)

PPTX
The Whale And a Phoenix: A true story of agency near death and rebirth - Stev...
PPT
Reputation Management: How Brands Get Tarnished and Trashed - Simon Heseltine
PPTX
The Future: Optimize for Trust - Aaron Weiche
PPTX
Link Building Without Pitching and Building Tons of Content - Adam Riemer
PPTX
When Good PPC Goes Bad - Kirk Williams
PDF
Data-Driven Attribution Under the Hood - Simon Poulton
PDF
Automated Bidding: Deriving Smart Insights - Sergio Cano
PPTX
Cousin Eddie's Guide to Winning at Local SEO in 2019 - Greg Gifford
PPTX
Artificial Intelligence and Machine Learning in PPC - David Szetela
PPTX
Advanced SEO Issues - Rob Woods
PPTX
Behind the Algorithm: Back Stage Pass - Ryan Jones
PDF
The Conversion Disconnect: Why Ad Platform Data Misguides You in Lead Gen - E...
PPTX
Next Level SEO - Jenny Halasz
PPTX
Brand Experience Optimization - The Future of Online Visibility is Building a...
PPTX
Little Budgets with Big Profits - Navah Hopkins
PPTX
Go Beyond Revenue: Solutions for Profit-Driven Marketing in GA - Mike Ulrich
PDF
7 Strategies You Won't Hear from Your Facebook Rep - Tommy Swanson
PPTX
Content Marketing Excellence: The Road to Content Marketing Greatness - Eric ...
PDF
Client Services & Agency Operations - Brandy Eddings
PPTX
Fulfilling the Mad Men Promise of Creative - Michelle Stinson-Ross
The Whale And a Phoenix: A true story of agency near death and rebirth - Stev...
Reputation Management: How Brands Get Tarnished and Trashed - Simon Heseltine
The Future: Optimize for Trust - Aaron Weiche
Link Building Without Pitching and Building Tons of Content - Adam Riemer
When Good PPC Goes Bad - Kirk Williams
Data-Driven Attribution Under the Hood - Simon Poulton
Automated Bidding: Deriving Smart Insights - Sergio Cano
Cousin Eddie's Guide to Winning at Local SEO in 2019 - Greg Gifford
Artificial Intelligence and Machine Learning in PPC - David Szetela
Advanced SEO Issues - Rob Woods
Behind the Algorithm: Back Stage Pass - Ryan Jones
The Conversion Disconnect: Why Ad Platform Data Misguides You in Lead Gen - E...
Next Level SEO - Jenny Halasz
Brand Experience Optimization - The Future of Online Visibility is Building a...
Little Budgets with Big Profits - Navah Hopkins
Go Beyond Revenue: Solutions for Profit-Driven Marketing in GA - Mike Ulrich
7 Strategies You Won't Hear from Your Facebook Rep - Tommy Swanson
Content Marketing Excellence: The Road to Content Marketing Greatness - Eric ...
Client Services & Agency Operations - Brandy Eddings
Fulfilling the Mad Men Promise of Creative - Michelle Stinson-Ross

Recently uploaded (20)

PPTX
Ipsos+Protocols+Playbook+V1.2+(DEC2024)+final+IntClientUseOnly.pptx
PDF
AFCAT Syllabus 2026 Guide by Best Defence Academy in Lucknow.pdf
PDF
How the Minnesota Vikings Used Community to Drive 170% Growth and Acquire 34K...
PDF
Master Fullstack Development Course in Chennai – Enroll Now!
PDF
EVOLUTION OF RURAL MARKETING IN INDIAN CIVILIZATION
PPTX
Mastering eCommerce SEO: Strategies to Boost Traffic and Maximize Conversions
PPTX
Assignment 2 Task 1 - How Consumers Use Technology and Its Impact on Their Lives
PPTX
Final Project parkville.............pptx
PPTX
Kimberly Crossland Storytelling Marketing Class 5stars.pptx
PPTX
Presentation - MindfulHeal Digital Ayurveda GTM & Marketing Plan.pptx
PDF
Modernizing IT for the age of AI - Jason Aloia, Freshworks
PDF
E_Book_Customer_Relation_Management_0.pdf
PDF
AI & Automation: The Future of Marketing or the End of Creativity - Matthew W...
PPTX
Fixing-AI-Hallucinations-The-NeuroRanktm-Approach.pptx
PDF
Mastering Bulk Email Campaign Optimization for 2025
PDF
20K Btc Enabled Cash App Accounts – Safe, Fast, Verified.pdf
PDF
Wondershare Filmora Crack Free Download 2025
PPTX
Sumit Saxena IIM J Project Market segmentation.pptx
PDF
Digital Marketing in the Age of AI: What CEOs Need to Know - Jennifer Apy, Ch...
PDF
Ipsos+Protocols+Playbook+V1.2+(DEC2024)+final+IntClientUseOnly.pptx
AFCAT Syllabus 2026 Guide by Best Defence Academy in Lucknow.pdf
How the Minnesota Vikings Used Community to Drive 170% Growth and Acquire 34K...
Master Fullstack Development Course in Chennai – Enroll Now!
EVOLUTION OF RURAL MARKETING IN INDIAN CIVILIZATION
Mastering eCommerce SEO: Strategies to Boost Traffic and Maximize Conversions
Assignment 2 Task 1 - How Consumers Use Technology and Its Impact on Their Lives
Final Project parkville.............pptx
Kimberly Crossland Storytelling Marketing Class 5stars.pptx
Presentation - MindfulHeal Digital Ayurveda GTM & Marketing Plan.pptx
Modernizing IT for the age of AI - Jason Aloia, Freshworks
E_Book_Customer_Relation_Management_0.pdf
AI & Automation: The Future of Marketing or the End of Creativity - Matthew W...
Fixing-AI-Hallucinations-The-NeuroRanktm-Approach.pptx
Mastering Bulk Email Campaign Optimization for 2025
20K Btc Enabled Cash App Accounts – Safe, Fast, Verified.pdf
Wondershare Filmora Crack Free Download 2025
Sumit Saxena IIM J Project Market segmentation.pptx
Digital Marketing in the Age of AI: What CEOs Need to Know - Jennifer Apy, Ch...

Working Smarter: SEO Automation to Increase Efficiency and Effectiveness - Paul Shapiro

  • 1. Paul Shapiro | @fighto SEO Automation to Increase Efficiency and Effectiveness Working Smarter
  • 2. Paul Shapiro | @fighto
  • 3. Paul Shapiro | @fighto Why Automate? 1.Work faster and free up time for the important stuff 2.Look at more data 3.Improved consistency and procedure
  • 4. Paul Shapiro | @fighto What Can You Automate? • If you’re doing something on a routine basis, it can probably be automated (but doesn’t mean it should). • Any procedures that can be broken down into smaller, micro-tasks that can be handled a by a computer. • Machine Learning can help with more complex decision making (think power of AlphaGo).
  • 5. Paul Shapiro | @fighto
  • 6. Paul Shapiro | @fighto
  • 7. Paul Shapiro | @fighto How Do You Automate?
  • 8. Paul Shapiro | @fighto How to Conceptualize Automation: Break into Micro-tasks 1 2 3 4
  • 9. Paul Shapiro | @fighto How to Work with APIs API Endpoint: http://api.grepwords.com/lookup?apikey=random_string&q=keyword Simple API key authentication via GET request String is unique to you (authentic ation) Variable, changes and often looped
  • 10. Paul Shapiro | @fighto How to Work with APIs http://api.grepwords.com/lookup?apikey=secret&q=board+games Output (JSON): Simple API key authentication via GET request [{"keyword":"board games","updated_cpc":"2018-04-30","updated_cmp":"2018-04- 30","updated_lms":"2018-04-30","updated_history":"2018-04- 30","lms":246000,"ams":246000,"gms":246000,"competition":0.86204091185173,"com petetion":0.86204091185173,"cmp":0.86204091185173,"cpc":0.5,"m1":201000,"m1_mo nth":"2018-02","m2":246000,"m2_month":"2018-01","m3":450000,"m3_month":"2017- 12","m4":368000,"m4_month":"2017-11","m5":201000,"m5_month":"2017- 10","m6":201000,"m6_month":"2017-09","m7":201000,"m7_month":"2017- 08","m8":201000,"m8_month":"2017-07","m9":201000,"m9_month":"2017- 06","m10":201000,"m10_month":"2017-05","m11":201000,"m11_month":"2017- 04","m12":201000,"m12_month":"2017-03"}]
  • 11. Paul Shapiro | @fighto How to Work with APIs Most API Outputs: 1. JSON 2. XML 3. CSV
  • 12. Paul Shapiro | @fighto How to Work with APIs Last Step: Parse it!
  • 13. Paul Shapiro | @fighto How to Work with APIs Parsing Example Using Python: import json json_string = '[{"keyword":"board games","updated_cpc":"2018-04- 30","updated_cmp":"2018-04-30","updated_lms":"2018-04- 30","updated_history":"2018-04- 30","lms":246000,"ams":246000,"gms":246000,"competition":0.86204091185173,"com petetion":0.86204091185173,"cmp":0.86204091185173,"cpc":0.5,"m1":201000,"m1_mo nth":"2018-02","m2":246000,"m2_month":"2018-01","m3":450000,"m3_month":"2017- 12","m4":368000,"m4_month":"2017-11","m5":201000,"m5_month":"2017- 10","m6":201000,"m6_month":"2017-09","m7":201000,"m7_month":"2017- 08","m8":201000,"m8_month":"2017-07","m9":201000,"m9_month":"2017- 06","m10":201000,"m10_month":"2017-05","m11":201000,"m11_month":"2017- 04","m12":201000,"m12_month":"2017-03"}]‘ parsed_json([0]['gms']) 1 2 3
  • 14. Paul Shapiro | @fighto How to Work with APIs Full Python Script: import requests import json r = requests.get('http://api.grepwords.com/lookup?apike y=secretapikey&q=board+games') parsed_json = json.loads(r.text) print(parsed_json[0]['gms'])
  • 15. Paul Shapiro | @fighto How to Work with APIs Output
  • 16. Paul Shapiro | @fighto Tools for Automation
  • 17. Paul Shapiro | @fighto https://www.knime.org
  • 18. Paul Shapiro | @fighto Why KNIME? • Fast way to put together complex analyses • Great for prototyping • Large library of built-in “nodes” • Free/Open Source • Run on Windows/Mac/Linux • Very expandable – even compatible with R, Python, Java, JavaScript • Easy enough for non-technical staff to grasp
  • 19. Paul Shapiro | @fighto Data Agility
  • 20. Paul Shapiro | @fighto Other Options • Scripting Languages • Python • Ruby • Node.js • Go • R • Excel with VBA • Google Sheets
  • 21. Paul Shapiro | @fighto Cron & Windows Task Manager are Your Friend
  • 22. Paul Shapiro | @fighto What is Cron and Why? • *NIX system daemon used to schedule tasks and scripts. • Windows Task Manager is the Windows equivalent of Cron. • This way we can schedule scripts and programs that perform automated tasks on a recurring, scheduled basis.
  • 23. Paul Shapiro | @fighto Quick How To * * * * * command /to/execute Day of Week (0-6) (Sunday = 0) Month (1-12) Hour (0-23) Day of Month (1-31) Minute (0-59)
  • 24. Paul Shapiro | @fighto Run Every Month at Midnight 0 0 1 * * python datacollector.py
  • 25. Paul Shapiro | @fighto The Basics of KNIME
  • 26. Paul Shapiro | @fighto What is a Node? • Nodes are prebuilt, drag and drop modules designed perform a singular task • Nodes are strung together like a chain to accomplish larger, more complex tasks • Nodes can be grouped together into “meta-nodes”, which can be configured in unison
  • 27. Paul Shapiro | @fighto How Do You Add Nodes & How Do They Connect? How do you add nodes to your “workflow”? How do you string nodes together?
  • 28. Paul Shapiro | @fighto How Do You Configure & Run Nodes? Configuring Nodes Running Workflows OR
  • 29. Paul Shapiro | @fighto Keyword Research
  • 30. Paul Shapiro | @fighto Most Keyword Research Looks Like This
  • 31. Paul Shapiro | @fighto Typical Time Investment for Keyword Research 5 8 12 21 21 6 10 18 22 28 0 5 10 15 20 25 30 Micro (0-49 pages) Small (50-99 pages) Medium (100-249 pages) Large (250-499 pages) Extra Large (>500 pages) Hours to Complete Keyword Research by Site Size Average (Low End) Average (High End)
  • 32. Paul Shapiro | @fighto Size of the Data Set vs.
  • 33. Paul Shapiro | @fighto Filtering + Data Manipula- tions One Big Keyword List Seed Keywords - List - GWMT - SEMRush Comp. KWs - SQR Keywords Keyword Planner Suggestions (via GrepWords) Google Autocomplete Semantic Keyword Recommendations (via MarketMuse) Google Autocomplete SEMRush Domain vs. Domain Keywords Google Autocomplete
  • 34. Paul Shapiro | @fighto Data Manipulations / Calculations • Get top 10 results from rank checking API (i.e., GetSTAT) • Use Moz API nodes and find average PA to assess competiveness. • Optionally, use SEMRush’s Keyword Difficulty API Organic Competition Search Volume Keyword Trends
  • 35. Paul Shapiro | @fighto Data Manipulations / Calculations • Get top 10 results from rank checking API (i.e., GetSTAT) • Use Moz API nodes and find average PA to assess competiveness. • Optionally, use SEMRush’s Keyword Difficulty API Organic Competition • Get Search Volumes via SEMRush API or via GrepWords APISearch Volume Keyword Trends
  • 36. Paul Shapiro | @fighto Data Manipulations / Calculations • Get top 10 results from rank checking API (i.e., GetSTAT) • Use Moz API nodes and find average PA to assess competiveness. • Optionally, use SEMRush’s Keyword Difficulty API Organic Competition • Get Search Volumes via SEMRush API or via GrepWords APISearch Volume • Use 2 Years of Google Trends data to calculate slope and determine growing/declining keywordsKeyword Trends
  • 37. Paul Shapiro | @fighto String ‘em All Together and then…
  • 38. Paul Shapiro | @fighto Visualize This top-right quadrant contains keywords with: • Low competition • Good growth Larger bubbles show higher search volumes. You can alternatively use current rank on the x-axis to signal organic market share like a traditional growth-share matrix.
  • 39. Paul Shapiro | @fighto Option: Tableau
  • 40. Paul Shapiro | @fighto Option: Python + Bokeh Charts in KNIME
  • 41. Paul Shapiro | @fighto Sample Python Code: http://pshapi.ro/SLBokeh
  • 42. Paul Shapiro | @fighto Learn More… SearchLove Presentation about Automating Keyword Research: http://pshapi.ro/2idcqYF BrightonSEO Presentation about Semantic Keyword Research: https://searchwilderness.com/semantic-keyword-research/
  • 43. Paul Shapiro | @fighto What to Automate for SEO?
  • 44. Paul Shapiro | @fighto Reporting
  • 46. Paul Shapiro | @fighto Data Collection
  • 47. Paul Shapiro | @fighto Search Console: Schedule to run monthly with cron and back up to a SQL database: https://searchwilderness.com/gwmt-data-python/ JR Oakes’ BigQuery vision: http://pshapi.ro/2vmjDe8
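A minimal sketch of the backup step, assuming SQLite as the SQL database and assuming the rows were already fetched from the Search Analytics API with the dimensions query and page (so each row carries `keys`, `clicks`, `impressions`, `ctr` and `position`). The table name, the `save_rows` function and the cron path in the comment are hypothetical.

```python
import sqlite3

# Hypothetical crontab entry to run the script at 06:00 on the 1st of each month:
#   0 6 1 * * /usr/bin/python3 /path/to/search_console_backup.py


def save_rows(conn, month, rows):
    """Store one month of Search Console rows; re-running a month overwrites it."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS search_console (
               month TEXT, query TEXT, page TEXT,
               clicks INTEGER, impressions INTEGER, ctr REAL, position REAL,
               PRIMARY KEY (month, query, page))"""
    )
    conn.executemany(
        "INSERT OR REPLACE INTO search_console VALUES (?, ?, ?, ?, ?, ?, ?)",
        [
            (month, r["keys"][0], r["keys"][1],
             r["clicks"], r["impressions"], r["ctr"], r["position"])
            for r in rows
        ],
    )
    conn.commit()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")  # use a file path for a real backup
    sample = [{"keys": ["board games", "https://example.com/games"],
               "clicks": 12, "impressions": 340, "ctr": 0.035, "position": 4.2}]
    save_rows(connection, "2018-04", sample)
    print(connection.execute("SELECT COUNT(*) FROM search_console").fetchone()[0])
```

The composite primary key keeps a scheduled job idempotent: running it twice for the same month replaces rows instead of duplicating them.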
  • 48. Paul Shapiro | @fighto 301 Redirect Mapping from Old URLs
  • 49. Paul Shapiro | @fighto 301 Redirect Mapping from Old URLs 1. Crawl Current Site → Download Rendered Pages → Extract Main Content (BoilerPipe) → Convert to Bitvector 2. Get Historic URLs from Wayback Machine API → Filter Out URLs found on Current Site → Grab Rendered Page from Wayback Machine → Extract Main Content (BoilerPipe) → Convert to Bitvector 3. Cosine Similarity → Generate .htaccess strings
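The matching stage of this workflow might look roughly like the sketch below. It assumes the main content of each old and current page has already been extracted to plain text (the crawling, Wayback Machine and BoilerPipe steps are not shown), uses a simple word-presence bit vector, and emits Apache `Redirect 301` lines. The function names and the `threshold` value are hypothetical choices for illustration, not the presenter's implementation.

```python
import math


def to_bitvector(text, vocabulary):
    """1 where the vocabulary term occurs in the text, else 0."""
    words = set(text.lower().split())
    return [1 if term in words else 0 for term in vocabulary]


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_redirects(old_pages, current_pages, threshold=0.5):
    """Map each old path to its most similar current path, as .htaccess lines.

    Both arguments are dicts of URL path -> extracted main content.
    """
    all_texts = list(old_pages.values()) + list(current_pages.values())
    vocabulary = sorted({w for text in all_texts for w in text.lower().split()})
    current_vectors = {
        path: to_bitvector(text, vocabulary) for path, text in current_pages.items()
    }
    rules = []
    for old_path, old_text in old_pages.items():
        old_vector = to_bitvector(old_text, vocabulary)
        best_path, best_score = None, 0.0
        for path, vector in current_vectors.items():
            score = cosine_similarity(old_vector, vector)
            if score > best_score:
                best_path, best_score = path, score
        # Skip weak matches so a person can review them instead (assumed policy).
        if best_path is not None and best_score >= threshold:
            rules.append(f"Redirect 301 {old_path} {best_path}")
    return rules
```

Old URLs that fall below the threshold produce no rule, which leaves them for manual mapping rather than risking an irrelevant redirect.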
  • 50. Paul Shapiro | @fighto KNIME Toolkit
  • 51. Paul Shapiro | @fighto SERP Similarity / Overlap
  • 52. Paul Shapiro | @fighto 1. Download ranking data via STAT API. 2. Compare results from 1-10 for each query against results from 1-10 for every other query. 3. Calculate percent similarity. 4. Schedule checks and examine what changed.
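Steps 2 and 3 could be sketched as follows, assuming the ranking data has already been downloaded into a dict of query to top-10 URL list. The slide does not define "percent similarity"; this sketch assumes it means the share of URLs two result sets have in common, and the function names are hypothetical.

```python
from itertools import combinations


def percent_overlap(results_a, results_b):
    """Percentage of URLs shared by two result lists (position is ignored)."""
    set_a, set_b = set(results_a), set(results_b)
    if not set_a or not set_b:
        return 0.0
    return 100.0 * len(set_a & set_b) / max(len(set_a), len(set_b))


def pairwise_overlap(rankings):
    """Compare every query against every other query exactly once."""
    return {
        (q1, q2): percent_overlap(rankings[q1], rankings[q2])
        for q1, q2 in combinations(sorted(rankings), 2)
    }


if __name__ == "__main__":
    # Hypothetical, shortened result lists.
    rankings = {
        "board games": ["a.com", "b.com", "c.com", "d.com"],
        "family board games": ["a.com", "b.com", "x.com", "y.com"],
    }
    for pair, score in pairwise_overlap(rankings).items():
        print(pair, score)
```

Storing the output of each scheduled run makes it possible to compare overlap scores between runs and see which query pairs changed.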
  • 53. Paul Shapiro | @fighto Test JavaScript Rendering
  • 54. Paul Shapiro | @fighto http://pshapi.ro/puppetcrawl
  • 55. Paul Shapiro | @fighto Performance Testing with Lighthouse
  • 56. Paul Shapiro | @fighto http://pshapi.ro/perfpony
  • 57. Paul Shapiro | @fighto Anomaly Detection
  • 58. Paul Shapiro | @fighto CTR 1. Data collection: We collect data on query, page and associated metrics via the Google Search Console Search Analytics API. 2. Round average position: I round average position to the nearest tenth (e.g., 1.19 is rounded to 1.2). 3. Math: We identify outliers using a combination of statistical methods (modified z-score, IQR). 4. Email: If any negative outliers are identified for a query and page combination at an average position, an email containing this data is sent to each of the SEOs assigned to the account so they can investigate. 5. Scheduling: Set your script to run on a recurring basis. My SEL Article: http://pshapi.ro/2Ae2LYP
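Steps 2 and 3 can be illustrated with the sketch below. It assumes rows with `query`, `page`, `position` and `ctr` fields have already been collected, and it makes several assumptions the slide does not state: a row is flagged only when both methods agree, the modified z-score cutoff is the commonly used 3.5, the IQR fence uses the usual factor of 1.5, and groups smaller than `min_group_size` are skipped. All function names are hypothetical.

```python
from collections import defaultdict
from statistics import median, quantiles


def low_by_modified_z(values, threshold=3.5):
    """True for values far below the median, measured in MAD units."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        return [False] * len(values)
    return [0.6745 * (v - med) / mad < -threshold for v in values]


def low_by_iqr(values, factor=1.5):
    """True for values below Q1 - factor * IQR."""
    q1, _, q3 = quantiles(values, n=4)
    fence = q1 - factor * (q3 - q1)
    return [v < fence for v in values]


def find_ctr_drops(rows, min_group_size=4):
    """Group rows by rounded average position and flag unusually low CTRs."""
    groups = defaultdict(list)
    for row in rows:
        groups[round(row["position"], 1)].append(row)
    flagged = []
    for _, group in sorted(groups.items()):
        if len(group) < min_group_size:
            continue  # too few rows at this position to judge
        ctrs = [r["ctr"] for r in group]
        checks = zip(group, low_by_modified_z(ctrs), low_by_iqr(ctrs))
        flagged.extend(row for row, z_low, iqr_low in checks if z_low and iqr_low)
    return flagged
```

The flagged rows are what would go into the email in step 4; requiring both tests to agree is a conservative choice that reduces false alarms.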
  • 59. Paul Shapiro | @fighto Content Ideas
  • 60. Paul Shapiro | @fighto Reddit Data Mining: Python Script https://searchwilderness.com/reddit-python-code/ 1. Enter a filename for the output. 2. Enter a search or series of searches. 3. Choose a reddit sorting method. For this purpose, choose ‘new’. 4. Choose to look at all of reddit, or isolate to particular subreddit(s). 5. Schedule with cron to find new topic ideas on a recurring basis.
  • 61. Paul Shapiro | @fighto AMP Validation
  • 62. Paul Shapiro | @fighto Bulk Check AMP Pages with AMPBench API Python Script: http://pshapi.ro/2AHlNaE Requires: • Python • Requests package Ideally AMPBench would run locally, but it can be run off the appspot demo URL.
  • 63. Paul Shapiro | @fighto Link Building
  • 64. Paul Shapiro | @fighto Link Building: Prospecting with Competitors. Scheduled use of Ahrefs API: http://apiv2.ahrefs.com/?from=backlinks_new_lost&limit=10&target=competitor.com&where=type:%22new%22,date:%222017-06-01%22&mode=domain&output=json&token=your_personal_api_key Parsed Results + SMTP = Link Opportunities
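A rough sketch of the two ends of this workflow: building the request URL from the query parameters shown on the slide, and composing the notification email. The base URL is copied verbatim from the slide. The function names, the email addresses and the shape of the `links` list are hypothetical, and fetching and parsing the JSON response is deliberately left out because the response format is not shown in the source.

```python
from email.message import EmailMessage
from urllib.parse import urlencode

BASE_URL = "http://apiv2.ahrefs.com/"  # as printed on the slide


def build_new_links_url(target, since, token, limit=10):
    """Assemble the query string using the parameters shown on the slide."""
    params = {
        "from": "backlinks_new_lost",
        "limit": limit,
        "target": target,
        "where": f'type:"new",date:"{since}"',
        "mode": "domain",
        "output": "json",
        "token": token,
    }
    # Keep ':' and ',' unescaped so the result matches the slide's URL.
    return BASE_URL + "?" + urlencode(params, safe=":,")


def compose_email(links, sender, recipient):
    """Build (but do not send) a message listing new competitor links."""
    message = EmailMessage()
    message["Subject"] = f"{len(links)} new competitor links to review"
    message["From"] = sender
    message["To"] = recipient
    message.set_content("\n".join(links))
    return message
```

The message can then be handed to `smtplib.SMTP(...).send_message(message)` from a scheduled job; the SMTP host and credentials depend on your mail setup.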
  • 65. Paul Shapiro | @fighto Tech Audit Related Site Changes
  • 66. Paul Shapiro | @fighto • Detect new 404s and other errors • New redirects • Changes to robots.txt • Content-based changes over time • Indexation changes • New pages created • Changes in rank or traffic (rank checking API or Search Console) Use a cloud crawler like Botify/Deepcrawl with an API, or a custom solution.
  • 67. Paul Shapiro | @fighto Custom Solution 1. Run Screaming Frog in the cloud with lots of RAM: • Amazon AWS: http://ipullrank.com/how-to-run-screaming-frog-and-url-profiler-on-amazon-web-services/ • Google Cloud: https://online.marketing/guide/screaming-frog-in-google-cloud/ 2. Activate with the command line and Windows Task Scheduler for scheduling. 3. Use a macro program like RoboTask to generate reports and send them to a particular folder. 4. Download via FTP or dump to a SQL database for analysis. 5. Analysis produces alerts with SMTP.
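The analysis step could start from something like the sketch below, which compares two crawl snapshots and reports a few of the change types listed earlier (new pages, new errors, new redirects). It assumes each crawl export has been reduced to a dict of URL to HTTP status code; the `compare_crawls` name and the output keys are hypothetical, and checks such as robots.txt or content changes are not covered.

```python
def compare_crawls(previous, current):
    """Diff two crawl snapshots given as dicts of URL -> HTTP status code."""

    def newly(condition):
        # URLs that satisfy the condition now but did not in the previous crawl.
        return sorted(
            url for url, status in current.items()
            if condition(status)
            and not (url in previous and condition(previous[url]))
        )

    return {
        "new_pages": sorted(set(current) - set(previous)),
        "removed_pages": sorted(set(previous) - set(current)),
        "new_errors": newly(lambda status: status >= 400),
        "new_redirects": newly(lambda status: 300 <= status < 400),
    }


if __name__ == "__main__":
    # Hypothetical snapshots from two scheduled crawls.
    last_week = {"/": 200, "/a": 200, "/b": 200}
    this_week = {"/": 200, "/a": 404, "/b": 301, "/c": 200}
    for change, urls in compare_crawls(last_week, this_week).items():
        print(change, urls)
```

Any non-empty list in the result is a candidate for an alert email.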
  • 68. Paul Shapiro | @fighto Meta Description Creation
  • 69. Paul Shapiro | @fighto Process for Semi-Automated Meta Descriptions 1. Download webpage body contents. 2. Run through text summarization engine(s) to produce small snippets of important page text. 3. Have a person edit to avoid truncation and improve language. https://searchengineland.com/reducing-the-time-it-takes-to-write-meta-descriptions-for-large-websites-299887
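As a stand-in for a real summarization engine, the sketch below uses a very simple word-frequency extractive summarizer and then trims the draft at a word boundary. This is a swapped-in technique for illustration only, not the engine used in the presentation; the 155-character limit, the word-length filter and both function names are assumptions.

```python
import re
from collections import Counter


def summarize(text, max_sentences=1):
    """Pick the sentences whose words are most frequent in the whole text."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    words = re.findall(r"[a-z']+", text.lower())
    # Ignore very short words as a crude stop-word filter (assumption).
    frequency = Counter(w for w in words if len(w) > 3)

    def score(sentence):
        tokens = re.findall(r"[a-z']+", sentence.lower())
        return sum(frequency[t] for t in tokens) / max(len(tokens), 1)

    chosen = sorted(sentences, key=score, reverse=True)[:max_sentences]
    # Emit the chosen sentences in their original order.
    return " ".join(s for s in sentences if s in chosen)


def draft_meta_description(text, limit=155):
    """Return a draft for a person to edit, cut at a word boundary if too long."""
    summary = summarize(text)
    if len(summary) <= limit:
        return summary
    cut = summary[:limit].rsplit(" ", 1)[0]
    return cut.rstrip(",;:") + "…"
```

The output is only a first draft; the human editing step from the slide still applies.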
  • 70. Paul Shapiro | @fighto Alt Attribute Creation
  • 71. Paul Shapiro | @fighto Recognize Images • You can use custom machine learning options (https://www.tensorflow.org/tutorials/image_recognition), but it’s easier and more effective to use an API in this context. • For APIs, you have options: • Microsoft Computer Vision: https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/ • Google Cloud Vision API: https://cloud.google.com/vision/ • CloudSight: https://cloudsight.ai/ 1. Download all images without an alt attribute. 2. Run them through the API and get a caption (not perfect, but better than nothing).
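Step 1 can be sketched with the standard library alone, as below. The parser collects the `src` of every `img` tag that has no `alt` attribute at all (an empty `alt=""` is left alone, since it is commonly used for decorative images). Step 2 is represented by a `caption_fn` callable, a hypothetical placeholder for whichever image-captioning API you choose; no real API call is shown.

```python
from html.parser import HTMLParser


class MissingAltFinder(HTMLParser):
    """Collect the src of every <img> that has no alt attribute."""

    def __init__(self):
        super().__init__()
        self.missing = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attributes = dict(attrs)
        if "alt" not in attributes and attributes.get("src"):
            self.missing.append(attributes["src"])


def images_missing_alt(html):
    finder = MissingAltFinder()
    finder.feed(html)
    return finder.missing


def suggest_alt_text(html, caption_fn):
    """Map each image lacking alt text to a caption from caption_fn (placeholder)."""
    return {src: caption_fn(src) for src in images_missing_alt(html)}
```

Passing the captioning step in as a function keeps the page-parsing logic independent of the vendor you pick.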
  • 73. Paul Shapiro | @fighto A Word of Caution… https://xkcd.com/1319/
  • 74. Paul Shapiro | @fighto CatalystDigital.com Paul Shapiro https://searchwilderness.com @fighto Thanks!

Editor's Notes

  • #32: Focus on medium-size sites