Evaluating URLs at scale
26th May 2019
What was I thinking?
Maybe, just maybe, should have been called:
A guide to an SEO toolkit
Quick BIO
• Head of Digital Marketing at Bray Leino CX – 14 whole days
• Head of SEO - 2 years
• Senior SEO Manager – 2 years
• Ashridge Trees – Marketing Manager – 1 year (was a bad idea)
• Head of SEO – 3 years – SIFT / PracticeWEB
• Run a team of 4 (we’re looking for a technical SEO exec to join us) covering Digital Marketing
How many URLs can you check?
10, 100, 1,000, 10,000, a million, 100 million?
There comes a point where you want to look at them without having to look at individual URLs.
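One way to look at URLs in aggregate rather than one by one is to group them by site section. The sketch below is only an illustration: the function name `summarise_by_section`, the `example.com` URLs and the choice of the first path segment as the grouping key are all assumptions, not something from the talk.

```python
from collections import Counter
from urllib.parse import urlsplit


def summarise_by_section(urls):
    """Count URLs per first path segment, e.g. /blog/post-1 -> 'blog'."""
    counts = Counter()
    for url in urls:
        segments = [s for s in urlsplit(url).path.split("/") if s]
        counts[segments[0] if segments else "(root)"] += 1
    return counts


# Hypothetical crawl export standing in for a real URL list.
urls = [
    "https://example.com/",
    "https://example.com/blog/post-1",
    "https://example.com/blog/post-2",
    "https://example.com/products/widget",
]

summary = summarise_by_section(urls)
for section, count in summary.most_common():
    print(f"{section}: {count}")
```

The same grouping could be applied to status codes, templates or any other column a crawler exports.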
URL Tool Kit*
* There are other tools. Many. Many other tools.
• Crawling: SEMrush, Sitebulb, Xenu Link Sleuth, Screaming Frog, URL Profiler, Custom Built, Botify, Bright Local, DeepCrawl
• Backlinks: SEMrush, Majestic, Ahrefs
• Site Speed: Sitespeed.io, Lighthouse, URL Profiler
What context?
• Backlink analysis
• Migrations
• Site Structure (Internal linking visualisation)
• Specific Data (authorship?) / Regex? Canonicals
• Angular sites (SPA)
• Content Audits
Backlink analysis
Initial Audit
Backlink Analysis
• Get links from Search Console, Majestic, Ahrefs, SEMrush, Moz
• Use list mode in Screaming Frog
• Crawl the links
• See which ones 404 or have long redirect chains
• Fix them!
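The "404 or long redirect chain" check can be sketched in a few lines. This is not how Screaming Frog does it internally; it is a minimal illustration that assumes a `fetch(url)` callable returning `(status_code, location)`. The names `trace_redirects`, `classify`, `fake_fetch` and the `FAKE_SITE` data are hypothetical, and the threshold of two hops for a "long" chain is an assumption.

```python
REDIRECT_CODES = (301, 302, 303, 307, 308)


def trace_redirects(url, fetch, max_hops=10):
    """Follow redirects from url and return (chain, final_status).

    fetch(url) must return (status_code, location_or_None).
    """
    chain = [url]
    status, location = fetch(url)
    while status in REDIRECT_CODES and location:
        if location in chain or len(chain) > max_hops:
            return chain + [location], "loop-or-too-long"
        chain.append(location)
        status, location = fetch(location)
    return chain, status


def classify(url, fetch, long_chain=2):
    """Label a backlink target as 'broken', 'long-chain' or 'ok'."""
    chain, status = trace_redirects(url, fetch)
    hops = len(chain) - 1
    if status == 404:
        return "broken"
    if status == "loop-or-too-long" or hops > long_chain:
        return "long-chain"
    return "ok"


# Hypothetical responses standing in for a real crawl.
FAKE_SITE = {
    "https://example.com/old": (301, "https://example.com/older"),
    "https://example.com/older": (301, "https://example.com/newer"),
    "https://example.com/newer": (301, "https://example.com/new"),
    "https://example.com/new": (200, None),
    "https://example.com/gone": (404, None),
}


def fake_fetch(url):
    return FAKE_SITE.get(url, (404, None))


backlink_targets = [
    "https://example.com/old",
    "https://example.com/new",
    "https://example.com/gone",
]
report = {url: classify(url, fake_fetch) for url in backlink_targets}
print(report)
```

In practice `fetch` would wrap an HTTP client with redirects disabled, or the chain would be read straight from a crawler export.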
Migration
Moving site
Migration
• Crawl the existing site in spider mode
• Crawl the development site in spider mode
• Export HTML pages from existing site
• Change domain name in Excel (RStudio if there are too many)
• Use List mode in SF
• Look at 404s (these are pages that haven’t been moved to the new site)
• Set up redirect mapping
• Test (recrawl)
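The domain swap and the "which pages are missing" step can also be scripted. The sketch below is an assumption-laden illustration: instead of recrawling in list mode it compares the rewritten URLs against a set of URLs already exported from the development crawl. The function names `swap_domain` and `plan_migration` and the example hosts are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def swap_domain(url, new_host):
    """Return url with its host replaced, keeping path, query and fragment."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, new_host, parts.path, parts.query, parts.fragment))


def plan_migration(old_urls, new_site_urls, new_host):
    """Split old URLs into those found on the new site and those still missing."""
    new_site_urls = set(new_site_urls)
    moved, missing = {}, []
    for old in old_urls:
        candidate = swap_domain(old, new_host)
        if candidate in new_site_urls:
            moved[old] = candidate
        else:
            missing.append(old)  # needs a manual redirect target
    return moved, missing


# Hypothetical exports from the two spider-mode crawls.
old_urls = [
    "https://www.example.com/about",
    "https://www.example.com/team?view=all",
]
dev_urls = ["https://dev.example.com/about"]

moved, missing = plan_migration(old_urls, dev_urls, "dev.example.com")
print("moved:", moved)
print("missing:", missing)
```

Everything in `missing` is a candidate for the redirect mapping; everything in `moved` can be retested after launch.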
Site Structure
All nodes with no weight have been removed.
Left: new internal links, 312. Right: old internal links, 374.
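A comparison like this can be approximated from a crawler's internal link export. The sketch below assumes "weight" means the number of inlinks a page receives, which is an interpretation rather than something the slide states; `link_summary` and the edge lists are hypothetical.

```python
from collections import Counter


def link_summary(edges):
    """Summarise internal links given (source, target) pairs.

    Returns the number of unique links, inlink counts per page,
    and the pages that receive no inlinks at all.
    """
    unique_edges = {(s, t) for s, t in edges if s != t}  # drop self-links
    pages = {page for edge in unique_edges for page in edge}
    inlinks = Counter(target for _, target in unique_edges)
    weightless = sorted(page for page in pages if inlinks[page] == 0)
    return {"links": len(unique_edges), "inlinks": inlinks, "weightless": weightless}


# Hypothetical link exports for the old and new versions of a site.
old_edges = [("/", "/a"), ("/", "/b"), ("/a", "/b"), ("/b", "/"), ("/c", "/a")]
new_edges = [("/", "/a"), ("/a", "/b"), ("/b", "/")]

old_summary = link_summary(old_edges)
new_summary = link_summary(new_edges)
print("old links:", old_summary["links"], "weightless:", old_summary["weightless"])
print("new links:", new_summary["links"], "weightless:", new_summary["weightless"])
```

Pages listed as weightless would be the ones to drop before drawing the graph.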
Specific Data
• Author Data
• Tag data
• Unencoded URLs
• Anything not captured by default
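As an example of one item on this list, unencoded URLs can be flagged with the standard library. This is a rough sketch: `is_unencoded` is a hypothetical helper, and it only catches characters that should have been percent-encoded (spaces, non-ASCII), not malformed `%` sequences.

```python
from urllib.parse import quote

# RFC 3986 reserved characters plus '%', which may legitimately appear unescaped.
SAFE = ":/?#[]@!$&'()*+,;=%"


def is_unencoded(url):
    """True if url contains characters that should be percent-encoded."""
    return quote(url, safe=SAFE) != url


# Hypothetical URLs for illustration.
candidates = [
    "https://example.com/a b",
    "https://example.com/caf%C3%A9?x=1&y=2",
    "https://example.com/café",
]
flags = {url: is_unencoded(url) for url in candidates}
for url, flagged in flags.items():
    print(flagged, url)
```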
Specific Data
Configuration > Custom > Extraction
Specific Data
Right-click on the element you want to select
Specific Data
Paste into the extractor and rename
This data will be available at the URL level for any page that has it
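Outside a crawler, the same idea of pulling one custom field per URL can be sketched with Python's built-in HTML parser. This is not Screaming Frog's extractor; it assumes author data lives in a `<meta name="author">` tag, and `MetaExtractor`, `extract_meta` and the sample pages are hypothetical.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collect the content of <meta name="..."> tags matching one name."""

    def __init__(self, wanted_name):
        super().__init__()
        self.wanted_name = wanted_name.lower()
        self.values = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        name = (attrs.get("name") or "").lower()
        if name == self.wanted_name and attrs.get("content"):
            self.values.append(attrs["content"])


def extract_meta(html, name):
    parser = MetaExtractor(name)
    parser.feed(html)
    return parser.values


# Hypothetical pages standing in for crawled HTML.
pages = {
    "https://example.com/post-1": '<html><head><meta name="author" content="Jane Doe"></head></html>',
    "https://example.com/post-2": '<html><head><meta name="Author" content="Sam Lee" /></head></html>',
    "https://example.com/about": "<html><head><title>About</title></head></html>",
}

extracted = {url: extract_meta(html, "author") for url, html in pages.items()}
for url, authors in extracted.items():
    print(url, authors)
```

Pages without the element simply return an empty list, which mirrors the slide's point that the data is only available for pages that have it.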
JS and SPAs
AngularJS and SPAs
Problems
• Google is pretty good at crawling JS, but not all links on a page will be crawled (Google I/O, May 2018)
• It is possible that Google renders the links in a separate action after the initial crawl
• Other search engines are not as good and are more likely to have problems
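One way to see which links depend on JavaScript is to compare the links in the raw HTML response with those in the rendered DOM. The sketch below assumes the rendered HTML has already been captured elsewhere (for example by a headless browser or a crawler's rendering mode); `LinkCollector`, `links_in`, `js_only_links` and the sample markup are hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = set()

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.add(href)


def links_in(html):
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


def js_only_links(raw_html, rendered_html):
    """Links present after rendering but absent from the raw response."""
    return sorted(links_in(rendered_html) - links_in(raw_html))


# Hypothetical raw and rendered versions of the same SPA page.
raw = '<div id="app"></div><a href="/about">About</a>'
rendered = (
    '<div id="app"><a href="/products">Products</a>'
    '<a href="/contact">Contact</a></div><a href="/about">About</a>'
)

js_links = js_only_links(raw, rendered)
print(js_links)
```

Links that only appear in the rendered version are the ones most at risk of being missed or crawled late.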
Content Audits
• Readability scores
• Sentiment
• Broadcasted keywords
• Page Speed (Desktop vs Mobile)
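As an illustration of a readability score, the Flesch Reading Ease formula can be computed per page. This is a rough sketch, not the scoring used by any particular tool: the syllable counter is a crude vowel-group heuristic, and `count_syllables`, `flesch_reading_ease` and the sample texts are assumptions.

```python
import re


def count_syllables(word):
    """Rough heuristic: count groups of consecutive vowels, minimum one."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_reading_ease(text):
    """Flesch Reading Ease: higher scores mean easier text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        return None
    syllables = sum(count_syllables(w) for w in words)
    return (
        206.835
        - 1.015 * (len(words) / len(sentences))
        - 84.6 * (syllables / len(words))
    )


# Hypothetical page copy for comparison.
simple = "The cat sat on the mat. It was happy."
dense = "Comprehensive organisational restructuring necessitates interdepartmental communication."

simple_score = flesch_reading_ease(simple)
dense_score = flesch_reading_ease(dense)
print(round(simple_score, 1), round(dense_score, 1))
```

Run across every page's extracted body text, a score like this gives one more column to sort and filter by in a content audit.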
Tool Overview
Screaming Frog
Tools
• Three modes: Spider, List, SERP
• Very configurable and data focused
• Speed of crawl (crawl too fast and it can act like a DDoS attack, true story)
• Can specify CDNs
• Crawler identifying name (user agent)
• Add GA and SC (Google Analytics and Search Console)
• Storage (RAM vs hard disk)
• £149 a year
Sitebulb
Tools
• Report focused, not crawl focused
• Hints are great
• Runs and reports on multiple tests
• Keeps all the audits in a central place
• £27.50 a month
URL Profiler
Tools
• Swiss army knife of tools
• Great at bringing together metrics in one place per URL
• £24 a month (2 devices)
RStudio
• Steep learning curve
• Can handle large datasets
• Can use machine learning
• Lots of online tutorials
• Keyword mapping (comp analysis)
• Link visualization
• Keyword clustering
• Free
Any Questions?

