CRAWLING AND INDEXING
In the simplest terms, searching the web is like looking in a very large book with an impressive index that tells you exactly where everything is located. When you perform a Google search, Google's programs check that index to determine the most relevant search results.
There are three processes in delivering search results:
1. Crawling – Does Google know about your site, and can it find it?
2. Indexing – Can Google index your site?
3. Serving – Does the site have good, useful content that is relevant to what users search for?
CRAWLING
• Crawling is the process of fetching the web pages of a website by following the links between them.
• This task is performed by software called a crawler. Crawlers, also known as spiders or bots, visit websites and send the information they collect back to the search engine that operates them.
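The link-following behaviour described above can be sketched in a few lines of Python. This is a minimal illustration, not how any real search engine's crawler is built: the `SITE` dictionary, the `LinkExtractor` class, and the `crawl` function are all hypothetical names, and the pages live in memory so the sketch runs without network access (a real crawler would fetch each URL over HTTP).

```python
from collections import deque
from html.parser import HTMLParser

# Hypothetical in-memory "website": URL path -> HTML. A real crawler would
# fetch these over HTTP; a dict keeps the sketch runnable offline.
SITE = {
    "/": '<a href="/about">About</a> <a href="/blog">Blog</a>',
    "/about": '<a href="/">Home</a>',
    "/blog": '<a href="/blog/post-1">Post 1</a> <a href="/about">About</a>',
    "/blog/post-1": "<p>No outgoing links here.</p>",
    "/orphan": "<p>Nothing links to this page.</p>",
}


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start, pages):
    """Breadth-first crawl from `start`; returns {url: html} for fetched pages."""
    fetched = {}
    queue = deque([start])
    while queue:
        url = queue.popleft()
        if url in fetched or url not in pages:
            continue  # already visited, or the link is dead
        html = pages[url]  # stand-in for an HTTP GET
        fetched[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        queue.extend(parser.links)
    return fetched


fetched = crawl("/", SITE)
print(sorted(fetched))  # ['/', '/about', '/blog', '/blog/post-1']
```

Note that `/orphan` is never fetched: no page links to it, so a crawler that only follows links has no way to discover it.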
INDEXING
• Indexing is the process of creating an index of all the fetched web pages and storing it in a huge database from which it can later be retrieved.
• An index is another name for the database used by a search engine. It contains information on all the websites the search engine was able to find. If a website is not in a search engine's index, users will not be able to find it using that search engine. Search engines regularly update their indexes.
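One common way to build such a lookup structure is an inverted index, which maps each word to the pages that contain it. The slides do not say which structure a search engine actually uses, so treat the following as an illustrative sketch only; `PAGES`, `build_index`, and `search` are hypothetical names, and the page texts are made up for the example.

```python
import re
from collections import defaultdict

# Hypothetical fetched pages (URL -> text), standing in for crawler output.
PAGES = {
    "/": "Welcome to our bakery",
    "/bread": "Fresh bread baked daily",
    "/cakes": "Cakes baked to order",
}


def build_index(pages):
    """Map each lowercase word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(url)
    return index


def search(index, query):
    """Return the URLs that contain every word in the query, sorted."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return []
    results = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(results)


index = build_index(PAGES)
print(search(index, "baked"))        # ['/bread', '/cakes']
print(search(index, "fresh bread"))  # ['/bread']
print(search(index, "croissant"))    # []
```

The last query returns nothing because no indexed page contains the word, which mirrors the point above: content that is not in the index cannot be found through that search engine.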
Difference between Indexing and Crawling
• Crawling is the step that makes indexing possible: Google's crawlers crawl through web pages, and the pages they fetch are then indexed.
• When a search engine's crawler visits a link, that is crawling; when the page behind that link is saved in the search engine's database, that is indexing.
• Crawling means that a search engine visits a link. Indexing means that the page's contents are put into the database (after analysis) and made available in search results when a request is made.
• In short, crawling is the search engine robot fetching web pages, while indexing is saving the information from those pages so that they can appear in search results.
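To make the distinction concrete, the sketch below keeps the stages as separate functions: `crawl` only visits links and collects raw pages, `index` analyzes that text and stores it for lookup, and `serve` answers a request from the stored index alone. Everything here is a simplified assumption for illustration; the `WEB` data and all function names are hypothetical.

```python
import re

# Hypothetical pages: URL -> (text, outgoing links).
WEB = {
    "/": ("Home of the hiking club", ["/trails", "/contact"]),
    "/trails": ("Trail maps for hiking and cycling", ["/"]),
    "/contact": ("", []),  # fetched, but has no text to analyze
}


def crawl(start):
    """Crawling: visit links and return the raw pages that were fetched."""
    seen, stack = {}, [start]
    while stack:
        url = stack.pop()
        if url in seen or url not in WEB:
            continue
        text, links = WEB[url]
        seen[url] = text
        stack.extend(links)
    return seen


def index(fetched):
    """Indexing: analyze fetched text and store word -> URLs for lookup."""
    database = {}
    for url, text in fetched.items():
        for word in set(re.findall(r"[a-z]+", text.lower())):
            database.setdefault(word, set()).add(url)
    return database


def serve(database, word):
    """Serving: answer a request from the index alone, without re-crawling."""
    return sorted(database.get(word.lower(), set()))


fetched = crawl("/")
database = index(fetched)
print(sorted(fetched))            # ['/', '/contact', '/trails']
print(serve(database, "hiking"))  # ['/', '/trails']
```

In this toy setup `/contact` is crawled but contributes nothing to the index, so visiting a page and making it findable in search results are clearly two separate steps.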
