SlideShare a Scribd company logo
How Search Engine Works ?
Presented by Mohammed Azharuddin
Digital Marketing Trainer
History of Search
• 1990 – Archi Query Form
– FTP based file search engine
• Feb 1993 – Excite.com
– General word relation based search
• Oct 1993 – AliWeb
– Manual submission engine
• Jan 1994 – Altavista
– First natural language search engine
• Jan 1996 – Backrup
– Started by Larry Page and Segrey Brin
• Sep 15 1997 – Google.com
– First search engine with Page Rank Technology
• 1997 – Yandex.com
– Russian based search engine
• 1998 – MSN Search
– Microsoft Rival to Google
• 2000 – Baidu.com
– Chinese based search engine
• 2008 – duckduckgo.com
– Non tracking search engine
• 2009 – Bing.com
– Microsoft Rival to Google
• 2010 – Blekko.com
– Spam and Virus free search
http://www.searchenginehistory.com/
http://www.google.co.in/about/company/history/
http://www.wordstream.com/articles/internet-search-engines-history
The Google Story
Search Engine Architecture
• Every search engine is based on following
– Crawling
– Indexing
– Algorithms
– Results
– Fight Spam
Google Architecture
http://infolab.stanford.edu/~backrub/google.html
Search Engine Architecture
Crawler
Store
Indexer
100 Million GB
indexes
indexes
Search
Interface
Algorithms
(Programs)
trash
trash
trash
Sorted based on Content / Factors
WWW
60 Trillion Pages
Or
60 Lakh CroreLive Google Example
Algorithms
• Programs and Formulas to get relevant results
– Page Rank
– Spelling Check
– Synonym check
– Auto complete
– Query Understanding
– Safe Search
– User Context
Page Rank Algorithm
• Google's first algorithms, which looks at links
between pages to determine their relevance.
• PR is a number generated for each page
available in Google Index
• PR Toolbar Range
– NA to 10 (Best Rank) : This is based on Log Scale
of 0 – 10
• Real Page rank is calculated based on number
of pages in index, which can be 0.15 to
Trillions
Toolbar Vs. Real PR
Toolbar Real PR
0 0 - 10
1 100 - 1,000
2 1,000 – 10,000
3 10,000 – 100000
4 100000 – 1000000
5 1000000 - 10000000
http://www.webworkshop.net/pagerank_calculator.php3
PR Formula
Updated Formula
Old Formula
D = Damping Factor ; PR(N) = PR of Linking Site ; L(N) : No of Outbound Links
Example
http://en.wikipedia.org/wiki/PageRank
http://www.cs.princeton.edu/~chazelle/courses/BIB/pagerank.htm
Fighting Spam
• Spam refers to websites which uses un ethical
practices for Search Rankings
• To fight the spam Google release updates
frequently called as “Algorithm Updates”
• Google changes its search algorithm around
500 – 600 times every year.
• Some of them are major and few are minor
updates
Major Updates
Basics of search engines and algorithms (1)
• Panda Update - February 23, 2011
– This algorithm target the sites with thin content,
content farms, duplicate content, sites with high
ad-to-content ratios, and a number of other
quality issues.
– Affected 12% queries on launch
– Recent update : Panda 4 – May 19 2013
Basics of search engines and algorithms (1)
• Penguin Update – April 24, 2012
– This algorithm target the sites which over optimize
the websites, uses excessive links.
– Affected 3% queries on launch
– Recent update : Pengiun 2.1 – Oct 4 2013
Basics of search engines and algorithms (1)
Humming Bird Update – August 2013
• This algorithm understands the context of the
query by analyzing the words in query
• It can automatically rewrite the query internally
based on certain words like “Near”, Vs, How to,
Where, Who is …. Etc
• Many queries are provided as “ONE BOX
ANSWERS” to give the quick answers.
How it Works ?
User Query
Query
Translator
Modified
Query
Index
One Box Answers Queries
• When is Independence of India
• Time in India or Time in Toronto
• 1$ to INR
• 1Mile to Kms
• Banana Vs. Apple
• Who is wife of Bill Gates
• What is my IP
• who invented www
• Show me pictures of taj mahal
Search Engine Results Page
(SERP)
Basics of search engines and algorithms (1)
Types of Results
Paid Results
PPC Ads
Comparison Ads
Shopping Ads
Non Paid Results
Organic Web
News Results
Image Results
Local Results
Video Results
Site Links
Schema Data
Click Through Rate (CTR)
• CTR is a measure to understand how many users are
clicking on the site from SERP
• CTR helps to understand the user response
• The top four positions “above the fold” for many
desktop users, receive 83% of first page organic
clicks.
CTR = (No of Clicks/No of Impressions)x100
2011
2012 CTR Results
Branded Vs. Un Branded
Thank you
Give us your feedback

More Related Content

PDF
Understanding search engine algorithms
PPTX
Basics of Search Engines and Algorithms
PPTX
TechSEO Boost 2017: SEO Best Practices for JavaScript T-Based Websites
PPTX
MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...
PPTX
TechSEO Boost 2017: The State of Technical SEO
Understanding search engine algorithms
Basics of Search Engines and Algorithms
TechSEO Boost 2017: SEO Best Practices for JavaScript T-Based Websites
MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...
TechSEO Boost 2017: The State of Technical SEO

What's hot (18)

PPT
SEO Essentials
PPTX
TechSEO Boost 2017: Making the Web Fast
PPTX
SEO Tools
PDF
App Store PR
PPTX
Inside google search - how it works??
PDF
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...
PPTX
How search engine works
PPTX
SEO Basics
PPTX
Comparing Search Engines
PDF
Hypermedia APIs that make sense
PPTX
Google algorithim’s
PPTX
The truth behind seo
KEY
App Store keyword popularity - by App Codes
PDF
KEY
App Store SEO tutorial
PPTX
Winning with mobile page speed: killer technologies, tools, and tips [by Aleh...
DOC
How a search engine works report
PPT
How search engine works ( Mr. Mirza)
SEO Essentials
TechSEO Boost 2017: Making the Web Fast
SEO Tools
App Store PR
Inside google search - how it works??
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...
How search engine works
SEO Basics
Comparing Search Engines
Hypermedia APIs that make sense
Google algorithim’s
The truth behind seo
App Store keyword popularity - by App Codes
App Store SEO tutorial
Winning with mobile page speed: killer technologies, tools, and tips [by Aleh...
How a search engine works report
How search engine works ( Mr. Mirza)
Ad

Similar to Basics of search engines and algorithms (1) (20)

PPTX
Search Engine Optimization - Fundamentals - SEO
PDF
SEO 101
PDF
Search Engine Optimization - Aykut Aslantaş
PPTX
Search Engine optimization (SEO)
PPTX
Search Engine
PPTX
Medical Website Optimization
PPTX
Introduction To SEO (SEARCH ENGINE OPTIMIZATION)- Learning Catalyst
PPSX
Search engine optimization (seo) overview
PDF
Seo Made Easy
PDF
Basic Level SEO Interview Questions.pdf
PPT
Emarketing1
PPTX
Exploring Search Engines and their usage online
PDF
SEOMoz The Beginners Guide To SEO
PPT
Seo Presentation for Beginners, Complete SEO ppt,
PDF
Introduction of Search Engine & working process.pdf
PPT
Search Engine Optimization and Analytics for CSEPP Advanced Training Course
PPTX
SEO Audit Workshop : Frameworks , Techniques and Tools
PPTX
Demand Quest SEO Training Sept. 2017 - Session 1
PDF
Search Engine Optimisation - SEO basic training
PPTX
Seo Intorduction
Search Engine Optimization - Fundamentals - SEO
SEO 101
Search Engine Optimization - Aykut Aslantaş
Search Engine optimization (SEO)
Search Engine
Medical Website Optimization
Introduction To SEO (SEARCH ENGINE OPTIMIZATION)- Learning Catalyst
Search engine optimization (seo) overview
Seo Made Easy
Basic Level SEO Interview Questions.pdf
Emarketing1
Exploring Search Engines and their usage online
SEOMoz The Beginners Guide To SEO
Seo Presentation for Beginners, Complete SEO ppt,
Introduction of Search Engine & working process.pdf
Search Engine Optimization and Analytics for CSEPP Advanced Training Course
SEO Audit Workshop : Frameworks , Techniques and Tools
Demand Quest SEO Training Sept. 2017 - Session 1
Search Engine Optimisation - SEO basic training
Seo Intorduction
Ad

More from kongara (20)

PPTX
K.chaitanya sm
DOCX
Stakeholder management
PPTX
K.chaitanya pm
PDF
2 e salesforce objectives.pdf (3 files merged)
PDF
Linear logisticregression
PDF
Adwords introduction (1)
PPTX
Offpage optimization
PDF
Isttm evol, dynamics, trends hrm
PDF
Isttm hyd ir v2.0
PDF
Isstm merit rating, promotions & transfers
PPTX
Matching entrepreneur
PPT
Marketing channel selection
PPT
Market feasibility
DOCX
Innovation & entrepreneurship development program
PPT
Industrial policy
PPTX
government industrial policies
PPT
Current scenario
PPT
Feasibilitystudy
PPTX
Dpr
PPT
Entrepreneurial competence
K.chaitanya sm
Stakeholder management
K.chaitanya pm
2 e salesforce objectives.pdf (3 files merged)
Linear logisticregression
Adwords introduction (1)
Offpage optimization
Isttm evol, dynamics, trends hrm
Isttm hyd ir v2.0
Isstm merit rating, promotions & transfers
Matching entrepreneur
Marketing channel selection
Market feasibility
Innovation & entrepreneurship development program
Industrial policy
government industrial policies
Current scenario
Feasibilitystudy
Dpr
Entrepreneurial competence

Recently uploaded (20)

PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Classroom Observation Tools for Teachers
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Cell Structure & Organelles in detailed.
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Basic Mud Logging Guide for educational purpose
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
01-Introduction-to-Information-Management.pdf
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
TR - Agricultural Crops Production NC III.pdf
Classroom Observation Tools for Teachers
STATICS OF THE RIGID BODIES Hibbelers.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Cell Structure & Organelles in detailed.
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Abdominal Access Techniques with Prof. Dr. R K Mishra
Basic Mud Logging Guide for educational purpose
Microbial disease of the cardiovascular and lymphatic systems
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
01-Introduction-to-Information-Management.pdf
VCE English Exam - Section C Student Revision Booklet
Supply Chain Operations Speaking Notes -ICLT Program
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
O5-L3 Freight Transport Ops (International) V1.pdf
Anesthesia in Laparoscopic Surgery in India
Module 4: Burden of Disease Tutorial Slides S2 2025
Chapter 2 Heredity, Prenatal Development, and Birth.pdf

Basics of search engines and algorithms (1)

  • 1. How Search Engine Works ? Presented by Mohammed Azharuddin Digital Marketing Trainer
  • 2. History of Search • 1990 – Archi Query Form – FTP based file search engine • Feb 1993 – Excite.com – General word relation based search • Oct 1993 – AliWeb – Manual submission engine • Jan 1994 – Altavista – First natural language search engine
  • 3. • Jan 1996 – Backrup – Started by Larry Page and Segrey Brin • Sep 15 1997 – Google.com – First search engine with Page Rank Technology • 1997 – Yandex.com – Russian based search engine • 1998 – MSN Search – Microsoft Rival to Google
  • 4. • 2000 – Baidu.com – Chinese based search engine • 2008 – duckduckgo.com – Non tracking search engine • 2009 – Bing.com – Microsoft Rival to Google • 2010 – Blekko.com – Spam and Virus free search http://www.searchenginehistory.com/ http://www.google.co.in/about/company/history/ http://www.wordstream.com/articles/internet-search-engines-history
  • 6. Search Engine Architecture • Every search engine is based on following – Crawling – Indexing – Algorithms – Results – Fight Spam
  • 8. Search Engine Architecture Crawler Store Indexer 100 Million GB indexes indexes Search Interface Algorithms (Programs) trash trash trash Sorted based on Content / Factors WWW 60 Trillion Pages Or 60 Lakh CroreLive Google Example
  • 9. Algorithms • Programs and Formulas to get relevant results – Page Rank – Spelling Check – Synonym check – Auto complete – Query Understanding – Safe Search – User Context
  • 10. Page Rank Algorithm • Google's first algorithms, which looks at links between pages to determine their relevance. • PR is a number generated for each page available in Google Index • PR Toolbar Range – NA to 10 (Best Rank) : This is based on Log Scale of 0 – 10 • Real Page rank is calculated based on number of pages in index, which can be 0.15 to Trillions
  • 11. Toolbar Vs. Real PR Toolbar Real PR 0 0 - 10 1 100 - 1,000 2 1,000 – 10,000 3 10,000 – 100000 4 100000 – 1000000 5 1000000 - 10000000 http://www.webworkshop.net/pagerank_calculator.php3
  • 12. PR Formula Updated Formula Old Formula D = Damping Factor ; PR(N) = PR of Linking Site ; L(N) : No of Outbound Links
  • 14. Fighting Spam • Spam refers to websites which uses un ethical practices for Search Rankings • To fight the spam Google release updates frequently called as “Algorithm Updates” • Google changes its search algorithm around 500 – 600 times every year. • Some of them are major and few are minor updates
  • 17. • Panda Update - February 23, 2011 – This algorithm target the sites with thin content, content farms, duplicate content, sites with high ad-to-content ratios, and a number of other quality issues. – Affected 12% queries on launch – Recent update : Panda 4 – May 19 2013
  • 19. • Penguin Update – April 24, 2012 – This algorithm target the sites which over optimize the websites, uses excessive links. – Affected 3% queries on launch – Recent update : Pengiun 2.1 – Oct 4 2013
  • 21. Humming Bird Update – August 2013 • This algorithm understands the context of the query by analyzing the words in query • It can automatically rewrite the query internally based on certain words like “Near”, Vs, How to, Where, Who is …. Etc • Many queries are provided as “ONE BOX ANSWERS” to give the quick answers.
  • 22. How it Works ? User Query Query Translator Modified Query Index
  • 23. One Box Answers Queries • When is Independence of India • Time in India or Time in Toronto • 1$ to INR • 1Mile to Kms • Banana Vs. Apple • Who is wife of Bill Gates • What is my IP • who invented www • Show me pictures of taj mahal
  • 24. Search Engine Results Page (SERP)
  • 26. Types of Results Paid Results PPC Ads Comparison Ads Shopping Ads Non Paid Results Organic Web News Results Image Results Local Results Video Results Site Links Schema Data
  • 27. Click Through Rate (CTR) • CTR is a measure to understand how many users are clicking on the site from SERP • CTR helps to understand the user response • The top four positions “above the fold” for many desktop users, receive 83% of first page organic clicks. CTR = (No of Clicks/No of Impressions)x100
  • 28. 2011
  • 30. Branded Vs. Un Branded
  • 31. Thank you Give us your feedback