Divyansh Verma
SAU/AM(M)/2014/14
South Asian University
Email : itsmedv91@gmail.com
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
Contents
• Search Engine : Google
• Magic Behind Google Success
• PageRank Algorithm
• PageRank - How it works ?
• Importance of Linear Algebra in Page Ranking Algorithm
• References
Search Engine : Google
What is a search engine?
A web search engine is a software system that is designed to
search for information on the World Wide Web.
E.g. Google, Bing, Yahoo, Ask.
Why Google?
• It is the most popular search engine.
• It is very simple, fast and precise.
• It adapts to the growing internet.
Magic Behind Google Success
When Google went online in the late 1990s, one thing that set it apart
from other search engines was that its search result listings
consistently delivered "good stuff".
Search engines like Google have to do three basic things:
1. Crawl the web and locate all web pages with public access.
2. Index the collected data so that searches are efficient.
3. Rate the importance of each page in the database, so that when the
user does a search, the more important pages are presented first.
A big part of the magic behind Google's success is its PageRank
algorithm.
PageRank Algorithm
The PageRank algorithm was developed by Google's founders, Larry
Page and Sergey Brin, when they were graduate students at
Stanford University.
PageRank is a link analysis algorithm that ranks the relative
importance of all web pages within a network.
Three features for determining PageRank :
• Outgoing Links - the number of links on a page that point to other pages
• Incoming Links - the number of times other pages link to (cite)
this page
• Rank - A value representing the page's relative importance in
the network.
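The first two features can be counted directly from a link list. The following is a minimal sketch in Python, assuming a hypothetical five-page web stored as a dictionary `links` (the name and the data are illustrative, not part of the original slides).

```python
from collections import Counter

# Hypothetical five-page web: page -> pages it links to (illustrative data).
links = {1: [2, 4], 2: [3, 4, 5], 3: [4], 4: [5], 5: [1]}

# Outgoing links: how many links each page contains.
out_degree = {page: len(targets) for page, targets in links.items()}

# Incoming links: how many times each page is cited by other pages.
cited = Counter(target for targets in links.values() for target in targets)
in_degree = {page: cited.get(page, 0) for page in links}

print("outgoing:", out_degree)
print("incoming:", in_degree)
```

In this toy web, page 4 is cited most often, but the rank of a page also depends on how important the citing pages are, which is what the eigenvector formulation captures.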
PageRank – How it Works ?
Mathematical Model of Internet
1. Represent the internet as a graph
2. Represent the graph as a stochastic matrix
3. Make the stochastic matrix more convenient ⇒ Google matrix
4. Find the dominant eigenvector of the Google matrix ⇒ PageRank
Internet as a Graph
A link from one web page to another is a directed edge.
Web graph: web pages = nodes, links = edges
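Web graph as a Matrix
Links = nonzero elements in the matrix.
Every page $i$ has $l_i \ge 1$ outlinks, and the matrix $S$ is defined by
$$S_{ij} = \begin{cases} 1/l_i & \text{if page } i \text{ has a link to page } j \\ 0 & \text{otherwise} \end{cases}$$
$S_{ij}$ is the probability that a surfer moves from page $i$ to page $j$, so every row of $S$ sums to 1.
$S$ is a sparse matrix, as most of its entries are zero.
Example for a web graph with five pages, numbered 1 to 5:
$$S = \begin{pmatrix}
0 & 1/2 & 0 & 1/2 & 0 \\
0 & 0 & 1/3 & 1/3 & 1/3 \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0
\end{pmatrix}$$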
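The five-page matrix above can be rebuilt from its link structure. This is a minimal sketch, assuming NumPy is available; the dictionary `links` is read off the nonzero entries of S, and the helper name `link_matrix` is hypothetical. It assumes every page has at least one outlink and lists each target once.

```python
import numpy as np

# Link structure read off the nonzero entries of the example matrix S.
links = {1: [2, 4], 2: [3, 4, 5], 3: [4], 4: [5], 5: [1]}

def link_matrix(links):
    """Build S with S[i, j] = 1 / l_i if page i links to page j, else 0."""
    pages = sorted(links)
    index = {page: k for k, page in enumerate(pages)}
    n = len(pages)
    S = np.zeros((n, n))
    for page, targets in links.items():
        for target in targets:
            S[index[page], index[target]] = 1.0 / len(targets)
    return S

S = link_matrix(links)
print(S)
print("row sums:", S.sum(axis=1))  # each row sums to 1 (row-stochastic)
```

For a web-scale graph a dense array would not be practical; a sparse matrix format would be the natural choice, since most entries are zero.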
Google Matrix
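A convex combination of two stochastic matrices gives the Google
matrix, which is again stochastic and, for a positive teleportation
vector, irreducible and more convenient to work with.
$$G = \alpha S + (1 - \alpha)\,\mathbf{1} v^T$$
where $0 \le \alpha < 1$ is the damping factor,
$\mathbf{1}$ is the column vector whose entries are all 1,
$v$ is a probability vector ($v \ge 0$, $v^T \mathbf{1} = 1$) that models teleportation: $v_i$ is the probability of jumping to web page $i$.
The term $\alpha S$ models following links, and the term $(1 - \alpha)\,\mathbf{1} v^T$ models teleportation.
The eigenvalues of $G$ are $1, \alpha\lambda_2(S), \alpha\lambda_3(S), \dots$ with $1 > \alpha|\lambda_2(S)| \ge \alpha|\lambda_3(S)| \ge \dots$
Unique dominant left eigenvector: $\pi^T G = \pi^T$, $\pi \ge 0$, $\|\pi\|_1 = 1$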
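The construction of G can be checked numerically on the five-page example. This is a minimal sketch, assuming NumPy, a damping factor of 0.85 and a uniform teleportation vector (both illustrative choices, not values given in the slides). It also assumes the teleportation term is the rank-one matrix formed from the all-ones column vector and v^T. The function name `google_matrix` is hypothetical.

```python
import numpy as np

# Example link matrix S for the five-page web graph.
S = np.array([
    [0, 1/2, 0,   1/2, 0],
    [0, 0,   1/3, 1/3, 1/3],
    [0, 0,   0,   1,   0],
    [0, 0,   0,   0,   1],
    [1, 0,   0,   0,   0],
])

def google_matrix(S, alpha=0.85, v=None):
    """Return G = alpha * S + (1 - alpha) * ones * v^T (alpha, v are assumptions)."""
    n = S.shape[0]
    if v is None:
        v = np.full(n, 1.0 / n)      # uniform teleportation vector
    ones = np.ones(n)                # all-ones column vector
    return alpha * S + (1.0 - alpha) * np.outer(ones, v)

G = google_matrix(S)

# Eigenvalue moduli in decreasing order: the largest is 1,
# and all others are bounded by alpha.
moduli = np.sort(np.abs(np.linalg.eigvals(G)))[::-1]
print("row sums:", G.sum(axis=1))
print("eigenvalue moduli:", moduli)
```

The gap between 1 and the second-largest modulus is what makes the dominant eigenvector unique and iterative methods converge.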
PageRank
The dominant left eigenvector $\pi^T$ of $G$ gives the PageRank:
$\pi^T G = \pi^T$, $\pi \ge 0$
$\pi_i$ is the PageRank of web page $i$.
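One common way to approximate the dominant left eigenvector is power iteration. This is a minimal sketch, assuming NumPy, a damping factor of 0.85 and a uniform teleportation vector (illustrative choices); the function name `pagerank_power` and its tolerance are hypothetical. It is not a description of Google's production computation. Because the iterate always sums to 1, the teleportation term reduces to adding (1 - alpha) * v, so G never has to be formed explicitly.

```python
import numpy as np

# Example link matrix S for the five-page web graph.
S = np.array([
    [0, 1/2, 0,   1/2, 0],
    [0, 0,   1/3, 1/3, 1/3],
    [0, 0,   0,   1,   0],
    [0, 0,   0,   0,   1],
    [1, 0,   0,   0,   0],
])

def pagerank_power(S, alpha=0.85, v=None, tol=1e-12, max_iter=1000):
    """Iterate pi^T <- pi^T G until the 1-norm change drops below tol."""
    n = S.shape[0]
    if v is None:
        v = np.full(n, 1.0 / n)          # uniform teleportation (assumption)
    pi = np.full(n, 1.0 / n)             # start from the uniform distribution
    for _ in range(max_iter):
        # pi^T G = alpha * pi^T S + (1 - alpha) * v^T, since pi sums to 1.
        new_pi = alpha * (pi @ S) + (1.0 - alpha) * v
        if np.abs(new_pi - pi).sum() < tol:
            return new_pi
        pi = new_pi
    return pi

pi = pagerank_power(S)
print("PageRank vector:", pi)
```

Each step only needs one product with the sparse matrix S, which is why this style of iteration scales to very large graphs.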
How Google Ranks Web pages
• Model: Internet → Web graph → Stochastic matrix G
• Computation: the dominant left eigenvector $\pi$ of G gives the PageRank $\pi_i$ of each page
• Display: if $\pi_i > \pi_k$, then page i may* be displayed before page k
*depending on hypertext analysis
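The three steps (model, computation, display) can be traced end to end on the five-page example. This is a minimal sketch, assuming NumPy, a damping factor of 0.85 and uniform teleportation (illustrative choices). It uses a dense eigen-decomposition, which is only sensible for tiny matrices, and it ignores the hypertext analysis mentioned in the footnote.

```python
import numpy as np

# Model: link matrix S of the five-page web graph, then Google matrix G.
S = np.array([
    [0, 1/2, 0,   1/2, 0],
    [0, 0,   1/3, 1/3, 1/3],
    [0, 0,   0,   1,   0],
    [0, 0,   0,   0,   1],
    [1, 0,   0,   0,   0],
])
alpha = 0.85                                   # assumed damping factor
n = S.shape[0]
G = alpha * S + (1.0 - alpha) * np.full((n, n), 1.0 / n)

# Computation: left eigenvectors of G are right eigenvectors of G^T.
eigenvalues, eigenvectors = np.linalg.eig(G.T)
k = np.argmax(eigenvalues.real)                # index of the eigenvalue 1
pi = np.abs(eigenvectors[:, k].real)           # fix the arbitrary sign
pi = pi / pi.sum()                             # normalise to sum 1

# Display: pages sorted by decreasing PageRank (page numbers start at 1).
order = [int(i) + 1 for i in np.argsort(-pi)]
print("display order:", order)
```

With these assumptions the example places page 5 first and page 3 last.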
Importance of Linear Algebra
Using techniques of linear algebra, one can show that the PageRank
problem has a unique solution and compute it.
The entries of the PageRank eigenvector give the importance of
every web page.
PageRank is at its core an eigenvector problem, so linear algebra is
the natural tool for solving it.
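The uniqueness claim can be made concrete by rewriting the eigenvector equation as a linear system. With π^T G = π^T, G = αS + (1 − α)𝟙v^T and π normalised to sum 1, one gets π^T (I − αS) = (1 − α) v^T. The matrix I − αS is nonsingular for α < 1, so the solution is unique. This is a minimal sketch, assuming NumPy, α = 0.85 and a uniform v (illustrative choices); the function name `pagerank_solve` is hypothetical.

```python
import numpy as np

# Example link matrix S for the five-page web graph.
S = np.array([
    [0, 1/2, 0,   1/2, 0],
    [0, 0,   1/3, 1/3, 1/3],
    [0, 0,   0,   1,   0],
    [0, 0,   0,   0,   1],
    [1, 0,   0,   0,   0],
])

def pagerank_solve(S, alpha=0.85, v=None):
    """Solve (I - alpha * S)^T pi = (1 - alpha) * v for the PageRank vector."""
    n = S.shape[0]
    if v is None:
        v = np.full(n, 1.0 / n)        # uniform teleportation (assumption)
    A = (np.eye(n) - alpha * S).T      # transpose turns the row equation into A pi = b
    return np.linalg.solve(A, (1.0 - alpha) * v)

pi = pagerank_solve(S)
print("PageRank vector:", pi)
print("sum:", pi.sum())                # equals 1 without explicit normalisation
```

A direct solve like this is practical only for small graphs; for large ones an iterative method is the usual choice.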
References
https://www.rose-hulman.edu/~bryan/googleFinalVersionFixed.pdf
http://www.math.cornell.edu/~mec/Winter2009/RalucaRemus/Lecture3/lecture3.html
http://www.cs.princeton.edu/~chazelle/courses/BIB/pagerank.html
http://blog.kleinproject.org/?p=280
THANK
YOU
