Information Retrieval : 7
Boolean Model
Prof Neeraj Bhargava
Vaibhav Khanna
Department of Computer Science
School of Engineering and Systems Sciences
Maharshi Dayanand Saraswati University Ajmer
The Boolean Model
• Simple model based on set theory and Boolean algebra
• Queries specified as Boolean expressions
– quite intuitive and precise semantics
– neat formalism
– example of query
• Term-document frequencies in the term-document matrix are
all binary
• The (standard) Boolean model of information retrieval (BIR)
is a classical information retrieval (IR) model and, at the same
time, the first and most-adopted one. ...
• The BIR is based on Boolean logic and classical set theory in
that both the documents to be searched and the user's query
are conceived as sets of terms
• Retrieval is based on whether the documents contain the
query terms or not.
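As a minimal sketch of the points above, the following builds a binary term-document matrix and answers Boolean queries with set operations. The three documents, their text, and the helper name postings are invented for illustration and do not come from the slides.

```python
# Toy collection (hypothetical documents, invented for illustration).
docs = {
    "d1": "boolean retrieval uses set theory",
    "d2": "vector models rank documents",
    "d3": "boolean algebra and set operations",
}

# Vocabulary: every distinct term in the collection, in sorted order.
vocabulary = sorted({term for text in docs.values() for term in text.split()})

# Binary term-document matrix: 1 if the term occurs in the document, else 0.
incidence = {
    term: {doc_id: int(term in text.split()) for doc_id, text in docs.items()}
    for term in vocabulary
}

def postings(term):
    """Return the set of documents that contain the term."""
    row = incidence.get(term, {})
    return {doc_id for doc_id, bit in row.items() if bit == 1}

all_docs = set(docs)

# Boolean operators map directly onto set operations.
boolean_and_set = postings("boolean") & postings("set")       # AND -> intersection
boolean_or_vector = postings("boolean") | postings("vector")  # OR  -> union
not_boolean = all_docs - postings("boolean")                  # NOT -> complement
```

Each document is either in the answer set or not; nothing in the result says how well it matches.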
The Boolean Model
• A term conjunctive component that satisfies a query q is
called a query conjunctive component c(q)
• A query q rewritten as a disjunction of those components is
called the disjunctive normal form q_DNF
• To illustrate, consider
The Boolean Model
• The three conjunctive components for the query
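The query shown on the original slide is not preserved in this text. As an assumed stand-in, take q = ka AND (kb OR NOT kc) over the query terms (ka, kb, kc), which happens to have exactly three conjunctive components. The sketch below finds them by checking every binary pattern of the three terms; the function names are hypothetical.

```python
from itertools import product

# Assumed example: the query on the original slide is not preserved,
# so q = ka AND (kb OR NOT kc) is used as a stand-in.
QUERY_TERMS = ("ka", "kb", "kc")

def query(ka, kb, kc):
    """Truth value of the assumed query for one assignment of the terms."""
    return ka and (kb or not kc)

def conjunctive_components(predicate, n_terms):
    """Enumerate all binary term patterns for which the query is true."""
    return [
        bits
        for bits in product((1, 0), repeat=n_terms)
        if predicate(*map(bool, bits))
    ]

# q_DNF is the disjunction of these patterns.
q_dnf = conjunctive_components(query, len(QUERY_TERMS))
print(q_dnf)  # [(1, 1, 1), (1, 1, 0), (1, 0, 0)]
```

Under this assumption, q_DNF = (1, 1, 1) OR (1, 1, 0) OR (1, 0, 0), where each tuple gives the presence or absence of (ka, kb, kc).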
The Boolean Model
• This approach works even if the vocabulary of the collection
includes terms not in the query
• Consider that the vocabulary is given by
• Then, a document dj that contains only terms ka, kb, and kc is
represented by c(dj) = (1, 1, 1, 0)
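The vocabulary itself is missing from the slide text. The four-component vector c(dj) = (1, 1, 1, 0) is consistent with a four-term vocabulary, so the sketch below assumes V = (ka, kb, kc, kd); the name kd and the query q = ka AND (kb OR NOT kc) are both assumptions. It shows that a term outside the query only multiplies the conjunctive components and does not change whether a document matches.

```python
from itertools import product

# Assumptions: vocabulary V = (ka, kb, kc, kd) and query
# q = ka AND (kb OR NOT kc); neither is preserved in the slide text.
VOCABULARY = ("ka", "kb", "kc", "kd")

def query(weights):
    """Evaluate q on a mapping term -> 0/1; kd is simply never consulted."""
    return bool(weights["ka"] and (weights["kb"] or not weights["kc"]))

def component(doc_terms):
    """Binary vector c(dj) over the whole vocabulary."""
    return tuple(int(term in doc_terms) for term in VOCABULARY)

# Conjunctive components of q expressed over the full vocabulary.
q_dnf_full = {
    bits
    for bits in product((0, 1), repeat=len(VOCABULARY))
    if query(dict(zip(VOCABULARY, bits)))
}

# A document that contains only ka, kb, and kc.
c_dj = component({"ka", "kb", "kc"})
matches = c_dj in q_dnf_full
```

Here c_dj is (1, 1, 1, 0), and it is one of the six four-term components of the assumed query (three patterns over ka, kb, kc, each with kd either 0 or 1).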
The Boolean Model
• The similarity of the document dj to the query q is defined as
• The Boolean model predicts that each document is either
relevant or non-relevant
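The defining formula did not survive in the slide text. A reconstruction consistent with the definitions of c(q) and c(dj) above, and with the usual textbook statement of the model, would be:

```latex
sim(d_j, q) =
\begin{cases}
  1 & \text{if } \exists\, c(q) \text{ such that } c(q) = c(d_j) \\
  0 & \text{otherwise}
\end{cases}
```

That is, a document scores 1 exactly when its conjunctive component is one of the components in q_DNF, and 0 in every other case.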
Advantages of Boolean Model
• Clean formalism
• Easy to implement
• Intuitive concept
Disadvantages of Boolean Model
• Exact matching may retrieve too few or too many documents
• Hard to translate a query into a Boolean expression
• All terms are equally weighted
• More like data retrieval than information retrieval
Drawbacks of the Boolean Model
• Retrieval based on binary decision criteria with no
notion of partial matching
• No ranking of the documents is provided (absence of
a grading scale)
• Information need has to be translated into a Boolean
expression, which most users find awkward
• The Boolean queries formulated by the users are
most often too simplistic
• The model frequently returns either too few or too
many documents in response to a user query
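The "too few or too many" behaviour can be seen on a small example. In this sketch, with an invented four-document collection and query, a strict AND returns nothing while an OR returns everything, and neither result carries an ordering.

```python
# Hypothetical mini-collection, each document reduced to its set of terms.
docs = {
    "d1": {"boolean", "model", "retrieval"},
    "d2": {"boolean", "algebra"},
    "d3": {"information", "retrieval", "ranking"},
    "d4": {"information", "model"},
}
query_terms = {"boolean", "information", "retrieval"}

# Strict conjunction: a document must contain every query term.
and_hits = {d for d, terms in docs.items() if query_terms <= terms}

# Disjunction: a single shared term is enough.
or_hits = {d for d, terms in docs.items() if query_terms & terms}

# Number of matching query terms per document: information the Boolean
# model discards, which a partial-matching model could use for ranking.
overlap = {d: len(query_terms & terms) for d, terms in docs.items()}
```

Here and_hits is empty and or_hits contains all four documents, although d1 and d3 each share two query terms and d2 and d4 only one.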
Assignment
• Explain the Boolean Model of Information Retrieval.
