SlideShare a Scribd company logo
QUERY EXPANSION WITH ENRICHED USER PROFILES FOR PERSONALIZED
SEARCH UTILIZING FOLKSONOMY DATA
ABSTRACT
Query expansion has been widely adopted in Web search as a way of tackling the ambiguity of
queries. Personalized search utilizing folksonomy data has demonstrated an extreme vocabulary
mismatch problem that requires even more effective query expansion methods. Co-occurrence
statistics, tag-tag relationships and semantic matching approaches are among those favored by
previous research. However, user profiles which only contain a user’s past annotation
information may not be enough to support the selection of expansion terms, especially for users
with limited previous activity with the system. We propose a novel model to construct enriched
user profiles with the help of an external corpus for personalized query expansion. Our model
integrates the current state-of-the-art text representation learning framework, known as word
embeddings, with topic models in two groups of pseudo-aligned documents. Based on user
profiles, we build two novel query expansion techniques. These two techniques are based on
topical weights-enhanced word embeddings, and the topical relevance between the query and the
terms inside a user profile respectively. The results of an in-depth experimental evaluation,
performed on two real-world datasets using different external corpora, show that our approach
outperforms traditional techniques, including existing non-personalized and personalized query
expansion methods.
EXISTING SYSTEM
Over the past number of years personalized search algorithms which utilize folksonomy data
have attracted significant attention in the literature . This is partially due to the relative
unavailability of users’ search and click-through history to independent researchers not
employed by, or engaged with, a commercial search engine. Another reason for utilizing
folksonomy data is that tags are highly ambiguous, representing a typical realworld Web search
scenario of short queries formulated by users. “Folksonomy” is a term typically used to describe
the social classification phenomenon. Online folksonomy services are used by millions of users
world-wide, enabling users to save and organize their online bookmarks with freely chosen short
text descriptors.
DISADVANTAGES:
 User profiles which contain only a user’s past annotation information may not be enough
to support the effective selection of expansion terms, especially for users who have had
limited previous activity with the system.
 Previous personalized QE research either favors tagtag relationships or relies on the co-
occurrence statistics of two terms.
PROPOSED SYSTEM
We tackle the challenge of personalized QE utilizing folksonomy data in a novel way by
integrating latent and deep semantics. We propose a novel model that integrates word
embeddings with topic models to construct enriched user profiles with the help of an external
corpus.We suggest two novel personalized QE techniques based on topical weights-enhanced
word embeddings, and the topical relevance between the query and the terms inside a user
profile. The techniques demonstrate significantly better results than previously proposed non-
personalized and personalized QE methods.
ADVANTAGES
Our model integrates the current state-of-the-art text representation learning framework, known
as word embeddings, with topic models in two groups of pseudo-aligned documents between
user annotations and documents from the external corpus. Based on these enhanced user profiles,
we then present two novel QE techniques.
The first technique approaches the problem by using topical weights-enhanced word embeddings
to select the best possible expansion terms.
The second technique calculates the topical relevance between the query and the terms inside a
user profile.
OBJECTIVES
 We tackle the challenge of personalized QE utilizing folksonomy data in a novel way by
integrating latent and deep semantics.
 We propose a novel model that integrates word embeddings with topic models to
construct enriched user profiles with the help of an external corpus.
 We suggest two novel personalized QE techniques based on topical weights-enhanced
word embeddings, and the topical relevance between the query and the terms inside a
user profile. The techniques demonstrate significantly better results than previously
proposed non-personalized and personalized QE methods.
Architecture Diagram
SYSTEM REQUIREMENTS
H/W SYSTEM CONFIGURATION:-
Processor - Pentium –IV
Speed - 1.5 Ghz
RAM - 512 MB(min)
Hard Disk - 40 GB
S/W SYSTEM CONFIGURATION
 Operating System :Windows95/98/2000/XP
 Application Server : Tomcat5.0/6.X
 Front End : HTML, Java, Jsp
 Scripts : JavaScript.
 Server side Script : Java Server Pages.
 Database Connectivity : Mysql.

More Related Content

PDF
Personalized web search using browsing history and domain knowledge
PDF
Context Driven Technique for Document Classification
PDF
dexa08linli
PDF
Classification-based Retrieval Methods to Enhance Information Discovery on th...
PDF
Enhancing the Privacy Protection of the User Personalized Web Search Using RDF
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
PDF
TWO WAY CHAINED PACKETS MARKING TECHNIQUE FOR SECURE COMMUNICATION IN WIRELES...
PDF
Filtering Unwanted Messages from Online Social Networks (OSN) using Rule Base...
Personalized web search using browsing history and domain knowledge
Context Driven Technique for Document Classification
dexa08linli
Classification-based Retrieval Methods to Enhance Information Discovery on th...
Enhancing the Privacy Protection of the User Personalized Web Search Using RDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
TWO WAY CHAINED PACKETS MARKING TECHNIQUE FOR SECURE COMMUNICATION IN WIRELES...
Filtering Unwanted Messages from Online Social Networks (OSN) using Rule Base...

What's hot (20)

PDF
A Review: Text Classification on Social Media Data
PDF
Iaetsd hierarchical fuzzy rule based classification
PDF
K1803057782
PDF
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...
PDF
IJERD(www.ijerd.com)International Journal of Engineering Research and Develop...
PDF
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
PDF
Iaetsd efficient filteration of unwanted messages
PDF
G017415465
DOCX
A system to filter unwanted messages from osn user walls
PDF
Custom-Made Ranking in Databases Establishing and Utilizing an Appropriate Wo...
PDF
50120140502013
PDF
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
PDF
A New Algorithm for Inferring User Search Goals with Feedback Sessions
PDF
Query- And User-Dependent Approach for Ranking Query Results in Web Databases
PPTX
A system to filter unwanted messages from OSN user walls
PDF
User search goal inference and feedback session using fast generalized – fuzz...
DOCX
Dynamic personalized recommendation on sparse data
PDF
USPatents
PDF
Performance Evaluation of Query Processing Techniques in Information Retrieval
PDF
Using user personalized ontological profile to infer semantic knowledge for p...
A Review: Text Classification on Social Media Data
Iaetsd hierarchical fuzzy rule based classification
K1803057782
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...
IJERD(www.ijerd.com)International Journal of Engineering Research and Develop...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
Iaetsd efficient filteration of unwanted messages
G017415465
A system to filter unwanted messages from osn user walls
Custom-Made Ranking in Databases Establishing and Utilizing an Appropriate Wo...
50120140502013
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
A New Algorithm for Inferring User Search Goals with Feedback Sessions
Query- And User-Dependent Approach for Ranking Query Results in Web Databases
A system to filter unwanted messages from OSN user walls
User search goal inference and feedback session using fast generalized – fuzz...
Dynamic personalized recommendation on sparse data
USPatents
Performance Evaluation of Query Processing Techniques in Information Retrieval
Using user personalized ontological profile to infer semantic knowledge for p...
Ad

Similar to QUERY EXPANSION WITH ENRICHED USER PROFILES FOR PERSONALIZED SEARCH UTILIZING FOLKSONOMY DATA (20)

PDF
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
PDF
Naresh sharma
PDF
Semantic web personalization
PPTX
Web Minnig and text mining presentation
PPTX
User friendly pattern search paradigm
PDF
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
DOCX
JPJ1421 Facilitating Document Annotation Using Content and Querying Value
DOCX
Personalized mobile search engine
DOCX
JAVA 2013 IEEE DATAMINING PROJECT PMSE A Personalized Mobile Search Engine
PDF
Ay3313861388
PDF
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
PDF
E0322035037
DOCX
JPJ1419 Discovering Emerging Topics in Social Streams via Link-Anomaly Detec...
PDF
Kp3518241828
PDF
Classification of News and Research Articles Using Text Pattern Mining
DOC
View the Microsoft Word document.doc
DOC
View the Microsoft Word document.doc
DOC
View the Microsoft Word document.doc
PDF
an efficient approach for co extracting opinion targets based in online revie...
PDF
IRJET - Deep Collaborrative Filtering with Aspect Information
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
Naresh sharma
Semantic web personalization
Web Minnig and text mining presentation
User friendly pattern search paradigm
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
JPJ1421 Facilitating Document Annotation Using Content and Querying Value
Personalized mobile search engine
JAVA 2013 IEEE DATAMINING PROJECT PMSE A Personalized Mobile Search Engine
Ay3313861388
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
E0322035037
JPJ1419 Discovering Emerging Topics in Social Streams via Link-Anomaly Detec...
Kp3518241828
Classification of News and Research Articles Using Text Pattern Mining
View the Microsoft Word document.doc
View the Microsoft Word document.doc
View the Microsoft Word document.doc
an efficient approach for co extracting opinion targets based in online revie...
IRJET - Deep Collaborrative Filtering with Aspect Information
Ad

More from Prasadu Peddi (17)

PDF
Pointers
PDF
String notes
DOCX
B.Com 1year Lab programs
DOCX
COMPUTING SEMANTIC SIMILARITY OF CONCEPTS IN KNOWLEDGE GRAPHS
DOCX
Energy-efficient Query Processing in Web Search Engines
DOCX
MINING COMPETITORS FROM LARGE UNSTRUCTURED DATASETS
DOCX
GENERATING QUERY FACETS USING KNOWLEDGE BASES
DOCX
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
DOCX
SOCIRANK: IDENTIFYING AND RANKING PREVALENT NEWS TOPICS USING SOCIAL MEDIA FA...
DOCX
COLLABORATIVE FILTERING-BASED RECOMMENDATION OF ONLINE SOCIAL VOTING
DOCX
DYNAMIC FACET ORDERING FOR FACETED PRODUCT SEARCH ENGINES
PPTX
A Cross Tenant Access Control (CTAC) Model for Cloud Computing: Formal Specif...
PPTX
Time and Attribute Factors Combined Access Control on Time-Sensitive Data in ...
PPTX
Attribute Based Storage Supporting Secure Deduplication of Encrypted D...
PPTX
RAAC: Robust and Auditable Access Control with Multiple Attribute Authorities...
PPTX
Provably Secure Key-Aggregate Cryptosystems with Broadcast Aggregate Keys for...
PPTX
Identity-Based Remote Data Integrity Checking With Perfect Data Privacy Prese...
Pointers
String notes
B.Com 1year Lab programs
COMPUTING SEMANTIC SIMILARITY OF CONCEPTS IN KNOWLEDGE GRAPHS
Energy-efficient Query Processing in Web Search Engines
MINING COMPETITORS FROM LARGE UNSTRUCTURED DATASETS
GENERATING QUERY FACETS USING KNOWLEDGE BASES
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
SOCIRANK: IDENTIFYING AND RANKING PREVALENT NEWS TOPICS USING SOCIAL MEDIA FA...
COLLABORATIVE FILTERING-BASED RECOMMENDATION OF ONLINE SOCIAL VOTING
DYNAMIC FACET ORDERING FOR FACETED PRODUCT SEARCH ENGINES
A Cross Tenant Access Control (CTAC) Model for Cloud Computing: Formal Specif...
Time and Attribute Factors Combined Access Control on Time-Sensitive Data in ...
Attribute Based Storage Supporting Secure Deduplication of Encrypted D...
RAAC: Robust and Auditable Access Control with Multiple Attribute Authorities...
Provably Secure Key-Aggregate Cryptosystems with Broadcast Aggregate Keys for...
Identity-Based Remote Data Integrity Checking With Perfect Data Privacy Prese...

Recently uploaded (20)

PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
Geodesy 1.pptx...............................................
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PDF
Digital Logic Computer Design lecture notes
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
DOCX
573137875-Attendance-Management-System-original
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
Welding lecture in detail for understanding
PDF
PPT on Performance Review to get promotions
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
composite construction of structures.pdf
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
Well-logging-methods_new................
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Geodesy 1.pptx...............................................
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Digital Logic Computer Design lecture notes
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
bas. eng. economics group 4 presentation 1.pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
573137875-Attendance-Management-System-original
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Welding lecture in detail for understanding
PPT on Performance Review to get promotions
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
composite construction of structures.pdf
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Well-logging-methods_new................
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Automation-in-Manufacturing-Chapter-Introduction.pdf

QUERY EXPANSION WITH ENRICHED USER PROFILES FOR PERSONALIZED SEARCH UTILIZING FOLKSONOMY DATA

  • 1. QUERY EXPANSION WITH ENRICHED USER PROFILES FOR PERSONALIZED SEARCH UTILIZING FOLKSONOMY DATA ABSTRACT Query expansion has been widely adopted in Web search as a way of tackling the ambiguity of queries. Personalized search utilizing folksonomy data has demonstrated an extreme vocabulary mismatch problem that requires even more effective query expansion methods. Co-occurrence statistics, tag-tag relationships and semantic matching approaches are among those favored by previous research. However, user profiles which only contain a user’s past annotation information may not be enough to support the selection of expansion terms, especially for users with limited previous activity with the system. We propose a novel model to construct enriched user profiles with the help of an external corpus for personalized query expansion. Our model integrates the current state-of-the-art text representation learning framework, known as word embeddings, with topic models in two groups of pseudo-aligned documents. Based on user profiles, we build two novel query expansion techniques. These two techniques are based on topical weights-enhanced word embeddings, and the topical relevance between the query and the terms inside a user profile respectively. The results of an in-depth experimental evaluation, performed on two real-world datasets using different external corpora, show that our approach outperforms traditional techniques, including existing non-personalized and personalized query expansion methods. EXISTING SYSTEM Over the past number of years personalized search algorithms which utilize folksonomy data have attracted significant attention in the literature . This is partially due to the relative unavailability of users’ search and click-through history to independent researchers not employed by, or engaged with, a commercial search engine. Another reason for utilizing folksonomy data is that tags are highly ambiguous, representing a typical realworld Web search scenario of short queries formulated by users. “Folksonomy” is a term typically used to describe the social classification phenomenon. Online folksonomy services are used by millions of users
  • 2. world-wide, enabling users to save and organize their online bookmarks with freely chosen short text descriptors. DISADVANTAGES:  User profiles which contain only a user’s past annotation information may not be enough to support the effective selection of expansion terms, especially for users who have had limited previous activity with the system.  Previous personalized QE research either favors tagtag relationships or relies on the co- occurrence statistics of two terms. PROPOSED SYSTEM We tackle the challenge of personalized QE utilizing folksonomy data in a novel way by integrating latent and deep semantics. We propose a novel model that integrates word embeddings with topic models to construct enriched user profiles with the help of an external corpus.We suggest two novel personalized QE techniques based on topical weights-enhanced word embeddings, and the topical relevance between the query and the terms inside a user profile. The techniques demonstrate significantly better results than previously proposed non- personalized and personalized QE methods. ADVANTAGES Our model integrates the current state-of-the-art text representation learning framework, known as word embeddings, with topic models in two groups of pseudo-aligned documents between user annotations and documents from the external corpus. Based on these enhanced user profiles, we then present two novel QE techniques. The first technique approaches the problem by using topical weights-enhanced word embeddings to select the best possible expansion terms. The second technique calculates the topical relevance between the query and the terms inside a user profile.
  • 3. OBJECTIVES  We tackle the challenge of personalized QE utilizing folksonomy data in a novel way by integrating latent and deep semantics.  We propose a novel model that integrates word embeddings with topic models to construct enriched user profiles with the help of an external corpus.  We suggest two novel personalized QE techniques based on topical weights-enhanced word embeddings, and the topical relevance between the query and the terms inside a user profile. The techniques demonstrate significantly better results than previously proposed non-personalized and personalized QE methods. Architecture Diagram SYSTEM REQUIREMENTS
  • 4. H/W SYSTEM CONFIGURATION:- Processor - Pentium –IV Speed - 1.5 Ghz RAM - 512 MB(min) Hard Disk - 40 GB S/W SYSTEM CONFIGURATION  Operating System :Windows95/98/2000/XP  Application Server : Tomcat5.0/6.X  Front End : HTML, Java, Jsp  Scripts : JavaScript.  Server side Script : Java Server Pages.  Database Connectivity : Mysql.