SlideShare a Scribd company logo
Building Local/Geo Search
with Apache Lucene and Solr
Agenda



   Grant Ingersoll, Lucid Imagination
      Introduction
      Basics of geo-spatial search
      Tools available in Lucene and Solr
   Ryan McKinley, Voyager GIS
      Spatial search in Action:
   Sameer Maggon, AT&T Interactive
      How Solr powers local search at YP.com


                              Lucid Imagination, Inc.
Introductions
   Grant Ingersoll
         Lucene/Solr committer
         Co-author of upcoming “Taming Text”


   Ryan McKinley
         Lucene/Solr committer
         Co-founder of Voyager GIS


   Sameer Maggon
         Search Eng. Team lead at AT&T Interactive
         Active user of Lucene since 2001

                                 Lucid Imagination, Inc.
Use Cases



      Asset Management
        “Dude, where’s my map?”
      Social Networking
        Find all friends near me
      Targeted, local search results and ads
        “restaurants in Austin Texas”
        “Starbucks, 55313”
      Business Intelligence
        Restrict doc set for analysis by location

                                   Lucid Imagination, Inc.
Spatial Search Concepts



      Spatial Data Types
        Points (latitude/longitude)
        Lines
        Shapes


      Maps and overlays
        Streets, POI
                                         http://www.openstreetmap.org/?lat=44.9744&lon=-93.2484&zoom=14&layers=B000FTFT

      Integration with unstructured text
        Metadata, descriptions, user reviews, etc.

                                Lucid Imagination, Inc.
Application Needs



      Query Parsing
      Efficient distance calculations
        Euclidean, Great Circle (Haversine), Vincenty’s
      Filtering
        Bounding Box
      Sort by Distance
      Relevance Enhancement
      Faceting
      Advanced: shape intersections, routes

                                Lucid Imagination, Inc.
Lucene 2.9/Solr 1.4 Features for Spatial Search



      Lucene/Solr are excellent for dealing with unstructured text


      2.9/1.4 adds:
        Better Numeric handling for range searches


        Spatial contribution with features for (2.9 only, coming in 1.5):
        • Creating Cartesian Tiers (Grids)
        • Geohashes
        • Calculating distances
        • Filter implementations
                                   Lucid Imagination, Inc.
Query Parsing



      Query parsing is often the most difficult to get right
        User error, ambiguity in names
        Mixture of topic and location: bars in Minneapolis MN
      Geocoding translates addresses, POIs into lat/lon or other
        Several publicly available services: geonames.org, Google Maps
        Often have built-in throttles, so may not be effective for prod.


      Query logs are invaluable for developing an effective parser



                                Lucid Imagination, Inc.
Filtering



       Range queries can significantly slow down search if done
     improperly
       Goal: reduce the number of terms to evaluate
       Solution 1:
            New Trie-based numeric capabilities
       Solution 2:
            Cartesian Tiers




                                   Lucid Imagination, Inc.
Cartesian Tiers



     Divide up the space into grids and assign it an id
       Each tier breaks the space down into 2tier grids
       Sample code using Lucene spatial contrib:
   CartesianTierPlotter pl = new
    CartesianTierPlotter(10, new
    SinusoidalProjector(), "spatial");
   pl.getTierBoxId(latitude, longitude);
      See
   http://www.nsshutdown.com/projects/lucene/wh
   itepaper/locallucene_v2.html

                                 Lucid Imagination, Inc.
What’s next?



      Tighter integration in Solr
        Work already under way
        Native field types, query parsing support, faceting support


      Resources
        java-user@lucene,apache.org, solr-user@lucene.apache.org
        https://issues.apache.org/jira/browse/SOLR-773
        http://lucene.apache.org/java/2_9_1/api/contrib-
        spatial/index.html
        Many, many more general resources on the web
                                Lucid Imagination, Inc.
Voyager Spatial Data Search
                       Ryan McKinley
               Co-founder, Voyager GIS
Where is my Data?
• Files stored across the network – desktop,
  external drives, databases etc.
• Many distinct data formats
• Massive datasets keep getting bigger.
• Poor cataloging tools
• Limited metadata
Voyager Solution
Voyager is a search engine for your geographic data.

• Find data with simple text search and
  geographic constraints
• Keep data in its existing location (no need to
  import to a new system)
• Tools to work with search results
Building Local/Geo Search with Apache Lucene and Solr
Building Local/Geo Search with Apache Lucene and Solr
Building Local/Geo Search with Apache Lucene and Solr
Implementation
• Data Discovery / Extraction
• Solr search
• Wicket UI
Data Extraction
• For each result, we extract basic information:




- ESRI ArcObjects
- GDAL
- PDFBox
- Geotools
- Tika
- etc
Geographic Search in Solr
• Need to search by ‘extent’ not point
• Works well with a standard RTree
• Built a custom Lucene Filter to
  intersect/search within a given extent.
Work in Progress
• Custom Gazateer
  – “Building 12” > ‘-96.X 30.X -96.X 30.X’


• Named Entity Extraction
  – Geographic words that appear in titles / text get
    indexed with geographic properties
Geographic Search in Solr 1.5+
• Standard API, pluggable implementation.
  – Standard Qparser, pluggable indexing
• Single input ‘field’ could index multiple lucene
  fields.
• Share objects between different parts of the
  request cycle (only calculate distance once)
• Augment results with calculated value
  – Manual or from function query
How Solr powers local search at
           YP.com



           Sameer Maggon
           November 18, 2009




© 2008 AT&T Intellectual Property. All rights reserved.
AT&T and the AT&T logo are trademarks of AT&T Intellectual Property.
YP.com
        Technical Challenges
        Custom Relevance Model
        Scalability / Architecture
        Conclusion




© 2008 AT&T Intellectual Property. All rights reserved.
AT&T and the AT&T logo are trademarks of AT&T Intellectual Property.
YP.com (beta)


Local Search Site


Focused on providing
relevant results


Uses Solr for search




                       AT&T Proprietary (Restricted) Only for use by authorized individuals or any above-   3
                        designated team(s) within the AT&T companies and not for general distribution
Technical Challenges



        Relevancy                                                                                Scalability

Topically relevant results                                         10s of millions of
                                                                   records
Constrained by contextual
geographical search                                                Response time less
                                                                   than 200ms
Local relevancy is not just
keyword and location –                                             Fault resistant
ratings, brands, etc                                               More than 150 million
                                                                   searches per month




                        AT&T Proprietary (Restricted) Only for use by authorized individuals or any above-     4
                         designated team(s) within the AT&T companies and not for general distribution
Custom Relevance Model


  Topical             +     Geographical                                                                    +               Social

Complex handling of       Distance modulation based on                                                              Business with 4.5 stars and
multiword queries         business density                                                                          200 reviews is more relevant
                                                                                                                    than 5.0 star 1 review




                               AT&T Proprietary (Restricted) Only for use by authorized individuals or any above-                              5
                                designated team(s) within the AT&T companies and not for general distribution
Custom Relevance Model


   Topical             +     Geographical                                                                    +               Social

Complex handling of        Distance modulation based on                                                              Business with 4.5 stars and
multiword queries          business density                                                                          200 reviews is more relevant
                                                                                                                     than 5.0 star 1 review




Field Boosts for certain    LocalSolr as a geographic                                                                CustomScoreQuery to tie
fields                      filter                                                                                   all different scores together
Dismax to handle complex    Ability to modulate score
queries                     based on business density




                                AT&T Proprietary (Restricted) Only for use by authorized individuals or any above-                                   6
                                 designated team(s) within the AT&T companies and not for general distribution
Geographic Sharding


                                                           Score Combinations

                                                           Performance was better


                                                           Provisioning is a bit complex




               AT&T Proprietary (Restricted) Only for use by authorized individuals or any above-   7
                designated team(s) within the AT&T companies and not for general distribution
Search Architecture

                 Search Slaves                                                                      Masters

                                                        shards
    API Layer




                                                             replication                                          Feeder /
                                                                                                              Document Pipeline




                rows

                       AT&T Proprietary (Restricted) Only for use by authorized individuals or any above-                     8
                        designated team(s) within the AT&T companies and not for general distribution
Bottom Line



Solr has enabled us to innovate faster
   • Quick iterations of relevancy model and functionality
   • Open Platform with much more flexibility
   • Scalable Architecture to meet our business needs
Bottom Line



Solr has enabled us to innovate faster
   • Quick iterations of relevancy model and functionality
   • Open Platform with much more flexibility
   • Scalable Architecture to meet our business needs




Thus, delivering value to our consumers
Resources




       http://bit.ly/lucid-local




                     Lucid Imagination, Inc.
Q&A


Lucid Imagination, Inc.
http://bit.ly/lucid-local

More Related Content

PPTX
Fast for sharepoint internet and commerce scenarios
PDF
Scott Rigby - Adobe
PDF
Geo-Spatial Mapping Initiative – Africa and Asia
PPTX
Big Geo Data: Open Source and Open Standards
PPSX
Document management system using liferay 7
PPT
The Go-Geo! Spatial Data Portal: A Data Discovery and Research Tool for UK Ac...
PDF
Understanding Lucene Search Performance
PPTX
Customized Navigation Using SOLR
Fast for sharepoint internet and commerce scenarios
Scott Rigby - Adobe
Geo-Spatial Mapping Initiative – Africa and Asia
Big Geo Data: Open Source and Open Standards
Document management system using liferay 7
The Go-Geo! Spatial Data Portal: A Data Discovery and Research Tool for UK Ac...
Understanding Lucene Search Performance
Customized Navigation Using SOLR

Viewers also liked (20)

PPT
Solr Cluster installation tool "Anuenue"
PPT
Juan gris
PPT
情報科学演習 09
PPTX
Ecma 262 5th Edition を読む #5 第9条
PDF
What’s new in apache solr 1.4
PPTX
Updated: Sources of Funding
PPTX
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
PPT
Van gogh
PDF
Indexing Text and HTML Files with Solr
PPTX
корея
PDF
Impact of open source search on the intelligence community
PDF
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
PPT
Tennis
PPT
Jonh Lennon
PPTX
Oslb office365
PPTX
20101023 ie9 cache
PPT
Spanish bombss
PPTX
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14
PPT
Cancer
PDF
Learn How to Master Solr1 4
Solr Cluster installation tool "Anuenue"
Juan gris
情報科学演習 09
Ecma 262 5th Edition を読む #5 第9条
What’s new in apache solr 1.4
Updated: Sources of Funding
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Van gogh
Indexing Text and HTML Files with Solr
корея
Impact of open source search on the intelligence community
Building a Lightweight Discovery Interface for Chinese Patents, Presented by ...
Tennis
Jonh Lennon
Oslb office365
20101023 ie9 cache
Spanish bombss
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14
Cancer
Learn How to Master Solr1 4
Ad

Similar to Building Local/Geo Search with Apache Lucene and Solr (20)

PPT
Local Search using Solr at YP.com
PDF
Solr the intelligent search engine
PDF
Jean-Marc Lazard d'Exalead - Pioneering hypermedia - SEO Campus 2011
PDF
The Next Generation of Big Data Analytics
PDF
Bringing Geospatial Business Intelligence to the Enterprise
PPT
7 dee finding the right methodologies marshall sponder - 9-12-12 - submitted
PPTX
Introduction to FluentData - The Micro ORM
PDF
Database@Home - Maps and Spatial Analyses: How to use them
PPTX
Being a mobile entrepreneur
PPTX
Mesh Labs Introduction June 2012
PDF
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
PDF
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
PPTX
FME Geo Enabling Field Sales Team
PDF
Enterprise Location Intelligence
PDF
Présentation IBM InfoSphere MDM 11.3
PPTX
2012 06 hortonworks paris hug
PPT
Domain driven design
PDF
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
PDF
Faceted Navigation
PDF
Site/Location Hubs - A Hot Trend In Master Data Management (MDM)
Local Search using Solr at YP.com
Solr the intelligent search engine
Jean-Marc Lazard d'Exalead - Pioneering hypermedia - SEO Campus 2011
The Next Generation of Big Data Analytics
Bringing Geospatial Business Intelligence to the Enterprise
7 dee finding the right methodologies marshall sponder - 9-12-12 - submitted
Introduction to FluentData - The Micro ORM
Database@Home - Maps and Spatial Analyses: How to use them
Being a mobile entrepreneur
Mesh Labs Introduction June 2012
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
FME Geo Enabling Field Sales Team
Enterprise Location Intelligence
Présentation IBM InfoSphere MDM 11.3
2012 06 hortonworks paris hug
Domain driven design
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
Faceted Navigation
Site/Location Hubs - A Hot Trend In Master Data Management (MDM)
Ad

More from Lucidworks (Archived) (20)

PDF
Integrating Hadoop & Solr
PDF
The Data-Driven Paradigm
PDF
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
PDF
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
PPTX
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
PPTX
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
PPTX
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
PPTX
What's new in solr june 2014
PPTX
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
PPTX
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
PPTX
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
PDF
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
PDF
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
PPTX
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
PPTX
Solr At AOL, Presented by Sean Timm at SolrExchage DC
PPTX
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
PPTX
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
PPTX
Building a data driven search application with LucidWorks SiLK
PPTX
Introducing LucidWorks App for Splunk Enterprise webinar
PDF
Solr4 nosql search_server_2013
Integrating Hadoop & Solr
The Data-Driven Paradigm
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
What's new in solr june 2014
Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Building a data driven search application with LucidWorks SiLK
Introducing LucidWorks App for Splunk Enterprise webinar
Solr4 nosql search_server_2013

Recently uploaded (20)

PPTX
Machine Learning_overview_presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Big Data Technologies - Introduction.pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Encapsulation theory and applications.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Empathic Computing: Creating Shared Understanding
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPT
Teaching material agriculture food technology
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Cloud computing and distributed systems.
Machine Learning_overview_presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Big Data Technologies - Introduction.pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Encapsulation theory and applications.pdf
A comparative analysis of optical character recognition models for extracting...
sap open course for s4hana steps from ECC to s4
Empathic Computing: Creating Shared Understanding
Assigned Numbers - 2025 - Bluetooth® Document
“AI and Expert System Decision Support & Business Intelligence Systems”
Teaching material agriculture food technology
MIND Revenue Release Quarter 2 2025 Press Release
Programs and apps: productivity, graphics, security and other tools
Agricultural_Statistics_at_a_Glance_2022_0.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Cloud computing and distributed systems.

Building Local/Geo Search with Apache Lucene and Solr

  • 1. Building Local/Geo Search with Apache Lucene and Solr
  • 2. Agenda Grant Ingersoll, Lucid Imagination Introduction Basics of geo-spatial search Tools available in Lucene and Solr Ryan McKinley, Voyager GIS Spatial search in Action: Sameer Maggon, AT&T Interactive How Solr powers local search at YP.com Lucid Imagination, Inc.
  • 3. Introductions Grant Ingersoll Lucene/Solr committer Co-author of upcoming “Taming Text” Ryan McKinley Lucene/Solr committer Co-founder of Voyager GIS Sameer Maggon Search Eng. Team lead at AT&T Interactive Active user of Lucene since 2001 Lucid Imagination, Inc.
  • 4. Use Cases Asset Management “Dude, where’s my map?” Social Networking Find all friends near me Targeted, local search results and ads “restaurants in Austin Texas” “Starbucks, 55313” Business Intelligence Restrict doc set for analysis by location Lucid Imagination, Inc.
  • 5. Spatial Search Concepts Spatial Data Types Points (latitude/longitude) Lines Shapes Maps and overlays Streets, POI http://www.openstreetmap.org/?lat=44.9744&lon=-93.2484&zoom=14&layers=B000FTFT Integration with unstructured text Metadata, descriptions, user reviews, etc. Lucid Imagination, Inc.
  • 6. Application Needs Query Parsing Efficient distance calculations Euclidean, Great Circle (Haversine), Vincenty’s Filtering Bounding Box Sort by Distance Relevance Enhancement Faceting Advanced: shape intersections, routes Lucid Imagination, Inc.
  • 7. Lucene 2.9/Solr 1.4 Features for Spatial Search Lucene/Solr are excellent for dealing with unstructured text 2.9/1.4 adds: Better Numeric handling for range searches Spatial contribution with features for (2.9 only, coming in 1.5): • Creating Cartesian Tiers (Grids) • Geohashes • Calculating distances • Filter implementations Lucid Imagination, Inc.
  • 8. Query Parsing Query parsing is often the most difficult to get right User error, ambiguity in names Mixture of topic and location: bars in Minneapolis MN Geocoding translates addresses, POIs into lat/lon or other Several publicly available services: geonames.org, Google Maps Often have built-in throttles, so may not be effective for prod. Query logs are invaluable for developing an effective parser Lucid Imagination, Inc.
  • 9. Filtering Range queries can significantly slow down search if done improperly Goal: reduce the number of terms to evaluate Solution 1: New Trie-based numeric capabilities Solution 2: Cartesian Tiers Lucid Imagination, Inc.
  • 10. Cartesian Tiers Divide up the space into grids and assign it an id Each tier breaks the space down into 2tier grids Sample code using Lucene spatial contrib: CartesianTierPlotter pl = new CartesianTierPlotter(10, new SinusoidalProjector(), "spatial"); pl.getTierBoxId(latitude, longitude); See http://www.nsshutdown.com/projects/lucene/wh itepaper/locallucene_v2.html Lucid Imagination, Inc.
  • 11. What’s next? Tighter integration in Solr Work already under way Native field types, query parsing support, faceting support Resources java-user@lucene,apache.org, solr-user@lucene.apache.org https://issues.apache.org/jira/browse/SOLR-773 http://lucene.apache.org/java/2_9_1/api/contrib- spatial/index.html Many, many more general resources on the web Lucid Imagination, Inc.
  • 12. Voyager Spatial Data Search Ryan McKinley Co-founder, Voyager GIS
  • 13. Where is my Data? • Files stored across the network – desktop, external drives, databases etc. • Many distinct data formats • Massive datasets keep getting bigger. • Poor cataloging tools • Limited metadata
  • 14. Voyager Solution Voyager is a search engine for your geographic data. • Find data with simple text search and geographic constraints • Keep data in its existing location (no need to import to a new system) • Tools to work with search results
  • 18. Implementation • Data Discovery / Extraction • Solr search • Wicket UI
  • 19. Data Extraction • For each result, we extract basic information: - ESRI ArcObjects - GDAL - PDFBox - Geotools - Tika - etc
  • 20. Geographic Search in Solr • Need to search by ‘extent’ not point • Works well with a standard RTree • Built a custom Lucene Filter to intersect/search within a given extent.
  • 21. Work in Progress • Custom Gazateer – “Building 12” > ‘-96.X 30.X -96.X 30.X’ • Named Entity Extraction – Geographic words that appear in titles / text get indexed with geographic properties
  • 22. Geographic Search in Solr 1.5+ • Standard API, pluggable implementation. – Standard Qparser, pluggable indexing • Single input ‘field’ could index multiple lucene fields. • Share objects between different parts of the request cycle (only calculate distance once) • Augment results with calculated value – Manual or from function query
  • 23. How Solr powers local search at YP.com Sameer Maggon November 18, 2009 © 2008 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property.
  • 24. YP.com Technical Challenges Custom Relevance Model Scalability / Architecture Conclusion © 2008 AT&T Intellectual Property. All rights reserved. AT&T and the AT&T logo are trademarks of AT&T Intellectual Property.
  • 25. YP.com (beta) Local Search Site Focused on providing relevant results Uses Solr for search AT&T Proprietary (Restricted) Only for use by authorized individuals or any above- 3 designated team(s) within the AT&T companies and not for general distribution
  • 26. Technical Challenges Relevancy Scalability Topically relevant results 10s of millions of records Constrained by contextual geographical search Response time less than 200ms Local relevancy is not just keyword and location – Fault resistant ratings, brands, etc More than 150 million searches per month AT&T Proprietary (Restricted) Only for use by authorized individuals or any above- 4 designated team(s) within the AT&T companies and not for general distribution
  • 27. Custom Relevance Model Topical + Geographical + Social Complex handling of Distance modulation based on Business with 4.5 stars and multiword queries business density 200 reviews is more relevant than 5.0 star 1 review AT&T Proprietary (Restricted) Only for use by authorized individuals or any above- 5 designated team(s) within the AT&T companies and not for general distribution
  • 28. Custom Relevance Model Topical + Geographical + Social Complex handling of Distance modulation based on Business with 4.5 stars and multiword queries business density 200 reviews is more relevant than 5.0 star 1 review Field Boosts for certain LocalSolr as a geographic CustomScoreQuery to tie fields filter all different scores together Dismax to handle complex Ability to modulate score queries based on business density AT&T Proprietary (Restricted) Only for use by authorized individuals or any above- 6 designated team(s) within the AT&T companies and not for general distribution
  • 29. Geographic Sharding Score Combinations Performance was better Provisioning is a bit complex AT&T Proprietary (Restricted) Only for use by authorized individuals or any above- 7 designated team(s) within the AT&T companies and not for general distribution
  • 30. Search Architecture Search Slaves Masters shards API Layer replication Feeder / Document Pipeline rows AT&T Proprietary (Restricted) Only for use by authorized individuals or any above- 8 designated team(s) within the AT&T companies and not for general distribution
  • 31. Bottom Line Solr has enabled us to innovate faster • Quick iterations of relevancy model and functionality • Open Platform with much more flexibility • Scalable Architecture to meet our business needs
  • 32. Bottom Line Solr has enabled us to innovate faster • Quick iterations of relevancy model and functionality • Open Platform with much more flexibility • Scalable Architecture to meet our business needs Thus, delivering value to our consumers
  • 33. Resources http://bit.ly/lucid-local Lucid Imagination, Inc.