BETTER SEARCH
     ENGINE TESTING
        FOR SOLR



CVILLE MEETUP | EPUGH@O19S.COM | @DEP4B

                                          1
WHY AM I QUALIFIED TO BE
           UP HERE?
• Principal of OpenSource Connections - Solr Search Consultancy

• Member of Apache Software Foundation

• Presenter at conferences (OSCON, ApacheCON, UberConf, ExpoQA, STPcon 2009/2011!)

• Fascinated by the art of software development
                                      2


AUTHOR




                      3
FATHER




         4
AGENDA

   Why Test Search?


Two General Approaches

  Test Tools for Solr


      Wrap Up



                         5
REDUCE HUMAN ERROR




                     6
photo credit to Jay Grieves @ www.bigswingingdeveloper.com
                                                             7
TWO APPROACHES TO TEST

               Humans

   Automated
     Tests




                         8
HUMAN TESTING



• Deals better with ambiguity




                                  9
RELEVANCY IS LIKE
PORNOGRAPHY....



                    10
I KNOW IT WHEN I SEE IT!




  http://en.wikipedia.org/wiki/Les_Amants

                                            11
AUTOMATED TESTING
                    12
SWEET SPOT


• Repeat Testing for Relevancy Tuning

• Performance Testing: hitting SLA?

• Regression Testing (typical use case for automated testing)
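
The "hitting SLA?" question can be automated once query latencies are recorded. A minimal sketch, assuming latencies have already been collected in milliseconds; the function names `percentile` and `meets_sla`, the 95th-percentile target, and the sample numbers are hypothetical and not from the talk:

```python
import math
from typing import Sequence

def percentile(latencies_ms: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile of the observed latencies."""
    if not latencies_ms:
        raise ValueError("no latencies recorded")
    ordered = sorted(latencies_ms)
    # Nearest-rank method: the smallest value with at least pct% of samples at or below it.
    rank = max(1, math.ceil(pct / 100.0 * len(ordered)))
    return ordered[rank - 1]

def meets_sla(latencies_ms: Sequence[float], sla_ms: float, pct: float = 95.0) -> bool:
    """True when the chosen percentile is within the SLA threshold."""
    return percentile(latencies_ms, pct) <= sla_ms

# Made-up latencies for ten queries; one slow outlier.
samples = [120, 80, 95, 300, 110, 90, 105, 100, 85, 115]
print(meets_sla(samples, sla_ms=250))
```

Checking a high percentile rather than the average keeps a single slow query from hiding behind many fast ones.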




                                                                  13
“ONE TRUE ANSWER”

• Absolute Truth / Matrix / Grid / TREC / Relevancy Assertions

 • The correct answers for each search are known ahead of time

 • Human judges often decide these correct answers, which are stored as Relevancy Assertions

 • Can be labor intensive to set up

• A “Numerical Grade” is produced for comparison
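
A minimal sketch of how a numerical grade might be computed from relevancy assertions. The names `relevancy_assertions` and `grade`, the document ids, and the scoring rule (percentage of expected documents found in the top k results) are assumptions for illustration; the slides do not specify a formula.

```python
from typing import Dict, List, Set

# Hypothetical relevancy assertions: for each query, the document ids
# that human judges decided should appear in the results.
relevancy_assertions: Dict[str, Set[str]] = {
    "HIF1A": {"doc-1", "doc-2", "doc-3", "doc-4"},
}

def grade(results: List[str], expected: Set[str], k: int = 10) -> float:
    """Return the percentage (0-100) of expected documents found in the top k results."""
    if not expected:
        raise ValueError("a query needs at least one expected document")
    found = expected.intersection(results[:k])
    return 100.0 * len(found) / len(expected)

# Ranked ids as they might come back from the search engine (made up here).
ranked = ["doc-2", "doc-9", "doc-1", "doc-7", "doc-3"]
score = grade(ranked, relevancy_assertions["HIF1A"], k=5)
print(score)  # 3 of the 4 expected documents were found
```

Because the grade is a plain number, the same assertions can be re-run after every relevancy change and the scores compared directly.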

                                                                   14
SO WHAT IS A JUDGEMENT
         LIST?




                         15
COMPARABLE RESULTS
Query          Original   AjaxSolr   WebSolr

gata 4         40.0       80.0       80.0
HIF 1 alpha    90.0       100.0      30.0
HIF-1 alpha    100.0      100.0      30.0
HIF1 alpha     70.0       100.0      100.0
HIF1A          100.0      100.0      100.0
iNOS           60.0       60.0       60.0
IRE-1 alpha    30.0       90.0       90.0
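
One way to read a grid like this is to average each column and flag the queries that got worse. The sketch below uses the grades from the slide; the helper names `average_grades` and `regressions`, and the choice of "lower than the Original column" as the regression rule, are assumptions for illustration.

```python
from statistics import mean
from typing import Dict, List, Tuple

CONFIGS = ("Original", "AjaxSolr", "WebSolr")

# Grades per query, copied from the slide, in CONFIGS order.
grades: Dict[str, Tuple[float, float, float]] = {
    "gata 4": (40.0, 80.0, 80.0),
    "HIF 1 alpha": (90.0, 100.0, 30.0),
    "HIF-1 alpha": (100.0, 100.0, 30.0),
    "HIF1 alpha": (70.0, 100.0, 100.0),
    "HIF1A": (100.0, 100.0, 100.0),
    "iNOS": (60.0, 60.0, 60.0),
    "IRE-1 alpha": (30.0, 90.0, 90.0),
}

def average_grades(table: Dict[str, Tuple[float, ...]]) -> Dict[str, float]:
    """Mean grade for each configuration across all queries."""
    return {name: mean(row[i] for row in table.values())
            for i, name in enumerate(CONFIGS)}

def regressions(table: Dict[str, Tuple[float, ...]], candidate: str,
                baseline: str = "Original") -> List[str]:
    """Queries where the candidate configuration scores below the baseline."""
    b, c = CONFIGS.index(baseline), CONFIGS.index(candidate)
    return [query for query, row in table.items() if row[c] < row[b]]

print(average_grades(grades))
print(regressions(grades, "WebSolr"))
```

Here WebSolr and Original share the same average, yet WebSolr drops sharply on two spellings of the same gene name, which is why the per-query view matters alongside the overall number.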
                                                       17
WRAP UP




          18
Search Stack

                  User Interface

                  Search Engine

                  Information

                                      19
A HYBRID APPROACH?

             Humans

 Automated
   Tests




                      22
A HYBRID APPROACH?



  Tool Supported Testing

  Humans

  Automated Tests

  Judgements
                            23
TESTING IS A HUMAN ACTIVITY


               James Bach,
               satisfice.com




                               24
THANK YOU!



• twitter: dep4b

• speakerrate: http://www.speakerrate.com/epugh/

• email: epugh@opensourceconnections.com

                          25

