How We Incrementally Improved
           Search




        Ravi Mynampaty
         @ravimynampaty
Agenda


•   Background
•   Roadmap
•   Implementation
•   Analytics
•   Benefits
•   Challenges
•   Next Steps
Background: A few years ago…

•   Out-of-the-box Ultraseek
•   No optimization, no customization
•   Fraction of HBS content indexed / searchable
•   Many dead ends
•   Proliferation of different search tools
•   User sentiment
     • “search sucks”
     • “why can’t it be more like Google”
Background: Our Vision

•   One Search Box to Rule Them All
•   The long term goal: enterprise search
•   One-stop searching
•   Google-like simplicity
•   Handle refinement / navigation on results page
Roadmap: Preliminary Steps

• Inventory document collections

• Inventory search-type tools

• Of the above, identify
      – most heavily used
      – strategically significant
      – high impact
      – Low Hanging Fruit (LHF)
Roadmap: Implementation Plan

•   Prioritize tasks by ease of content access
    and implementation (LHF)

•   Develop timeline

•   Build prototypes and iterate the design
Implementation: How we built it

• Customized Ultraseek’s results display code
• Worked with owners of software apps
      – Provided JSON APIs
      – Allowed us to spider their app/repository
• HTML is the API!
• In other words:
      No rocket science involved
Implementation: Three Integration Approaches


• Blended Search (e.g., Faculty/Staff Directory)

• Brokered Query (e.g., Video Catalog)

• Query Resubmit (e.g., Alumni Directory)
Implementation: Blended Search

Spider HBS web content outside of HBS.EDU
• Harbus.org (student newspaper)
• Club and affiliated sites




Spider HBS content located in other applications
• Faculty and staff phone book
• Alumni Class Notes application
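
A minimal sketch of the blended idea, assuming content spidered from several sources lands in one shared index so a single query searches all of it. The collection names, URLs, and the toy in-memory index are illustrative assumptions; Ultraseek's own index works differently.

```python
from collections import defaultdict


def build_index(collections):
    """Map each lowercase term to the set of (collection, url) pairs containing it."""
    index = defaultdict(set)
    for name, docs in collections.items():
        for url, text in docs.items():
            for term in text.lower().split():
                index[term].add((name, url))
    return index


def search(index, query):
    """Return hits that contain every query term, whatever their source collection."""
    terms = query.lower().split()
    if not terms:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in terms))
    return sorted(hits)


# Hypothetical spidered content: one external site, one internal application.
collections = {
    "harbus": {"https://example.org/story/1": "finance club elects officers"},
    "directory": {"https://example.edu/dir/42": "Jane Doe finance faculty"},
}
index = build_index(collections)
```

Here `search(index, "finance")` returns one hit from each collection in a single result list, which is the point of blending.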
Implementation: Optimize and clean up search indexes

• Work with content owners to create good HTML page titles
      – Faculty Publications pages
      – 20th Century Leadership database
      – Address MS-Office / PDF files too

• Eliminate duplicate search results / use filters

• Adjust relevance per collection / source / file path
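
The last two points can be sketched roughly as follows. This is an illustration under assumptions, not the Ultraseek configuration we used: the result fields (`url`, `collection`, `score`), the boost table, and the crude URL normalization are all hypothetical.

```python
def clean_results(results, boosts):
    """Drop duplicate URLs and re-rank by score times a per-collection boost."""
    best = {}
    for r in results:
        # Assumption: URLs that differ only in case or a trailing slash are duplicates.
        key = r["url"].rstrip("/").lower()
        adjusted = r["score"] * boosts.get(r["collection"], 1.0)
        # Keep only the highest-scoring copy of each normalized URL.
        if key not in best or adjusted > best[key]["score"]:
            best[key] = {**r, "score": adjusted}
    return sorted(best.values(), key=lambda r: r["score"], reverse=True)


# Hypothetical raw hits: the first two point at the same page.
results = [
    {"url": "https://example.edu/pubs/", "collection": "web", "score": 0.8},
    {"url": "https://example.edu/pubs", "collection": "faculty", "score": 0.5},
    {"url": "https://example.edu/other", "collection": "web", "score": 0.9},
]
cleaned = clean_results(results, boosts={"faculty": 2.0})
```

With the `faculty` collection boosted, the duplicate page is kept once (the boosted copy) and ranks above the other hit.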
Implementation: Create Best Bets




Top 10 Queries
Oct – Dec
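
One way to picture the Best Bets workflow, assuming the top queries come from a search log and each Best Bet is a hand-picked link for a normalized query. The log lines, the bets table, and the URL are made up for illustration.

```python
from collections import Counter


def normalize(query):
    """Lowercase and collapse whitespace so variants of a query count together."""
    return " ".join(query.lower().split())


def top_queries(log, n=10):
    """Return the n most frequent normalized queries with their counts."""
    counts = Counter(q for q in map(normalize, log) if q)
    return counts.most_common(n)


def best_bet(query, bets):
    """Look up a curated result for the query, or None if there is none."""
    return bets.get(normalize(query))


# Hypothetical log extract and a hand-maintained Best Bets table.
log = ["MBA", "mba ", "Case  Studies", "case studies", "mba"]
bets = {"mba": "https://example.edu/mba/"}
```

`top_queries(log, 2)` gives the candidates worth a Best Bet, and `best_bet("MBA", bets)` is the lookup done at query time.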
Implementation: Unify Blended Search + Query Resubmit
Query refinement options
(Blended Search)




Query resubmit options
“Integration-lite”
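
A rough sketch of query resubmit as "integration-lite", assuming it amounts to offering links that carry the user's query over to another tool's own search page. The target URLs and parameter names below are placeholders, not the real HBS endpoints.

```python
from urllib.parse import urlencode

# Hypothetical targets: label -> (search page URL, name of its query parameter).
RESUBMIT_TARGETS = {
    "Alumni Directory": ("https://alumni.example.edu/search", "q"),
    "Library Catalog": ("https://library.example.edu/find", "term"),
}


def resubmit_links(query, targets=RESUBMIT_TARGETS):
    """Build one link per target that re-runs the same query in that tool."""
    return {
        label: f"{base}?{urlencode({param: query})}"
        for label, (base, param) in targets.items()
    }


links = resubmit_links("jane doe")
```

No indexing or result merging is involved; the results page only needs to render these links next to the blended results.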
Implementation: Expanding the Net w/ Brokered Search


• When direct indexing isn’t practical
      – Harvard.edu search
      – HBS VideoTools (intranet only)
      – MBA Event Calendar (intranet only)
• A query is handed off to another search engine
• Results are returned “behind the scenes” as
  JavaScript Object Notation (JSON) / Python
• Ajax-like support of asynchronous search
  processes
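
The hand-off described above can be sketched like this. It is a simplified illustration: `fake_backend` is a stand-in for a remote search engine (no network calls are made), and the backend names, delays, and timeout are assumptions.

```python
import asyncio
import json


async def fake_backend(name, query, delay):
    """Stand-in for a remote search engine that answers with a JSON string."""
    await asyncio.sleep(delay)
    return json.dumps({"source": name, "results": [f"{name} hit for {query}"]})


async def broker(query, backends, timeout=1.0):
    """Hand the query to every backend concurrently; mark any that time out."""

    async def ask(name, delay):
        try:
            raw = await asyncio.wait_for(fake_backend(name, query, delay), timeout)
            return json.loads(raw)
        except asyncio.TimeoutError:
            return {"source": name, "results": [], "error": "timeout"}

    # gather() returns answers in the same order as the backends were given.
    return await asyncio.gather(*(ask(n, d) for n, d in backends.items()))


if __name__ == "__main__":
    # Hypothetical backends: name -> simulated response time in seconds.
    print(asyncio.run(broker("strategy", {"video": 0.0, "calendar": 0.5}, timeout=0.1)))
```

A slow backend does not hold up the page: its slot simply comes back empty with an error marker, while the others are shown as they arrive.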
Implementation: Brokered Query in Action
Implementation: One-offs

• Software Dev Docs (cmd line)

$ find ./software/docs -name '*html' |
         xargs grep -i oracle | less

(returns 100s of docs)

• Built web-based search UI
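
For comparison, a rough Python equivalent of the command-line search, the kind of function a small web UI could call. This is a sketch under assumptions, not the UI we built: `grep_docs` is a hypothetical name, and it returns matching files rather than matching lines as `grep` does.

```python
from pathlib import Path


def grep_docs(root, needle, pattern="*html"):
    """Return files under root whose name matches pattern and whose text
    contains needle, ignoring case."""
    needle = needle.lower()
    matches = []
    for path in sorted(Path(root).rglob(pattern)):
        # errors="ignore" skips undecodable bytes instead of failing on odd files.
        if path.is_file() and needle in path.read_text(errors="ignore").lower():
            matches.append(path)
    return matches
```

For example, `grep_docs("./software/docs", "oracle")` lists the same documents the pipeline above would page through.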
Analytics: Tracking Usage of Features
Analytics: Tracking Best Bets
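
One plausible way to track Best Bets, assuming the results page logs an event whenever a Best Bet is shown and whenever it is clicked. The event names and fields are hypothetical; the slides do not show the actual log format.

```python
from collections import defaultdict


def best_bet_ctr(events):
    """Return {query: clicks / impressions} for every query whose Best Bet was shown."""
    shown = defaultdict(int)
    clicked = defaultdict(int)
    for e in events:
        if e["type"] == "bestbet_shown":
            shown[e["query"]] += 1
        elif e["type"] == "bestbet_click":
            clicked[e["query"]] += 1
    return {q: clicked[q] / n for q, n in shown.items()}


# Hypothetical event log: "mba" shown four times and clicked once.
events = (
    [{"type": "bestbet_shown", "query": "mba"}] * 4
    + [{"type": "bestbet_click", "query": "mba"}]
    + [{"type": "bestbet_shown", "query": "library"}] * 2
)
ctr = best_bet_ctr(events)
```

A Best Bet that is shown often but rarely clicked is a candidate for a better link or title.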
Benefits

• Single point of access for various repositories

• Shortcomings of underlying tools overcome

• Better access to content from rest of Harvard

• Traffic boost to e-commerce site
Challenges


• Search is never done

• Complex permissions issues

• SERP design convergence

• SharePoint
Next Steps

• Tackling the mixed-mode situation
• Integration with taxonomies
• Search experience within HBS applications
• Faceted search where rich metadata is
  available
• Analytics feeding website design and
  vocabulary development
Conclusion


• Tactical, iterative approach enabled
  significant progress

• Implementing simpler features/tweaks may
  have higher impact

• Your existing search engine may have more
  gas in it than you realize
