SlideShare a Scribd company logo
Akshay Sura
Partner | Konabos Consulting
Sameer Maggon
Founder & Search Ninja | SearchStax
SOLRCLOUD: BEST PRACTICES FOR
SITECORE. DESIGN, BUILD AND DEVOPS
CONSIDERATIONS
Los Angeles
User Group
Los Angeles
User Group
Los Angeles
User Group
AGENDA
1. Get you familiar with Apache Solr
2. Sitecore > Azure vs. Apache Solr
3. Overview & Comparison of Solr Topologies
4. Architecting a Production Grade Solr & Zookeeper
5. Backups and Recovery in Solr
6. Cross Region Solr deployment
7. Demo of Solr-as-a-Service making it happen in 30 mins
8. Q&A
Los Angeles
User Group
Los Angeles
User Group
ABOUT AKSHAY SURA & SAMEER MAGGON
Sameer Maggon
• University of Southern California Alum –
Viterbi Engineering School (Masters in
Computer Science)
• Built platforms and teams for Enterprise
Search and Search-based Consumer
companies.
• Spent 15+ years working with Open Source
Search Engines (started with Lucene 1.1)
• Currently Work at SearchStax, Inc. –
Founded the company in 2015 and primary
focus is Product & Technology
Akshay Sura
• Six-time Sitecore MVP
• Founder of SUGCON North America
2015, SUGCON India 2018 &
2019, Unofficial Sitecore Training
and the Sitecore Slack
Los Angeles
User Group
Sitecore and Azure Search works great
until you don’t ask for much
Los Angeles
User Group
Los Angeles
User Group
COMMON AZURE
SEARCH
LIMITATIONS RE.
SITECORE
•High Frequency or Heavy Content Indexing Needs
•Performance Bottlenecks
•Large number of fields (Field Limitation)
•Multiple Languages
•Not Extensible – Lack of further customizability using
custom or community plugins (Content Extraction - PDF,
word, etc)
•As you scale up, cost increases drastically
•Not available in all regions – Does not meet Disaster
Recovery Compliance needs
•Feature gap
•Proprietary API
Los Angeles
User Group
Solr is the popular, blazing-fast, open source
enterprise search platform built on Apache
Lucene
Los Angeles
User Group
Los Angeles
User Group
ABOUT SOLR
• Trusted by 90%+ Fortune 500 Companies
• True Open Source with Strong Community
• Massive Scalability – used by Salesforce, Walmart, Iron Mountain, etc.
• Feature Rich
• Faceted Search, Fuzzy Search, Spellcheck, Geospatial, Multi-language, and more.
• Auto-Complete / Search Suggest
• Content Extraction from Filetypes
• Highly Customizable to specific needs
• Customizable Relevance Models
• Machine Learning Support
Los Angeles
User Group
Sitecore Practitioner’s experience with Sitecore
and Solr
Los Angeles
User Group
Los Angeles
User Group
WORLD-CLASS SOLR / SITECORE
IMPLEMENTATION CYCLE
• Topology
• Security
• Log management
• Monitoring
• Reporting
• Alerting
• Backup
• DR strategy
• Capacity planning strategy
Design
• Hi-availability Solr
• Hi-availability ZK
• Security at rest
• Security in transit
• Log management
• Backup management
• Monitoring system
• Reporting system
• Alerting system
• DR (cold, hot)
• Multi-region
• Autoscaling
• Integrate Sitecore with Solr • Incident management
• OS patch management
• Solr security patches
• ZK security patches
• AWS node replacements
• Cloud node maintenance
• Periodic vulnerability scanning
• Solr error / log triage & analysis
• Solr performance optimization
• Solr minor/major version upgrades
• Scale-up
• Scale-down
Build OperateIntegrate
Los Angeles
User Group
Los Angeles
User Group
SOLR TOPOLOGIES
• Standalone/SolrCloud Server
• Master-Slave
• High Availability SolrCloud
• SolrCloud with CDCR (Cross Data
Center Replication)
Los Angeles
User Group
Los Angeles
User Group
STANDALONE SERVER
OR SOLRCLOUD
• Zookeeper and Solr run on
same server
• Ideal for Non-Production
(Development or QA
Environments)
• Can be run in a “Cloud”
mode or “Standalone” mode
• No High Availability or Fault
Tolerance
Los Angeles
User Group
Los Angeles
User Group
SOLRCLOUD JOURNEY TO HIGH AVAILABILITY
• Zookeeper is single point of failure
Los Angeles
User Group
Los Angeles
User Group
SOLRCLOUD JOURNEY TO HIGH AVAILABILITY
• In Cloud Environments, IPs can
change and nodes will go down or
get recycled.
• Don’t want your application to
directly be tied to the individual
Solr nodes
Los Angeles
User Group
Los Angeles
User Group
SOLRCLOUD (HIGH-
AVAILABILITY)
• High-Availability Zookeeper
Ensemble
• High-Availability Solr Servers
(with replication)
• Load Balancer / App
Gateway
• Ability to Scale up without re-
architecting
Los Angeles
User Group
Los Angeles
User Group
BACKUPS AND RECOVERY: SOLRCLOUD
Los Angeles
User Group
Los Angeles
User Group
BACKUPS AND RECOVERY: SOLRCLOUD
• SolrCloud Backups requires shared
filesystem mounted on same path on all
nodes
• Sitecore developers responsibility or
implementation of:
­ backup schedules
­ retention logic and cleanup
­ backup/restore verification procedures
Shared filesystem,
mounted on all nodes Shared filesystem
Los Angeles
User Group
Los Angeles
User Group
SOLR TOPOLOGIES:
SOLRCLOUD WITH CDCR
• Highly Resilient (Entire Data
Center can become
unavailable without impacting
availability)
• Localized Traffic Routing is
possible
• Can be scaled to multiple
regions
Los Angeles
User Group
There is an easier way > Solr-as-a-Service
Los Angeles
User Group
Los Angeles
User Group
SEARCHSTAX
DEMO
Los Angeles
User Group
Los Angeles
User Group
THANK YOU – Q&A
Akshay Sura
as@kanabos.com
tw: @akshaysura13
Sameer Maggon
sameer@searchstax.com
tw: @maggon

More Related Content

PDF
Spc software lab ppt
PPTX
Getting started with Apache Spark
PPT
Evolving IGN’s New APIs with Scala
PDF
Solr Consistency and Recovery Internals - Mano Kovacs, Cloudera
PPTX
Gcp-LoadBalancers
PPTX
Solr consistency and recovery internals
PDF
How do Solr and Azure Search compare?
PDF
Apache Solr 5.0 and beyond
Spc software lab ppt
Getting started with Apache Spark
Evolving IGN’s New APIs with Scala
Solr Consistency and Recovery Internals - Mano Kovacs, Cloudera
Gcp-LoadBalancers
Solr consistency and recovery internals
How do Solr and Azure Search compare?
Apache Solr 5.0 and beyond

Similar to SolrCloud-Best Practices for Sitecore. Design, build, and devops considerations (20)

PDF
Scaling SolrCloud to a Large Number of Collections - Fifth Elephant 2014
PPTX
Scaling SolrCloud to a large number of Collections
PDF
SolrCloud Cluster management via APIs
PPTX
Webinar: Lucidworks Managed Search
PDF
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...
PPTX
EnterpriseSearch
PDF
Meet Solr For The Tirst Again
PDF
Deploying and managing Solr at scale
PDF
Ease of use in Apache Solr
PDF
Migrating from Azure Search to SearcStax
PDF
Solr @ eBay Kleinanzeigen
PPTX
Play Architecture, Implementation, Shiny Objects, and a Proposal
PDF
World Class Solr Power - in 30 minutes
KEY
Solr 101
PDF
Solr search engine with multiple table relation
PPTX
Architecting Multi-Cloud Applications - Myth or Reality?
PPTX
Laravel and SOLR
PDF
Automated Cluster Management and Recovery for Large Scale Multi-Tenant Sea...
PPTX
A practical introduction to Oracle NoSQL Database - OOW2014
PDF
Rapid prototyping with solr - By Erik Hatcher
Scaling SolrCloud to a Large Number of Collections - Fifth Elephant 2014
Scaling SolrCloud to a large number of Collections
SolrCloud Cluster management via APIs
Webinar: Lucidworks Managed Search
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...
EnterpriseSearch
Meet Solr For The Tirst Again
Deploying and managing Solr at scale
Ease of use in Apache Solr
Migrating from Azure Search to SearcStax
Solr @ eBay Kleinanzeigen
Play Architecture, Implementation, Shiny Objects, and a Proposal
World Class Solr Power - in 30 minutes
Solr 101
Solr search engine with multiple table relation
Architecting Multi-Cloud Applications - Myth or Reality?
Laravel and SOLR
Automated Cluster Management and Recovery for Large Scale Multi-Tenant Sea...
A practical introduction to Oracle NoSQL Database - OOW2014
Rapid prototyping with solr - By Erik Hatcher
Ad

More from Sameer Maggon (6)

PDF
Future of enterprise apps is open source and cloud computing
PDF
Securing Solr Search Data in the Cloud
PDF
Behind the Scenes of Multi-Cloud Solr-as-a-Service
PDF
Making search better by tracking & utilizing user search behavior
PDF
Building great mobile search with Productsy and CloudSearch
PPT
Local Search using Solr at YP.com
Future of enterprise apps is open source and cloud computing
Securing Solr Search Data in the Cloud
Behind the Scenes of Multi-Cloud Solr-as-a-Service
Making search better by tracking & utilizing user search behavior
Building great mobile search with Productsy and CloudSearch
Local Search using Solr at YP.com
Ad

Recently uploaded (20)

PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
cuic standard and advanced reporting.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Electronic commerce courselecture one. Pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Spectroscopy.pptx food analysis technology
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Network Security Unit 5.pdf for BCA BBA.
The AUB Centre for AI in Media Proposal.docx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
cuic standard and advanced reporting.pdf
Assigned Numbers - 2025 - Bluetooth® Document
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Per capita expenditure prediction using model stacking based on satellite ima...
NewMind AI Weekly Chronicles - August'25-Week II
Electronic commerce courselecture one. Pdf
Machine learning based COVID-19 study performance prediction
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Unlocking AI with Model Context Protocol (MCP)
Encapsulation_ Review paper, used for researhc scholars
The Rise and Fall of 3GPP – Time for a Sabbatical?
Reach Out and Touch Someone: Haptics and Empathic Computing
Spectroscopy.pptx food analysis technology

SolrCloud-Best Practices for Sitecore. Design, build, and devops considerations

  • 1. Akshay Sura Partner | Konabos Consulting Sameer Maggon Founder & Search Ninja | SearchStax SOLRCLOUD: BEST PRACTICES FOR SITECORE. DESIGN, BUILD AND DEVOPS CONSIDERATIONS Los Angeles User Group
  • 2. Los Angeles User Group Los Angeles User Group AGENDA 1. Get you familiar with Apache Solr 2. Sitecore > Azure vs. Apache Solr 3. Overview & Comparison of Solr Topologies 4. Architecting a Production Grade Solr & Zookeeper 5. Backups and Recovery in Solr 6. Cross Region Solr deployment 7. Demo of Solr-as-a-Service making it happen in 30 mins 8. Q&A
  • 3. Los Angeles User Group Los Angeles User Group ABOUT AKSHAY SURA & SAMEER MAGGON Sameer Maggon • University of Southern California Alum – Viterbi Engineering School (Masters in Computer Science) • Built platforms and teams for Enterprise Search and Search-based Consumer companies. • Spent 15+ years working with Open Source Search Engines (started with Lucene 1.1) • Currently Work at SearchStax, Inc. – Founded the company in 2015 and primary focus is Product & Technology Akshay Sura • Six-time Sitecore MVP • Founder of SUGCON North America 2015, SUGCON India 2018 & 2019, Unofficial Sitecore Training and the Sitecore Slack
  • 4. Los Angeles User Group Sitecore and Azure Search works great until you don’t ask for much
  • 5. Los Angeles User Group Los Angeles User Group COMMON AZURE SEARCH LIMITATIONS RE. SITECORE •High Frequency or Heavy Content Indexing Needs •Performance Bottlenecks •Large number of fields (Field Limitation) •Multiple Languages •Not Extensible – Lack of further customizability using custom or community plugins (Content Extraction - PDF, word, etc) •As you scale up, cost increases drastically •Not available in all regions – Does not meet Disaster Recovery Compliance needs •Feature gap •Proprietary API
  • 6. Los Angeles User Group Solr is the popular, blazing-fast, open source enterprise search platform built on Apache Lucene
  • 7. Los Angeles User Group Los Angeles User Group ABOUT SOLR • Trusted by 90%+ Fortune 500 Companies • True Open Source with Strong Community • Massive Scalability – used by Salesforce, Walmart, Iron Mountain, etc. • Feature Rich • Faceted Search, Fuzzy Search, Spellcheck, Geospatial, Multi-language, and more. • Auto-Complete / Search Suggest • Content Extraction from Filetypes • Highly Customizable to specific needs • Customizable Relevance Models • Machine Learning Support
  • 8. Los Angeles User Group Sitecore Practitioner’s experience with Sitecore and Solr
  • 9. Los Angeles User Group Los Angeles User Group WORLD-CLASS SOLR / SITECORE IMPLEMENTATION CYCLE • Topology • Security • Log management • Monitoring • Reporting • Alerting • Backup • DR strategy • Capacity planning strategy Design • Hi-availability Solr • Hi-availability ZK • Security at rest • Security in transit • Log management • Backup management • Monitoring system • Reporting system • Alerting system • DR (cold, hot) • Multi-region • Autoscaling • Integrate Sitecore with Solr • Incident management • OS patch management • Solr security patches • ZK security patches • AWS node replacements • Cloud node maintenance • Periodic vulnerability scanning • Solr error / log triage & analysis • Solr performance optimization • Solr minor/major version upgrades • Scale-up • Scale-down Build OperateIntegrate
  • 10. Los Angeles User Group Los Angeles User Group SOLR TOPOLOGIES • Standalone/SolrCloud Server • Master-Slave • High Availability SolrCloud • SolrCloud with CDCR (Cross Data Center Replication)
  • 11. Los Angeles User Group Los Angeles User Group STANDALONE SERVER OR SOLRCLOUD • Zookeeper and Solr run on same server • Ideal for Non-Production (Development or QA Environments) • Can be run in a “Cloud” mode or “Standalone” mode • No High Availability or Fault Tolerance
  • 12. Los Angeles User Group Los Angeles User Group SOLRCLOUD JOURNEY TO HIGH AVAILABILITY • Zookeeper is single point of failure
  • 13. Los Angeles User Group Los Angeles User Group SOLRCLOUD JOURNEY TO HIGH AVAILABILITY • In Cloud Environments, IPs can change and nodes will go down or get recycled. • Don’t want your application to directly be tied to the individual Solr nodes
  • 14. Los Angeles User Group Los Angeles User Group SOLRCLOUD (HIGH- AVAILABILITY) • High-Availability Zookeeper Ensemble • High-Availability Solr Servers (with replication) • Load Balancer / App Gateway • Ability to Scale up without re- architecting
  • 15. Los Angeles User Group Los Angeles User Group BACKUPS AND RECOVERY: SOLRCLOUD
  • 16. Los Angeles User Group Los Angeles User Group BACKUPS AND RECOVERY: SOLRCLOUD • SolrCloud Backups requires shared filesystem mounted on same path on all nodes • Sitecore developers responsibility or implementation of: ­ backup schedules ­ retention logic and cleanup ­ backup/restore verification procedures Shared filesystem, mounted on all nodes Shared filesystem
  • 17. Los Angeles User Group Los Angeles User Group SOLR TOPOLOGIES: SOLRCLOUD WITH CDCR • Highly Resilient (Entire Data Center can become unavailable without impacting availability) • Localized Traffic Routing is possible • Can be scaled to multiple regions
  • 18. Los Angeles User Group There is an easier way > Solr-as-a-Service
  • 19. Los Angeles User Group Los Angeles User Group SEARCHSTAX DEMO
  • 20. Los Angeles User Group Los Angeles User Group THANK YOU – Q&A Akshay Sura as@kanabos.com tw: @akshaysura13 Sameer Maggon sameer@searchstax.com tw: @maggon