SlideShare a Scribd company logo
Side by Side with
Solr and Elasticsearch
Radu GheorgheRafał Kuć
RaduRafał
LogseneLogsene
AgendaOverview
documents
queries
mapping
index&store
aggregations
percolations
scale out
searches
tools ecosystem
documents
schema
index&store
facets
scale out
searches
tools ecosystem
backupreplicate
{
"id": "4",
"url": "https://www.youtube.com/watch?v=IutoHcJT61k",
"title": "#bbuzz: Rafał Kuć: Battle of the Giants: Solr vs ElasticSearch, Round 2",
"uploaded_by": "newthinking communications",
"upload_date": "2013-06-19",
"views": 380,
"likes": 1,
"tags": ["elasticsearch", "solr", "lucene", "comparison"]
}
Let’s Index Videos
Examples available at:
https://github.com/sematext/berlin-buzzwords-samples/
Demo time: Start your engines!
hkcarworld.com
MappingSchema
schema.xml+... -> ZooKeeper
<schema name="
BerlinBuzzwords2014" version="1.5">
<fields>
<field name="id" type="string"
indexed="true" stored="true"
required="true"
multiValued="false" />
...
<field name="tags" type="string"
indexed="true" stored="true"
multiValued="true"/>
</fields>
...
</schema>
PUT -> /bbuzz/videos/_mapping
{
"videos": {
"_id": {
"path": "id"
},
"properties": {
...
"tags": {
"type": "string",
"index": "not_analyzed"
},
...
}
}
}
URI Request“q” Parameter
GET -> /solr/bbuzz/select
params -> q=elasticsearch
fl=*,score
...
<result name="response"
numFound="7" start="0">
<doc>
<float name="score">0.
44896343</float>
<str name="id">2</str>
<str name="url">
/watch?v=6QX5hXf_e7c</str>
<str name="title">Introduction
to Elasticsearch by Radu</str>
...
</doc>
...
GET -> /bbuzz/videos/_search
params -> q=elasticsearch
...
"hits" : [ {
"_index" : "bbuzz",
"_type" : "videos",
"_id" : "2",
"_score" : 0.26516503,
"_source" : {
"url": "/watch?v=6QX5hXf_e7c",
"title": "Introduction to Elasticsearch
by Radu",
...
Bool QueryBool Query
GET -> /solr/bbuzz/select
q=title:elasticsearch OR tags:logs
q=title:elasticsearch tags:logs
q.op=OR
GET -> /bbuzz/videos/_search
{
"query": {
"bool": {
"should": [
{
"match": {
"title": "elasticsearch"
}
},
{
"term": {
"tags": "logs"
...
PercolatorGrouping
GET -> /solr/bbuzz/select
q=elasticsearch
group=true
group.field=uploaded_by
PUT -> /bbuzz/.percolator/1
{
"query" : {
"term" : { "tags" : "elasticsearch" }
}
}
GET -> /bbuzz/videos/_percolate
{
"doc": {
"title": "Scaling Massive ES Clusters",
"tags": [ "elasticsearch", "scaling"]
}
}
HierarchiesHierarchies
names:
-> first: Rafał, last: Kuć
-> first: Radu, last: Gheorghe
nested (block join)
parent-child (query time join)
"names": [
{ "first": "Rafał", "last": "Kuć" },
{ "first": "Radu", "last": "Gheorghe" },
]
nested (block join)
parent-child
Rafał
Kuć
Radu
Gheorghe
2 names
⇐
Rafał
Kuć
Radu
Gheorghe
names
Rafał
Kuć
Radu
Gheorghe
2 names
⇐
Rafał
Kuć
Radu
Gheorghe
names
AggregationsFacets
facet=true
facet.field=tags
facet=true
facet.query=uploaded_by:
LuceneSolrRevolution
facet.query=uploaded_by:"
newthinking communications"
"aggregations" : {
"tags" : {
"terms" : { "field" : "tags" }
}
}
"aggregations": {
"uploader_count": {
"cardinality": {
"field": "uploaded_by"
}
}
}
Nesting AggsPivot Facets
facet=true
facet.pivot=tags,views
"aggregations" : {
"tags" : {
"terms" : { "field" : "tags" },
"aggregations": {
"dates": {
"date_histogram": {
"field": "upload_date",
"interval": "month",
"format" : "yyyy-MM"
}
}
}
}
}
Demo time: Graph all the things!
http://f1.thejournal.ie/media/2013/05/meatloaf-2.jpg
Stats APIsStats
JMX / Solr admin / clusterstate GET -> /_stats
"index_total" : 15118403,
"index_time" : "4.2h",
...
"query_total" : 41092,
"query_time" : "57.2m",
GET -> /_cluster/stats
"heap_used_in_bytes" : 83960392,
...
Backup
PUT -> /_snapshot/bbuzz
{
"type": "fs",
"settings": {
"location": "/mnt/bbuzz_backup"
}
}'
PUT -> /_snapshot/bbuzz/1
{
"indices": "bbuzz"
}'
POST -> /_snapshot/bbuzz/1/_restore"
Demo time: Scaling out
Apache Software Foundation
Contributors
Code
Mailing list
Elasticsearch
Contributors
Code
Mailing list
things to comeNew juicy
facet by function
https://issues.apache.
org/jira/browse/SOLR-1581
analytics component
https://issues.apache.
org/jira/browse/SOLR-5302
Solr as standalone application
5.0 - no general issue yet
top_hits aggregation
https://github.
com/elasticsearch/elasticsearch/pull/61
24
minumum_should_match on has_child
https://github.
com/elasticsearch/elasticsearch/issues/
6019
filters aggregation
https://github.
com/elasticsearch/elasticsearch/issues/
6118
most projects work well with either
many small differences, few show-stoppers
choose the best. for your use-case.
Want to work with both?
We’re hiring!
Worldwide
http://www.staff.amu.edu.pl/~zbzw/glob/glob.gif
Thank you!
Radu Gheorghe
@radu0gheorghe
Rafał Kuć
@kucrafal
Examples available at:
https://github.com/sematext/berlin-buzzwords-samples/
@sematext

More Related Content

PDF
Introduction to solr
PDF
Side by Side with Elasticsearch & Solr, Part 2
PDF
Large Scale Log Analytics with Solr (from Lucene Revolution 2015)
PDF
From Zero to Production Hero: Log Analysis with Elasticsearch (from Velocity ...
PDF
elasticsearch - advanced features in practice
PDF
Introduction to Elasticsearch
ODP
Cool bonsai cool - an introduction to ElasticSearch
PDF
Analyse your SEO Data with R and Kibana
Introduction to solr
Side by Side with Elasticsearch & Solr, Part 2
Large Scale Log Analytics with Solr (from Lucene Revolution 2015)
From Zero to Production Hero: Log Analysis with Elasticsearch (from Velocity ...
elasticsearch - advanced features in practice
Introduction to Elasticsearch
Cool bonsai cool - an introduction to ElasticSearch
Analyse your SEO Data with R and Kibana

What's hot (20)

PDF
Data Exploration with Elasticsearch
PDF
Introduction to Elasticsearch
PPT
Building a CRM on top of ElasticSearch
PDF
ElasticSearch in action
PDF
Elasticsearch for Data Analytics
PPTX
Solr vs. Elasticsearch - Case by Case
PPT
How ElasticSearch lives in my DevOps life
PPTX
Elasticsearch Distributed search & analytics on BigData made easy
PDF
Elasticsearch in 15 minutes
PDF
Simple search with elastic search
PDF
Workshop: Learning Elasticsearch
PPTX
Elk stack
PDF
Debugging and Testing ES Systems
PDF
Introduction to Elasticsearch
PDF
ElasticES-Hadoop: Bridging the world of Hadoop and Elasticsearch
PDF
Use Cases for Elastic Search Percolator
PDF
Machine Learning in a Twitter ETL using ELK
PPT
Solr and Elasticsearch, a performance study
PDF
Your Data, Your Search, ElasticSearch (EURUKO 2011)
PPTX
ElasticSearch - Introduction to Aggregations
Data Exploration with Elasticsearch
Introduction to Elasticsearch
Building a CRM on top of ElasticSearch
ElasticSearch in action
Elasticsearch for Data Analytics
Solr vs. Elasticsearch - Case by Case
How ElasticSearch lives in my DevOps life
Elasticsearch Distributed search & analytics on BigData made easy
Elasticsearch in 15 minutes
Simple search with elastic search
Workshop: Learning Elasticsearch
Elk stack
Debugging and Testing ES Systems
Introduction to Elasticsearch
ElasticES-Hadoop: Bridging the world of Hadoop and Elasticsearch
Use Cases for Elastic Search Percolator
Machine Learning in a Twitter ETL using ELK
Solr and Elasticsearch, a performance study
Your Data, Your Search, ElasticSearch (EURUKO 2011)
ElasticSearch - Introduction to Aggregations
Ad

Similar to Side by Side with Elasticsearch and Solr (20)

PDF
Elasticsearch first-steps
PPTX
Apache solr
PDF
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
PDF
KEYNOTE: Lucene / Solr road map
PDF
Solr: 4 big features
PPTX
Apache solr
PPTX
DrupalTour. Lviv — Apache solr. Advanced use cases (Artem Sylchuk, InternetDe...
PDF
Solr 3.1 and beyond
PDF
Solr Masterclass Bangkok, June 2014
PDF
Webinar: Solr 6 Deep Dive - SQL and Graph
PDF
Oslo Solr MeetUp March 2012 - Solr4 alpha
PDF
Data Engineering with Solr and Spark
KEY
Elasticsearch & "PeopleSearch"
PPTX
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
PPTX
Search Engine Building with Lucene and Solr (So Code Camp San Diego 2014)
PDF
Elasticsearch in 15 Minutes
PDF
PDF
Searching Billions of Product Logs in Real Time (Use Case)
PPTX
Apache Solr for begginers
Elasticsearch first-steps
Apache solr
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
KEYNOTE: Lucene / Solr road map
Solr: 4 big features
Apache solr
DrupalTour. Lviv — Apache solr. Advanced use cases (Artem Sylchuk, InternetDe...
Solr 3.1 and beyond
Solr Masterclass Bangkok, June 2014
Webinar: Solr 6 Deep Dive - SQL and Graph
Oslo Solr MeetUp March 2012 - Solr4 alpha
Data Engineering with Solr and Spark
Elasticsearch & "PeopleSearch"
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
Search Engine Building with Lucene and Solr (So Code Camp San Diego 2014)
Elasticsearch in 15 Minutes
Searching Billions of Product Logs in Real Time (Use Case)
Apache Solr for begginers
Ad

More from Sematext Group, Inc. (20)

PDF
Tweaking the Base Score: Lucene/Solr Similarities Explained
PDF
OOPs, OOMs, oh my! Containerizing JVM apps
PPTX
Is observability good for your brain?
PDF
Introducing log analysis to your organization
PPTX
Solr Search Engine: Optimize Is (Not) Bad for You
PDF
Solr on Docker - the Good, the Bad and the Ugly
PDF
Monitoring and Log Management for
PDF
Building Resilient Log Aggregation Pipeline with Elasticsearch & Kafka
PDF
Elasticsearch for Logs & Metrics - a deep dive
PDF
How to Run Solr on Docker and Why
PDF
Tuning Solr & Pipeline for Logs
PPTX
Running High Performance & Fault-tolerant Elasticsearch Clusters on Docker
PDF
Top Node.js Metrics to Watch
PPT
Running High Performance and Fault Tolerant Elasticsearch Clusters on Docker
PDF
Docker Logging Webinar
PDF
Docker Monitoring Webinar
PDF
Metrics, Logs, Transaction Traces, Anomaly Detection at Scale
PPTX
Tuning Elasticsearch Indexing Pipeline for Logs
PDF
Solr Anti Patterns
PDF
Tuning Solr for Logs
Tweaking the Base Score: Lucene/Solr Similarities Explained
OOPs, OOMs, oh my! Containerizing JVM apps
Is observability good for your brain?
Introducing log analysis to your organization
Solr Search Engine: Optimize Is (Not) Bad for You
Solr on Docker - the Good, the Bad and the Ugly
Monitoring and Log Management for
Building Resilient Log Aggregation Pipeline with Elasticsearch & Kafka
Elasticsearch for Logs & Metrics - a deep dive
How to Run Solr on Docker and Why
Tuning Solr & Pipeline for Logs
Running High Performance & Fault-tolerant Elasticsearch Clusters on Docker
Top Node.js Metrics to Watch
Running High Performance and Fault Tolerant Elasticsearch Clusters on Docker
Docker Logging Webinar
Docker Monitoring Webinar
Metrics, Logs, Transaction Traces, Anomaly Detection at Scale
Tuning Elasticsearch Indexing Pipeline for Logs
Solr Anti Patterns
Tuning Solr for Logs

Recently uploaded (20)

PDF
cuic standard and advanced reporting.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
PDF
Empathic Computing: Creating Shared Understanding
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PPT
Teaching material agriculture food technology
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
cuic standard and advanced reporting.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
The AUB Centre for AI in Media Proposal.docx
Reach Out and Touch Someone: Haptics and Empathic Computing
Unlocking AI with Model Context Protocol (MCP)
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Review of recent advances in non-invasive hemoglobin estimation
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
Empathic Computing: Creating Shared Understanding
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
MYSQL Presentation for SQL database connectivity
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
GamePlan Trading System Review: Professional Trader's Honest Take
Teaching material agriculture food technology
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Advanced methodologies resolving dimensionality complications for autism neur...
Network Security Unit 5.pdf for BCA BBA.
Understanding_Digital_Forensics_Presentation.pptx
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication

Side by Side with Elasticsearch and Solr