SlideShare a Scribd company logo
You’re not using
ElasticSearch?
Timon Vonk (@timonvonk)
About me
•

CTO @ Tolq

•

Freelance hacker and consultant

•

Used ElasticSearch as a solution for Translation
Memory, text author recommendation and as a
spam filter.
What is it?
•

Full text search engine based on Apache Lucene

•

Cloud in mind, built for speeed

•

Easy to use JSON api

•

A *lot* of room for custom solutions
Why use it (vs solr)?
•

Cloud based setup out of the box. Setting this up is
super easy. That means automagic replication,
sharding and bonus mapreduce.

•

Indexes happen in a few seconds (vs minutes in solr)

•

Again, easy Json API

•

Nosql, mappings can be generated on the fly

•

Well scriptable and customisable for fancy
aggregation or dynamic analysing
O M G W H AT I S
LUCENE
•

Java library for doing full text searches by
Apache

•

Just a library, by no means a solution in itself
(although you can)

•

More in depth, the search works via a terms
indexing algorithm. Scoring is not only based on
occurrence, but also on uniqueness
Making a query
•

You send json to a _search endpoint on either an index, with maybe
a type

This is a basic full text search with a response:
!

GET localhost:9200/example/peanuts/_search

{ ‘query’: { text: { ‘my_field’: ’many search terms’ }}}

{ took: 5, timed_out: false,

_shards: { total: 5, successful: 5, failed: 0 },

hits: [

{ _index: “example”,

_type: “peanuts”,

_score: 0.9,

_source: { …data }

}

]

}

}
Other types of queries

•

Terms, full text, boolean, fuzzy, geolocation and
lots more variants

•

You can also do filters to narrow down results
Analysing
•

Before data is indexed or queries are made its
analysed

•

At this point, terms are scored

•

But before that, the data is normalised. This
usually includes stemming and stop word
removal. With support for over 30 languages.
Woot!

•

You can create your own analysers.
Facet me awesome
•

Facets allow you to do aggregation over your
search.

{ “query”: … }
{ “facets”: {
“my_facets”: {
“terms”: { “name”: “utrechtrb” }
}
}
}

Facets will be deprecated in 1.0 in favour of ‘aggregations’
Ruby libraries
•

Good old Tire

deprecated

•

Stretcher!

•

Or just a json client
H o w To l q u s e s
ElasticSearch
•

We use ElasticSearch as a Translation Memory.
In our case, that means we suggest other
relevant translations while a translator is
translating.

•

By storing original texts and dynamically
adjusting the analyser to the correct languages,
we can suggest similar translations.
Tolq code was here
Tolq code was here
Tolq code was here
Also, it works great for
text search!

Thanks!

More Related Content

PDF
Solr: 4 big features
PDF
How Solr Search Works
PPT
Elastic search apache_solr
PPTX
Scaling Analytics with elasticsearch
PDF
Introduction to Apache Solr
PDF
Using elasticsearch with rails
PPTX
Battle of the giants: Apache Solr vs ElasticSearch
PPT
Solr and Elasticsearch, a performance study
Solr: 4 big features
How Solr Search Works
Elastic search apache_solr
Scaling Analytics with elasticsearch
Introduction to Apache Solr
Using elasticsearch with rails
Battle of the giants: Apache Solr vs ElasticSearch
Solr and Elasticsearch, a performance study

What's hot (20)

PPTX
Introduction to Apache Solr
PPTX
Enterprise Search Using Apache Solr
PDF
Data Exploration with Elasticsearch
PDF
Solr Recipes
PDF
Introduction to Solr
PDF
Apache Solr crash course
PPTX
Battle of the Giants - Apache Solr vs. Elasticsearch (ApacheCon)
PPTX
Introduction to Apache Lucene/Solr
PPTX
20130310 solr tuorial
PDF
it's just search
PPTX
Apache Solr
PDF
Apache Solr/Lucene Internals by Anatoliy Sokolenko
PPTX
Intro to Apache Lucene and Solr
ODP
Introduction to Apache solr
PPTX
Battle of the Giants round 2
PPTX
What You Missed in Computer Science
PDF
Building your own search engine with Apache Solr
PDF
Modernizing WordPress Search with Elasticsearch
PDF
Integrating the Solr search engine
PPTX
Hibernate Tips ‘n’ Tricks - 15 Tips to solve common problems
Introduction to Apache Solr
Enterprise Search Using Apache Solr
Data Exploration with Elasticsearch
Solr Recipes
Introduction to Solr
Apache Solr crash course
Battle of the Giants - Apache Solr vs. Elasticsearch (ApacheCon)
Introduction to Apache Lucene/Solr
20130310 solr tuorial
it's just search
Apache Solr
Apache Solr/Lucene Internals by Anatoliy Sokolenko
Intro to Apache Lucene and Solr
Introduction to Apache solr
Battle of the Giants round 2
What You Missed in Computer Science
Building your own search engine with Apache Solr
Modernizing WordPress Search with Elasticsearch
Integrating the Solr search engine
Hibernate Tips ‘n’ Tricks - 15 Tips to solve common problems
Ad

Viewers also liked (12)

PDF
Discussion 2
DOC
Internet importance
PPTX
BENEFITS OF JOINING THE VIP CLUB
PPT
презентация1
PPT
Jadual kedatangan© ppt
PPTX
Color code-presentation
PPTX
Symbiosis SIMCB Bangalore New media and pr
PPTX
SHP Listings Guide
PPTX
DOC
Importance of the internet
Discussion 2
Internet importance
BENEFITS OF JOINING THE VIP CLUB
презентация1
Jadual kedatangan© ppt
Color code-presentation
Symbiosis SIMCB Bangalore New media and pr
SHP Listings Guide
Importance of the internet
Ad

Similar to You're not using ElasticSearch (outdated) (20)

PPTX
ElasticSearch in Production: lessons learned
ODP
Elastic Search
PPTX
BigData Search Simplified with ElasticSearch
PPTX
Elasticsearch workshop presentation
PPTX
Elastic pivorak
PDF
Elasticsearch and Spark
ODP
Elasticsearch V/s Relational Database
PDF
Elasticsearch Introduction at BigData meetup
PDF
Elasticsearch
PDF
Elasticsearch
PDF
Introduction to Elasticsearch
PDF
Intro to Elaticsearch - Elasticsearch Bucharest Group @ Softbinator
PDF
ElasticSearch - index server used as a document database
PPTX
Elasticsearch, Logstash, Kibana. Cool search, analytics, data mining and more...
PPTX
Elasticsearch
PDF
Voxpopme - Elasticsearch Service
PPTX
Introduction to Elasticsearch
PPT
How ElasticSearch lives in my DevOps life
PDF
Elasticsearch at EyeEm
PDF
Explore Elasticsearch and Why It’s Worth Using
ElasticSearch in Production: lessons learned
Elastic Search
BigData Search Simplified with ElasticSearch
Elasticsearch workshop presentation
Elastic pivorak
Elasticsearch and Spark
Elasticsearch V/s Relational Database
Elasticsearch Introduction at BigData meetup
Elasticsearch
Elasticsearch
Introduction to Elasticsearch
Intro to Elaticsearch - Elasticsearch Bucharest Group @ Softbinator
ElasticSearch - index server used as a document database
Elasticsearch, Logstash, Kibana. Cool search, analytics, data mining and more...
Elasticsearch
Voxpopme - Elasticsearch Service
Introduction to Elasticsearch
How ElasticSearch lives in my DevOps life
Elasticsearch at EyeEm
Explore Elasticsearch and Why It’s Worth Using

Recently uploaded (20)

PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
Teaching material agriculture food technology
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
Big Data Technologies - Introduction.pptx
PDF
KodekX | Application Modernization Development
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
“AI and Expert System Decision Support & Business Intelligence Systems”
Per capita expenditure prediction using model stacking based on satellite ima...
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
The AUB Centre for AI in Media Proposal.docx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Teaching material agriculture food technology
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Machine learning based COVID-19 study performance prediction
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
NewMind AI Monthly Chronicles - July 2025
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Advanced methodologies resolving dimensionality complications for autism neur...
Diabetes mellitus diagnosis method based random forest with bat algorithm
20250228 LYD VKU AI Blended-Learning.pptx
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Big Data Technologies - Introduction.pptx
KodekX | Application Modernization Development
Unlocking AI with Model Context Protocol (MCP)
MYSQL Presentation for SQL database connectivity
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...

You're not using ElasticSearch (outdated)

  • 2. About me • CTO @ Tolq • Freelance hacker and consultant • Used ElasticSearch as a solution for Translation Memory, text author recommendation and as a spam filter.
  • 3. What is it? • Full text search engine based on Apache Lucene • Cloud in mind, built for speeed • Easy to use JSON api • A *lot* of room for custom solutions
  • 4. Why use it (vs solr)? • Cloud based setup out of the box. Setting this up is super easy. That means automagic replication, sharding and bonus mapreduce. • Indexes happen in a few seconds (vs minutes in solr) • Again, easy Json API • Nosql, mappings can be generated on the fly • Well scriptable and customisable for fancy aggregation or dynamic analysing
  • 5. O M G W H AT I S LUCENE • Java library for doing full text searches by Apache • Just a library, by no means a solution in itself (although you can) • More in depth, the search works via a terms indexing algorithm. Scoring is not only based on occurrence, but also on uniqueness
  • 6. Making a query • You send json to a _search endpoint on either an index, with maybe a type This is a basic full text search with a response: ! GET localhost:9200/example/peanuts/_search
 { ‘query’: { text: { ‘my_field’: ’many search terms’ }}} { took: 5, timed_out: false,
 _shards: { total: 5, successful: 5, failed: 0 },
 hits: [
 { _index: “example”,
 _type: “peanuts”,
 _score: 0.9,
 _source: { …data }
 }
 ]
 }
 }
  • 7. Other types of queries • Terms, full text, boolean, fuzzy, geolocation and lots more variants • You can also do filters to narrow down results
  • 8. Analysing • Before data is indexed or queries are made its analysed • At this point, terms are scored • But before that, the data is normalised. This usually includes stemming and stop word removal. With support for over 30 languages. Woot! • You can create your own analysers.
  • 9. Facet me awesome • Facets allow you to do aggregation over your search.
 { “query”: … } { “facets”: { “my_facets”: { “terms”: { “name”: “utrechtrb” } } } } Facets will be deprecated in 1.0 in favour of ‘aggregations’
  • 10. Ruby libraries • Good old Tire
 deprecated • Stretcher! • Or just a json client
  • 11. H o w To l q u s e s ElasticSearch • We use ElasticSearch as a Translation Memory. In our case, that means we suggest other relevant translations while a translator is translating. • By storing original texts and dynamically adjusting the analyser to the correct languages, we can suggest similar translations.
  • 15. Also, it works great for text search! Thanks!