AI from your data lake
Using Solr for analytics
Who we are
Cassandra Targett
Lucene/Solr Committer & PMC
Director of Engineering at
Lucidworks
Solr and HDP Search
Development
Marcelline Saunders
Director of Global Partner
Enablement at Lucidworks
Lucidworks is the primary sponsor
of the Apache Solr project
Employs over 40% of the active
committers on the Solr project
Contributes over 70% of Solr's
open source codebase
Based in San Francisco
Offices in Bangalore, Bangkok,
New York City, London, Raleigh
Over 400 customers across the
Fortune 1000
Fusion, a Solr-powered platform
for search-driven apps
Consulting and support for
organizations using Solr
Produces the world’s largest open source user conference for Lucene/Solr (now also AI!)
Visit activate-conf.com for more information & to register
About Solr
Solr is the most popular search engine available today
Built on Lucene
Open source
Scalable
Distributed
Flexible
Extensible
Search Features:
● Admin UI
● Facets
● Hit highlights
● Multiple languages
● Spell check, auto-complete
What is HDP Search?
Developed by Lucidworks
Built & Distributed by
Hortonworks
Add-on package for HDP, which
includes:
● Apache Solr
● HDFS, Hive and Pig Connectors
● Ambari MPack for Solr
● Banana
● Documentation
HDP Search (architecture diagram: SerDe, Job Jar, Data Files)
AI Features in Solr
● Streaming Expressions
○ Math programming syntax
○ Train regression models
○ Classify results of a search
○ Parallel processing
○ Graph Traversal
○ Parallel SQL
● Learning-to-Rank
● Analytics Component
Streaming Expressions
Powerful stream processing language for Solr
● Suite of functions to query,
transform, and aggregate your
data
● Functions can be nested to
perform multiple tasks in one
request
● Work across your entire
dataset
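The nesting described above can be sketched in a few lines of Python. This is only an illustration: the `expr` helper is hypothetical (it is not part of any Solr client), and the host, port, and `techproducts` collection are assumptions. What it relies on from Solr is that a streaming expression is sent as the `expr` parameter to a collection's `/stream` handler.

```python
from urllib.parse import urlencode


def expr(name, *args, **params):
    """Render one streaming-expression function call as a string.

    Positional args are emitted as-is (a collection name or a nested
    expression); keyword params are emitted as name="value" pairs.
    This helper is hypothetical and only builds text.
    """
    parts = [str(a) for a in args]
    parts += [f'{key}="{value}"' for key, value in params.items()]
    return f"{name}({', '.join(parts)})"


# A stream source (search) nested inside a stream decorator (unique).
inner = expr("search", "techproducts", q="*:*", fl="id,color", sort="color asc")
nested = expr("unique", inner, over="color")

# Assumed local Solr address; the expression travels in the `expr` parameter.
request_url = "http://localhost:8983/solr/techproducts/stream?" + urlencode({"expr": nested})

print(nested)
```

Because the whole pipeline is one string, a single request can carry the query, the transformation, and the aggregation together.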
What Can You Do?
● Request/response stream processing
● Batch stream processing
● Fast interactive MapReduce
● Aggregations (pushed down faceted and shuffling MapReduce)
● Parallel relational algebra (distributed joins, intersections, unions, complements)
● Publish/subscribe messaging
● Distributed graph traversal
● Machine learning and parallel iterative model training
● Anomaly detection
● Recommendation systems
● Retrieve and rank services
● Text classification and feature extraction
● Streaming NLP
● Build your own!
Stream Sources
output -> tuples
Stream Sources originate streams (of tuples).
● search
● jdbc
● echo
● facet
● features
● nodes
● knn
● model
● random
● significantTerms
● shortestPath
● shuffle
● stats
● timeseries
● train
● topic
● tuple
Stream Decorators
input -> tuples
output -> tuples
● cartesianProduct
● classify
● commit
● complement
● daemon
● eval
● executor
● fetch
● having
● leftOuterJoin
● hashJoin
● innerJoin
● intersect
● merge
● null
● outerHashJoin
● parallel
● priority
● reduce
● rollup
● scoreNodes
● select
● sort
● top
● unique
● update
Stream Decorators wrap other stream functions or
perform operations on a stream (of tuples).
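To make the source/decorator split concrete, here is a minimal in-memory sketch using Python generators. It is not Solr code: `search`, `unique`, and `top` below are simplified stand-ins named after the functions in the lists above, and the sample documents are invented for illustration.

```python
def search(docs, sort_field):
    """Stand-in stream source: emits tuples (dicts) sorted by one field."""
    yield from sorted(docs, key=lambda doc: doc[sort_field])


def unique(stream, over):
    """Stand-in decorator: keeps the first tuple for each value of `over`.

    It assumes the incoming stream is already sorted on `over`.
    """
    previous = object()  # sentinel that never equals a field value
    for tup in stream:
        if tup[over] != previous:
            yield tup
        previous = tup[over]


def top(stream, n, key):
    """Stand-in decorator: emits the n tuples with the largest `key` value."""
    yield from sorted(stream, key=lambda tup: tup[key], reverse=True)[:n]


# Invented sample documents.
docs = [
    {"id": 1, "color": "red", "price": 5},
    {"id": 2, "color": "blue", "price": 9},
    {"id": 3, "color": "red", "price": 7},
    {"id": 4, "color": "green", "price": 3},
]

# Decorators wrap a source (or another decorator) and emit tuples again.
result = list(top(unique(search(docs, "color"), over="color"), n=2, key="price"))
print([tup["id"] for tup in result])
```

Each stage consumes tuples and yields tuples, which is why decorators can be stacked in any depth.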
Stream Evaluators
input -> parameter (possibly from a field in a tuple)
output -> parameter (possibly from a field in a tuple)
● analyze
● abs
● add
● div
● log
● mult
● sub
● pow
● mod
● ceil
● floor
● sin
● asin
● sinh
● cos
● acos
● atan
● round
● sqrt
● cbrt
● and
● eq
● eor
● gteq
● gt
● if
● lteq
● lt
● not
● or
● raw
● sample
Stream Evaluators are functions that evaluate
parameters and return a result. These can be used
to transform values inside the tuples in a streaming
expression, or can be used independently.
● regress
● predict
● standardize
● distance
● kmeans
● timeseries
● monteCarlo
● cumulativeProbability
● betaDistribution
● termVectors
● matrix
● rowCount
● mean
● describe
● percentile
● cov
...and many MORE
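As a rough illustration of how evaluators take parameters (literals or tuple fields) and return a value, the sketch below evaluates a small nested expression in plain Python. The `EVALUATORS` table and the `evaluate` function are hypothetical and cover only a handful of the names listed above. Unlike a real engine, both branches of `if` are evaluated eagerly here.

```python
import math

# Hypothetical table of a few evaluators from the lists above.
EVALUATORS = {
    "add": lambda a, b: a + b,
    "mult": lambda a, b: a * b,
    "sqrt": math.sqrt,
    "gt": lambda a, b: a > b,
    "if": lambda cond, then, otherwise: then if cond else otherwise,
}


def evaluate(node, fields):
    """Evaluate a nested (name, arg, ...) expression against one tuple's fields."""
    if isinstance(node, tuple):
        name, *args = node
        return EVALUATORS[name](*(evaluate(arg, fields) for arg in args))
    if isinstance(node, str):
        return fields[node]  # a string is treated as a field name
    return node  # anything else is a literal


# if(gt(price, 10), mult(price, 0.5), add(price, 1))
discount = ("if", ("gt", "price", 10), ("mult", "price", 0.5), ("add", "price", 1))

print(evaluate(discount, {"price": 20}))
print(evaluate(discount, {"price": 4}))
```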
Streaming Expression Examples
● Parallel Batch Processing
● Train a Logistic Regression Model
● Distributed Joins
● Pull Results from External Database
● Classify Search Results
● Rapid Export of all Search Results
Sources:
https://lucene.apache.org/solr/guide/streaming-expressions.html
http://joelsolr.blogspot.com/2016/10/solr-63-batch-jobs-parallel-etl-and.html
Parallel SQL
● SQL interface for writing streaming expressions
● Statements are parsed to proper streaming expression syntax
● Supports a basic SQL syntax: SELECT, WHERE, ORDER BY,
LIMIT, etc.
rollup(
  search(techproducts, q="*:*", fl="id,color", sort="color asc"),
  over="color", count(*))
SELECT count(*) FROM techproducts
WHERE _text_='(*:*)' GROUP BY color
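What the rollup expression above computes can be mimicked in memory. The sketch below is an assumption-laden stand-in, not Solr's implementation: `rollup_count` is a hypothetical name, the documents are invented, and, like the expression, it expects tuples already sorted on the `over` field.

```python
from itertools import groupby


def rollup_count(stream, over):
    """Emit one tuple per distinct `over` value with a count(*) of its members.

    groupby only merges adjacent tuples, so the input must be sorted on `over`,
    which is the role of sort="color asc" in the search above.
    """
    for value, group in groupby(stream, key=lambda tup: tup[over]):
        yield {over: value, "count(*)": sum(1 for _ in group)}


# Invented documents, already sorted by color.
sorted_docs = [
    {"id": 2, "color": "blue"},
    {"id": 4, "color": "green"},
    {"id": 1, "color": "red"},
    {"id": 3, "color": "red"},
]

counts = list(rollup_count(sorted_docs, over="color"))
print(counts)
```

The same grouping is what the GROUP BY statement asks for, which is why one form can be translated into the other.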
Graph Traversal
● Part of Solr’s broader Streaming
Expressions capability
● Implements a powerful, breadth-first
traversal
● Works across shards AND collections
● Supports aggregations
● Cycle aware
● Ability to both traverse AND score
nodes within the graph
Graph Traversal - Syntax
All movies that user "trey" watched
gatherNodes(movielens,walk="trey->user_name_s",gather="movie_id_i")
All movies that viewers of a specific movie watched
gatherNodes(movielens,
gatherNodes(movielens,walk="123->movie_id_i",gather="user_id_i"),
walk="node->user_id_i",gather="movie_id_i", trackTraversal="true"
)
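The two-hop traversal above can be sketched over an in-memory list of documents. This is a simplified illustration with invented data: `gather_nodes` is a hypothetical helper that performs one breadth-first hop, and it does not reproduce the cycle awareness, scoring, or `trackTraversal` output of the real function, so the starting movie shows up again in the final result.

```python
def gather_nodes(docs, start_values, walk_field, gather_field):
    """One breadth-first hop: match docs on walk_field, collect gather_field values."""
    start = set(start_values)
    return {doc[gather_field] for doc in docs if doc[walk_field] in start}


# Invented "movielens" documents: one document per (user, movie) viewing.
movielens = [
    {"user_id_i": 1, "movie_id_i": 123},
    {"user_id_i": 1, "movie_id_i": 200},
    {"user_id_i": 2, "movie_id_i": 123},
    {"user_id_i": 2, "movie_id_i": 300},
    {"user_id_i": 3, "movie_id_i": 400},
]

# Hop 1: users who watched movie 123 (the inner gatherNodes).
viewers = gather_nodes(movielens, {123}, "movie_id_i", "user_id_i")

# Hop 2: every movie those users watched (the outer gatherNodes).
movies = gather_nodes(movielens, viewers, "user_id_i", "movie_id_i")

print(sorted(viewers), sorted(movies))
```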
Graph Traversal - Use Cases
• Anomaly detection /
fraud detection
• Recommenders
• Social network analysis
• Graph Search
• Access Control
• Relationship discovery / scoring
Examples
o Find all draft blog posts about “Parallel SQL”
written by a developer
o Find all tweets mentioning “Solr” by me or people
I follow
o Find 3-star hotels in NYC my friends stayed in
last year
Learning to Rank (LTR)
Rank query results based on trained models
Traditional relevance ranking uses algorithms that score how well the terms in a user query match the terms in a document (TF/IDF, BM25)
LTR allows you to rank results for user queries according to trained
models stored in Solr (trained outside Solr)
Factors for training data:
● Implicit: clicks, time spent on page, historical sales, previously
viewed documents
● Explicit: human judgement
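A minimal sketch of the re-ranking idea, assuming a linear model whose weights were trained elsewhere. The feature names, weights, and documents are invented, and `rerank` is a hypothetical function; it only illustrates re-scoring the top of a result list with a model while leaving the tail in its original order.

```python
def rerank(candidates, weights, top_n):
    """Re-score the first top_n candidates with a linear model; keep the rest as-is."""

    def model_score(doc):
        # Weighted sum of the document's feature values.
        return sum(weights[name] * doc["features"][name] for name in weights)

    head = sorted(candidates[:top_n], key=model_score, reverse=True)
    return head + candidates[top_n:]


# Invented candidates, already ordered by a traditional relevance score.
candidates = [
    {"id": "A", "features": {"bm25": 3.0, "clicks": 0.1}},
    {"id": "B", "features": {"bm25": 2.5, "clicks": 0.9}},
    {"id": "C", "features": {"bm25": 2.0, "clicks": 0.2}},
    {"id": "D", "features": {"bm25": 1.0, "clicks": 1.0}},
]

# Hypothetical weights, standing in for a model trained outside the search engine.
weights = {"bm25": 1.0, "clicks": 2.0}

reranked = rerank(candidates, weights, top_n=3)
print([doc["id"] for doc in reranked])
```

Here the implicit signal (clicks) lifts document B above A, while D stays last because it falls outside the re-ranked window.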
Analytics Component
Calculate complex statistical aggregations over result sets.
Expressions, functions and groupings of data from your documents:
● Expressions: calculations to perform over the result set to
return a single value
● Functions: variables re-used in expressions or groupings
● Groupings: facets, which can include functions or expressions
neg, round, ceil, if, gt, lt, add, sub, div, sum, count,
unique, percentile, date, concat, log, pow, mean, min, max
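The relationship between expressions and groupings can be illustrated in plain Python. This is a conceptual sketch, not the Analytics Component's request format: `analyze` is a hypothetical function, the documents are invented, and each expression is simply a callable that reduces a result set to one value.

```python
from collections import defaultdict
from statistics import mean


def analyze(docs, expressions, group_by=None):
    """Apply each expression to the result set, optionally once per facet value."""
    if group_by is None:
        return {name: fn(docs) for name, fn in expressions.items()}
    groups = defaultdict(list)
    for doc in docs:
        groups[doc[group_by]].append(doc)
    return {value: analyze(members, expressions) for value, members in groups.items()}


# Each expression reduces a list of documents to a single value.
expressions = {
    "count": len,
    "mean_price": lambda ds: mean(d["price"] for d in ds),
    "max_price": lambda ds: max(d["price"] for d in ds),
}

# Invented result set.
docs = [
    {"color": "red", "price": 5},
    {"color": "red", "price": 7},
    {"color": "blue", "price": 9},
]

overall = analyze(docs, expressions)
by_color = analyze(docs, expressions, group_by="color")
print(overall, by_color)
```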
Tools for Analytics & Visualization
Search Driven Analytics
Motivation
- Go beyond full text search
- Self-service exploration of data
- Provide tools for analysts to mine data without having to
understand query languages
- Create views of data for users
Why SQL with Search?
● Known query language
● Eliminates re-training users on proprietary tools and query
languages
● Third party BI tools use JDBC/ODBC
● Leverage powerful full text search
● Join Solr collections
● Join Solr collections with other data sources
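Besides JDBC, a SQL statement can be sent to Solr over HTTP. The sketch below only builds the request and does not send it. The host, port, collection, and statement are assumptions, and `build_sql_request` is a hypothetical helper; what it relies on is that a collection's `/sql` handler accepts the statement in a `stmt` parameter.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_sql_request(base_url, collection, statement):
    """Build (but do not send) a POST to a collection's /sql handler."""
    body = urlencode({"stmt": statement}).encode("utf-8")
    return Request(f"{base_url}/{collection}/sql", data=body, method="POST")


# Assumed local Solr address and an illustrative statement.
request = build_sql_request(
    "http://localhost:8983/solr",
    "techproducts",
    "SELECT color, count(*) FROM techproducts GROUP BY color",
)

print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would execute it against a running Solr instance.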
Analytics Visualization tools
Banana (available with HDP Search)
Solr 6.0 + (Solr SQL)
- Apache Zeppelin
Lucidworks Fusion (Spark SQL - Solr SQL)
- Tableau
- Apache Zeppelin
- Jupyter
- Any third party product that supports JDBC/ODBC
Lucidworks Fusion App Insights
Banana Dashboards
Provided with HDP
Search
Easily create
dashboards for a Solr
collection
Based on facet queries
Requires basic
knowledge of Solr
Banana Dashboards
Zeppelin Integration
FusionSQL - Using Spark and Solr together
Tableau: Solr Collections look like tables
Join across Solr Collections
Fusion - Tableau: Self Service BI/Analytics
Tableau Analytics
SQL Benefits with Solr/Fusion
• Leverage existing BI tools like Tableau and Zeppelin
• Add full-text search and advanced Solr AI features to your SQL query
• Ranking by relevance
• Joins across collections
• Fast and responsive queries at scale
• Ask interesting questions of your data
Fusion App Insights
• Customizable dashboards to visualize
Query Analytics.
• Built in Analytics reports based on
Fusion AI Smart jobs for analyzing query
performance.
• Experiment analysis to give you
feedback on how search variants are
performing.
• Thorough analytics on users, sessions,
and all interactions (signals)
Resources
Solr Reference Guide:
● Streaming Expressions: https://lucene.apache.org/solr/guide/streaming-expressions.html
● Setting up Solr to be used with generic SQL clients: https://lucene.apache.org/solr/guide/7_3/parallel-sql-interface.html#generic-clients
● Solr and Apache Zeppelin: https://lucene.apache.org/solr/guide/7_3/solr-jdbc-apache-zeppelin.html#solr-jdbc-apache-zeppelin
Lucidworks Fusion (Solr SQL and Spark SQL) - setting up Tableau: https://lucidworks.com/2017/02/01/sql-in-fusion-3/
Tech at Bloomberg: The search for Solr analytics: https://www.techatbloomberg.com/blog/the-search-for-solr-analytics/
Questions?
