SlideShare a Scribd company logo
solr-fusionLeander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
● development ist part of the SE-project finc
● hired a company to develop solr-fusion
● everything is Open Source, from the beginning
(see the github link above)
https://github.com/outermedia/solr-fusion
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
Sharding, req. same Schema
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
NO Sharding, with different schemas
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
NO Sharding, with different schemas
solr-fusion
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
1. receive solr-query from the application
2. translate query for each server/schema
3. collect results from servers
4. recalculate score values
5. probably some data manipulation
6. merge results
7. send merged result-list back to application
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
Concerns. What about...
● heterogeneous data quality?
● accurate relevance calculation?
● different metadata types?
● …
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
Concerns. What about...
● heterogeneous data quality?
● accurate relevance calculation?
● different metadata types?
● …
Yes, it is an experiment.
Good enough is just fine.
Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de
https://github.com/outermedia/solr-fusion
Please feel free to join the development!
http://finc.info
seige@ub.uni-leipzig.de

More Related Content

PDF
Build your own discovery index of scholary e-resources
PDF
Batch import of large RDF datasets into Semantic MediaWiki
PDF
Hooking up Semantic MediaWiki with external tools via SPARQL
PDF
Git for Excel
PDF
Introduction to go, and why it's awesome
PPTX
Introducing .NET Core Open Source
PPTX
Code4 lib 20141129 python
Build your own discovery index of scholary e-resources
Batch import of large RDF datasets into Semantic MediaWiki
Hooking up Semantic MediaWiki with external tools via SPARQL
Git for Excel
Introduction to go, and why it's awesome
Introducing .NET Core Open Source
Code4 lib 20141129 python

What's hot (11)

PDF
Designing RESTful APIs
PPTX
Everyday Tools for the Semantic Web Developer
PPTX
Cpu.ppt INTRODUCTION TO “C”
PPTX
How to debug machine learning call stacks
PDF
Transparency6
 
PDF
Ian huston getting started with cloud foundry
PPTX
Prototyping the internet of things with Node-RED
PPTX
Hadoop summit 2016
PDF
ログ収集プラットフォーム開発におけるElasticsearchの運用
PPTX
Python Pants Build System for Large Codebases
PPTX
InChI Resolver
Designing RESTful APIs
Everyday Tools for the Semantic Web Developer
Cpu.ppt INTRODUCTION TO “C”
How to debug machine learning call stacks
Transparency6
 
Ian huston getting started with cloud foundry
Prototyping the internet of things with Node-RED
Hadoop summit 2016
ログ収集プラットフォーム開発におけるElasticsearchの運用
Python Pants Build System for Large Codebases
InChI Resolver
Ad

Viewers also liked (20)

PPTX
Educational technology plan
PDF
What's the worst that can happen #Lascot14 #LKCE14 2014
DOCX
Rhashida a
PDF
Its undergraduate-6734-2204109604-presentasi
PPTX
Contour shoe drawing
DOC
Mom Shap Tips
PDF
LeanUX 2015 talk: From Connection to Value Network
PPTX
United states Historical Portraits
PPTX
Quick call
PPTX
CEER 2012 Math Lecture
PDF
Math-tanong CEER 2012 - Set 1 Solutions
PDF
Sentidos produzidos sobre as TIC no Proinfantil
PDF
PODEVOIP Empresa
PPT
El uso de internet en la educación
ODP
Nos presentmos el maderal 2012 13
PPT
Summer camp
PDF
Glennie Website Solutions
PDF
Seige arndt-lightning talk swib13
PDF
Marpol regs4ships
Educational technology plan
What's the worst that can happen #Lascot14 #LKCE14 2014
Rhashida a
Its undergraduate-6734-2204109604-presentasi
Contour shoe drawing
Mom Shap Tips
LeanUX 2015 talk: From Connection to Value Network
United states Historical Portraits
Quick call
CEER 2012 Math Lecture
Math-tanong CEER 2012 - Set 1 Solutions
Sentidos produzidos sobre as TIC no Proinfantil
PODEVOIP Empresa
El uso de internet en la educación
Nos presentmos el maderal 2012 13
Summer camp
Glennie Website Solutions
Seige arndt-lightning talk swib13
Marpol regs4ships
Ad

Recently uploaded (20)

PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
1. Introduction to Computer Programming.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Empathic Computing: Creating Shared Understanding
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Mushroom cultivation and it's methods.pdf
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
Unlocking AI with Model Context Protocol (MCP)
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Encapsulation theory and applications.pdf
1. Introduction to Computer Programming.pptx
Machine learning based COVID-19 study performance prediction
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
cloud_computing_Infrastucture_as_cloud_p
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Diabetes mellitus diagnosis method based random forest with bat algorithm
Group 1 Presentation -Planning and Decision Making .pptx
Univ-Connecticut-ChatGPT-Presentaion.pdf
Approach and Philosophy of On baking technology
Digital-Transformation-Roadmap-for-Companies.pptx
NewMind AI Weekly Chronicles - August'25-Week II
Empathic Computing: Creating Shared Understanding
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Mushroom cultivation and it's methods.pdf
SOPHOS-XG Firewall Administrator PPT.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Spectral efficient network and resource selection model in 5G networks

Solr fusion lt elag2014

  • 1. solr-fusionLeander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion
  • 2. ● development ist part of the SE-project finc ● hired a company to develop solr-fusion ● everything is Open Source, from the beginning (see the github link above) https://github.com/outermedia/solr-fusion
  • 3. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion
  • 4. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion Sharding, req. same Schema
  • 5. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion NO Sharding, with different schemas
  • 6. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion NO Sharding, with different schemas solr-fusion
  • 7. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion 1. receive solr-query from the application 2. translate query for each server/schema 3. collect results from servers 4. recalculate score values 5. probably some data manipulation 6. merge results 7. send merged result-list back to application
  • 8. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion Concerns. What about... ● heterogeneous data quality? ● accurate relevance calculation? ● different metadata types? ● …
  • 9. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion Concerns. What about... ● heterogeneous data quality? ● accurate relevance calculation? ● different metadata types? ● … Yes, it is an experiment. Good enough is just fine.
  • 10. Leander Seige, Leipzig University Library, seige@ub.uni-leipzig.de https://github.com/outermedia/solr-fusion Please feel free to join the development! http://finc.info seige@ub.uni-leipzig.de