SlideShare a Scribd company logo
Kino  : Making Semantic Annotations Easier Ajith Ranabahu # , Priti Parikh # , Maryam Panahiazar # ,  Amit Sheth #   and Flora Logan-Klumpler* # Ohio Center of Excellence in Knowledge-enabled Computing ( kno.e.sis ) * Sanger Institute and University of Cambridge, UK
What is Kino ? A search engine of sort currently specialized for scientific domains Semantic Annotation (using SA-REST) for services and documents (or any Web resources) Indexing system Uses SA-REST,  Faceted search integrates with NCBO  Tools include browser plugin & back-end server
Two use cases service lookups:  eg: BioCatalogue returns about 75 Web services for the search term “gene prediction”. However, it returns only 20 Web services for the term “gene finding”, even though gene prediction and gene finding are synonyms.  supporting the whole cycle of annotation and curation at Sangers
Use case 1 : Service lookups for gene data  Find Web services using a service catalog for these operations Tagging is not consistent, keyword based Lookups are not accurate and do not take synonyms into account ! Genome Genome Sequencing Sequenced Genome Gene Prediction Predicted Gene Gene sequence Comparison Cellular location determination Function prediction Detailed Information about the Gene
Use case 2 : Scientific Document Annotation Read Published Paper Add notes via a browser plugin Look up terms and update the notes Are the  annotations  complete ? Submit annotations to the database yes no Long and cumbersome cycle of term lookup and updates. This done by scientists – is there a way to make this process easier for them ? Courtesy of  Sanger Institute UK
How does it work ? NCBO Ontology Access API NCBO Ontology Repository Kino Search API SOLRJ Kino Index API SOLR Web Interface Lucene Index Kino Browser Plugin Web Pages Kino Web Front-end Other Front -ends NCBO REST Service Kino Back-end Kino browser based annotation Kino Search Interfaces
What did we learn? Back-end automatically including even the synonyms make a significant improvement in search recall. Having integrated tooling enables faster cycles in scientific literature annotations.
Check out our poster for more details Video demo at  http://wiki.knoesis.org/index.php/Kino Paper at http://knoesis.org/library/resource.php?id=1553

More Related Content

PPT
Kino : Making Semantic Annotations Easier
PDF
ICAR 2015 Workshop - Nick Provart
PPT
AAAS talk: Bioinformatics for Biologists
PPTX
Stanford workshop2020
PPTX
Vaughn aip walkthru_pag2015
PDF
Plant ontology web services on Araport
Kino : Making Semantic Annotations Easier
ICAR 2015 Workshop - Nick Provart
AAAS talk: Bioinformatics for Biologists
Stanford workshop2020
Vaughn aip walkthru_pag2015
Plant ontology web services on Araport

Viewers also liked (8)

PPTX
Knoesis Student Achievement
PDF
Trust Management: A Tutorial
PPTX
Web and Complex Systems Lab @ Kno.e.sis
PPTX
2015 Kno.e.sis Center Annual Review
PPTX
Kno.e.sis Approach to Impactful Research & Training for Exceptional Careers
PPT
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
PPTX
Smart Data - How you and I will exploit Big Data for personalized digital hea...
PDF
How to Become a Thought Leader in Your Niche
Knoesis Student Achievement
Trust Management: A Tutorial
Web and Complex Systems Lab @ Kno.e.sis
2015 Kno.e.sis Center Annual Review
Kno.e.sis Approach to Impactful Research & Training for Exceptional Careers
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Smart Data - How you and I will exploit Big Data for personalized digital hea...
How to Become a Thought Leader in Your Niche
Ad

Similar to Kino : Making Semantic Annotations Easier (20)

PPTX
Web Apollo Tutorial for the i5K copepod research community.
PDF
BG7, a new system for bacterial genome annotation designed for NGS data
PPT
Java Introductie
PPT
Finding knowledge, data and answers on the Semantic Web
PPTX
Building a Semantic search Engine in a library
PPTX
Module development
PPTX
BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
PPTX
Arabidopsis Information Portal: A Community-Extensible Platform for Open Data
PPTX
InterPro and InterProScan 5.0
 
PDF
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
PPTX
Sequencedatabases
PDF
iMicrobe_ASLO_2015
PPTX
Building Tools for Neuroimaging
PPTX
Zmasek TOPSAN Biohackathon 2011
PDF
The BioAssay Research Database
PPTX
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...
PPT
SooryaKiran Bioinformatics
PDF
exRNA Data Analysis Tools in the Genboree Workbench
PPTX
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
PPT
2012 03 01_bioinformatics_ii_les1
Web Apollo Tutorial for the i5K copepod research community.
BG7, a new system for bacterial genome annotation designed for NGS data
Java Introductie
Finding knowledge, data and answers on the Semantic Web
Building a Semantic search Engine in a library
Module development
BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
Arabidopsis Information Portal: A Community-Extensible Platform for Open Data
InterPro and InterProScan 5.0
 
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
Sequencedatabases
iMicrobe_ASLO_2015
Building Tools for Neuroimaging
Zmasek TOPSAN Biohackathon 2011
The BioAssay Research Database
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...
SooryaKiran Bioinformatics
exRNA Data Analysis Tools in the Genboree Workbench
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
2012 03 01_bioinformatics_ii_les1
Ad

Recently uploaded (20)

PDF
Mushroom cultivation and it's methods.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Machine Learning_overview_presentation.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Spectroscopy.pptx food analysis technology
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
1. Introduction to Computer Programming.pptx
PDF
A comparative analysis of optical character recognition models for extracting...
PPT
Teaching material agriculture food technology
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Mushroom cultivation and it's methods.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
A Presentation on Artificial Intelligence
Diabetes mellitus diagnosis method based random forest with bat algorithm
Group 1 Presentation -Planning and Decision Making .pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Machine Learning_overview_presentation.pptx
Encapsulation theory and applications.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Programs and apps: productivity, graphics, security and other tools
Spectroscopy.pptx food analysis technology
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
1. Introduction to Computer Programming.pptx
A comparative analysis of optical character recognition models for extracting...
Teaching material agriculture food technology
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Per capita expenditure prediction using model stacking based on satellite ima...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11

Kino : Making Semantic Annotations Easier

  • 1. Kino : Making Semantic Annotations Easier Ajith Ranabahu # , Priti Parikh # , Maryam Panahiazar # , Amit Sheth # and Flora Logan-Klumpler* # Ohio Center of Excellence in Knowledge-enabled Computing ( kno.e.sis ) * Sanger Institute and University of Cambridge, UK
  • 2. What is Kino ? A search engine of sort currently specialized for scientific domains Semantic Annotation (using SA-REST) for services and documents (or any Web resources) Indexing system Uses SA-REST, Faceted search integrates with NCBO Tools include browser plugin & back-end server
  • 3. Two use cases service lookups: eg: BioCatalogue returns about 75 Web services for the search term “gene prediction”. However, it returns only 20 Web services for the term “gene finding”, even though gene prediction and gene finding are synonyms. supporting the whole cycle of annotation and curation at Sangers
  • 4. Use case 1 : Service lookups for gene data Find Web services using a service catalog for these operations Tagging is not consistent, keyword based Lookups are not accurate and do not take synonyms into account ! Genome Genome Sequencing Sequenced Genome Gene Prediction Predicted Gene Gene sequence Comparison Cellular location determination Function prediction Detailed Information about the Gene
  • 5. Use case 2 : Scientific Document Annotation Read Published Paper Add notes via a browser plugin Look up terms and update the notes Are the annotations complete ? Submit annotations to the database yes no Long and cumbersome cycle of term lookup and updates. This done by scientists – is there a way to make this process easier for them ? Courtesy of Sanger Institute UK
  • 6. How does it work ? NCBO Ontology Access API NCBO Ontology Repository Kino Search API SOLRJ Kino Index API SOLR Web Interface Lucene Index Kino Browser Plugin Web Pages Kino Web Front-end Other Front -ends NCBO REST Service Kino Back-end Kino browser based annotation Kino Search Interfaces
  • 7. What did we learn? Back-end automatically including even the synonyms make a significant improvement in search recall. Having integrated tooling enables faster cycles in scientific literature annotations.
  • 8. Check out our poster for more details Video demo at http://wiki.knoesis.org/index.php/Kino Paper at http://knoesis.org/library/resource.php?id=1553

Editor's Notes

  • #2: Kino is a pearl hunter - he goes in depths and finds pearls in this novel: http://en.wikipedia.org/wiki/The_Pearl_%28novel%29
  • #3: Kino is a semantic annotation and indexing system for scientific domains. It is built around SA-REST and our own APIhut technology as the backend, using the NCBO RESTful APIs. The ease of annotation comes with the use of a browser plugin and the integrated server back-end which can directly receive annotated content. Any Web document can be annotated and published to a faceted index easily. Since the NCBO repositories are tightly integrated, annotators can directly lookup the matching concepts and apply the annotation to the document.
  • #4: We had two usecase – one with service lookups and one with scientific document annotations In service lookup case, we learnt that there is a significant advantage in the case of using even the synonyms associated with the concept. It’s clear that if more sophisticated information can be used from the ontologies at the time of indexing, it would definitely be helpful for better recall. For document annotation – a real use case from Sanger, the whole cycle of annotation and curation can now be reduced significantly by this integrated tooling. The current workflow is very labor intensive but using a toolset like Kino reduces the effort and time required significantly.
  • #5: The lookup of services is slow and inaccurate – each tag or keyword put manually is subjected to non-uniformity and cultural biases. Often a scientist may have to sift through a number of services to find the needed one.
  • #6: This document annotation process , currently in use by Sanger, has many cycles between the annotations. The cause for this is that the document is not directly annotated but notes are added instead and the concepts are looked up later in the cycle based on the notes. The lack of tool integration puts a heavy burden on the scientists and also the process is prone to much errors.
  • #7: With the first animation : The annotation process starts by highlighting words, phrases or links in the web documents. When the appropriate menus are clicked, the user is presented with a the search results from NCBO and well as the options to browse for a desired concept. When the concept is selected, the in-memory document is modified to attach the annotation. With the second animation: Once the document is fully annotated, it can be submitted via the plugin, to the kino back-end instance. The index api filters out the annotations, makes lookups for synonyms for the concepts via NCBO and finally indexes the document using the Apache SOLR engine. The complete set of front-end and back-end functions are exposed via APIs so that programs (other than the plugin and the front-end) can make use of the kino system to directly submit and retrieve content. Third animation: During lookups, the users can use the AjaX front-end to lookup documents using concept names, synonyms with concept names or just plain keywords to look up documents. They can filter the results using facets and restrict the search for specific annotations.
  • #8: We had two usecase – one with service lookups and one with scientific document annotations In service lookup case, we learnt that there is a significant advantage in the case of using even the synonyms associated with the concept. It’s clear that if more sophisticated information can be used from the ontologies at the time of indexing, it would definitely be helpful for better recall. For document annotation – a real use case from Sanger, the whole cycle of annotation and curation can now be reduced significantly by this integrated tooling. The current workflow is very labor intensive but using a toolset like Kino reduces the effort and time required significantly.
  • #9: Take a look at our poster and also the videos in the Kino page. Paper is in the proceeding or at the link given.