Dac Trung Nguyen, Timothy	Sheils,	Geetha Mandava,	Noel	Southall,	Rajarshi	Guha
NCATS,	NIH
Putting	Targets	in	Context
IDG Knowledge Management Center
Entity	browsing	(filterable	&	linked)Search	(full	text,	auto-suggest)
Detailed	view	of	entities Built	on	top	of	a	robust	REST	API
An Interface to the KMC
Current Status
191 facets
17.8 GB database
30 GB Lucene indexes
36K LoC (Java)
14K LoC (Scala)
Image available
Source code available
20,120	targets
15,094	diseases
2.3M	publications
4,500	drugs
Nguyen	&	Mathias	et	al,	NAR,	2017
https://spotlite.nih.gov/ncats/pharos/issues
https://hub.docker.com/r/ncats/pharos/
What’s Included?
• Pharos presents data from a variety of sources,
integrated by U. New Mexico
• Primary focus is the protein target
• Wherever possible, targets are linked to other
entities (which are also interlinked)
Ø Small molecules, Diseases, Publications
• Target related data include
Ø Identifiers, ontology terms, sequence, expression data,
publications (curated & text mined), phenotypes, PPI
Data Sources
Full data source list at
http://targetcentral.ws/Pharos
Full data source list at http://targetcentral.ws/Pharos
Biologists	&	
Clinical	Researcher
• Characterize	&	
validate	novel	
targets
• Identify	key	small	
molecules	or	
biologics
Informatics	
Scientists
• Data	mining
• Support	target	
validation	
projects
Program	Staff
• Explore	the	
research	
landscape
• New	directions	
for	research &	
funding
Target Audience
Do You Know What You Want?
• Efficient full text search
• Primary entry point when exploring and for
hypothesis generation
• Fast autosuggestion facility
Ø Suggestions grouped
by type (disease,
ligand, …)
• Searches run across
all entity types
Ø But can be restricted
to specific ones
Multiple Search Options
Batch	search Sequence	search
Structure	searchText	search
(Possibly) Lots of Results
Filters
Visualization
• Key requirement for efficient exploration, summary
• Increase information density in limited screen real
estate, take context into account
• Interactivity is desirable, high quality for easy
inclusion in documents
• Simple is better than fancy but pretty pictures
have value, make for a better experience
• Integrate and link to external visualization
Visualization Highlights
Visualization	dashboard	– filters	appropriately
represented,	plots	act	as	filters
Inline	visualization	to	increase	information	density
Summary	visualizations	
overlay	multiple	dimensions	
and	can	be	context	aware
Integrating External Tools
Tclin,	Kinase
Tdark,	GPCR
Pharos
TinX
Documentation
Entity Dossier
Multiple	dossiers
Set	operationsVisualization	tools
Download
Dossiers as Context
Overlay	data	from	targets	in	a	dossier
Use Case – Targets for Obesity
20K targets 3K targets
616 targets
4 targets
ALPK1 7 targets listed in
5R01NS044385-14
KIF7
Disease
Obesity
GWAS Trait
Obesity
IDG Family
GPCR/IC/
Kinase
Grants
5R01NS...
Use Case - Target Similarity
• Find understudied
targets that have
similar data
profiles to well
studied targets
• Supports
recommendations,
prioritization
Tdark targets	whose	most	
similar	target	is	not	Tdark
Outreach & Dissemination Activities
User Feedback Deployment
Webinars Documentation
NER API for
targets & diseases
@idg_pharos
Recent	papers	to	
Pharos	links	via	
Tweets
Pharos Usage
• Usage statistics over
the last one year
are generally
increasing
• 89K pageviews
• 14K sessions
• 7.5K users
The Long Term Vision
• Incorporate dependencies
between data types to support
inference and sophisticated filters
• From presentation to summarization
Ø Use explicit links & computational
inference to generate (semi-) natural language
summary using all known data
Ø Influenced by the query
• The result is a biological dashboard,
customized for the user and the query
Target X has been implicated in 3
diseases related to skeletal, urological
and nervous systems. It has been
investigated in 5 in vitro assay, 2 in
vivo assays. There are 4 compounds
active against this target, 3 of which
are in clinical trials.
Feedback
• Explore the UI, try it, break it, and let us know
what works and what doesn’t
• Are there data types and relations that would help
you but are not available?
https://pharos.nih.gov
pharos@nih.gov
@idg_pharos
Acknowledgements
Dac-Trung Nguyen,	Kyle	Brinacombe,	Timothy	
Sheils,	Geetha Mandava,	Noel	Southall,	Ajit Jadhav
Steve	Mathias,	Oleg	Ursu,	Jeremy	Yang,	
Christian	Bologa,	Daniel	Canon,	Tudor	Oprea
Nicholas	Fernandez,	Andrew	Rouillard,	Avi Mayan
Ajay	Pillai,	Aaron	Pawlyk,	Christine	Colvis
Tomita	Lab	/	Finkbeiner lab

More Related Content

PDF
Pharos – A Torch to Use in Your Journey In the Dark Genome
PDF
Pharos: A Torch to Use in Your Journey in the Dark Genome
PDF
BioAssay Research Database Presentation at the Chem Axon UGM 2013
PDF
ELSS use cases and strategy
PPTX
Conference presentation from #iccs2014 in Noordwijkerhout
PPTX
Data reuse and scholarly reward: understanding practice and building infrastr...
PPTX
NCBO haendel talk 2013
PDF
A FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics
Pharos – A Torch to Use in Your Journey In the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark Genome
BioAssay Research Database Presentation at the Chem Axon UGM 2013
ELSS use cases and strategy
Conference presentation from #iccs2014 in Noordwijkerhout
Data reuse and scholarly reward: understanding practice and building infrastr...
NCBO haendel talk 2013
A FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics

What's hot (20)

PPTX
Martone grethe
PPTX
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
PPTX
Research data and scholarly publications: going from casual acquaintances to ...
PPTX
Leveraging publication metadata to help overcome the data ingest bottleneck
PDF
Knowledge Exchange, Nov 2011, Bonn
PPT
Data Mining and Big Data Analytics in Pharma
PPTX
Why should researchers care about data curation?
PPTX
The Dryad Digital Repository: Published evolutionary data as part of the gre...
PPT
Pulverer-embo-source data-nfdp13
PDF
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
PDF
Pathway studio into webinar 052715v1
PDF
Gaining credit for sharing research data
PPTX
effective data sharing for a learning healthcare system
PDF
Analyzing Perturbed Co-Expression Networks in Cancer Using a Graph Database
PPT
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
PPT
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
PDF
dkNET Poster Experimental Biology 2019
PDF
Considerations and challenges in building an end to-end microbiome workflow
PPSX
Rii stock centerdir_aug9_2016
PDF
Next Generation Sequence with Pathway Studio
Martone grethe
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Research data and scholarly publications: going from casual acquaintances to ...
Leveraging publication metadata to help overcome the data ingest bottleneck
Knowledge Exchange, Nov 2011, Bonn
Data Mining and Big Data Analytics in Pharma
Why should researchers care about data curation?
The Dryad Digital Repository: Published evolutionary data as part of the gre...
Pulverer-embo-source data-nfdp13
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
Pathway studio into webinar 052715v1
Gaining credit for sharing research data
effective data sharing for a learning healthcare system
Analyzing Perturbed Co-Expression Networks in Cancer Using a Graph Database
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
dkNET Poster Experimental Biology 2019
Considerations and challenges in building an end to-end microbiome workflow
Rii stock centerdir_aug9_2016
Next Generation Sequence with Pathway Studio
Ad

Similar to Pharos: Putting targets in context (20)

PPTX
Data-knowledge transition zones within the biomedical research ecosystem
PDF
GARNet workshop on Integrating Large Data into Plant Science
PDF
Opening up pharmacological space, the OPEN PHACTs api
PPTX
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
PPTX
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
PPTX
FedCentric_Presentation
PDF
FAIR Data Knowledge Graphs–from Theory to Practice
PPTX
FAIR Data Knowledge Graphs
PPTX
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
PPTX
Presentation from Code Camp 2017
PPTX
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
PPTX
Open PHACTS for BDE SC1.1
PPT
Grid And Healthcare For IOM July 2009
PPTX
Starting the Hadoop Journey at a Global Leader in Cancer Research
PPTX
Starting the Hadoop Journey at a Global Leader in Cancer Research
PPTX
dkNET Introduction for Librarians
PPT
Stratergies for the intergration of information (IPI_ConfEX)
PDF
2015 GU-ICBI Poster (third printing)
PDF
Research Data Alliance (RDA) Webinar: What do you really know about that anti...
PPT
Semantic Web Technologies as a Framework for Clinical Informatics
Data-knowledge transition zones within the biomedical research ecosystem
GARNet workshop on Integrating Large Data into Plant Science
Opening up pharmacological space, the OPEN PHACTs api
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
FedCentric_Presentation
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
Presentation from Code Camp 2017
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
Open PHACTS for BDE SC1.1
Grid And Healthcare For IOM July 2009
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
dkNET Introduction for Librarians
Stratergies for the intergration of information (IPI_ConfEX)
2015 GU-ICBI Poster (third printing)
Research Data Alliance (RDA) Webinar: What do you really know about that anti...
Semantic Web Technologies as a Framework for Clinical Informatics
Ad

More from Rajarshi Guha (20)

PDF
Pharos - Face of the KMC
PDF
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
PDF
What can your library do for you?
PDF
So I have an SD File … What do I do next?
PDF
Characterization of Chemical Libraries Using Scaffolds and Network Models
PDF
From Data to Action : Bridging Chemistry and Biology with Informatics at NCATS
PDF
Robots, Small Molecules & R
PDF
Fingerprinting Chemical Structures
PDF
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
PDF
When the whole is better than the parts
PDF
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
PDF
Pushing Chemical Biology Through the Pipes
PDF
Characterization and visualization of compound combination responses in a hig...
PDF
The BioAssay Research Database
PDF
Cloudy with a Touch of Cheminformatics
PDF
Chemical Data Mining: Open Source & Reproducible
PDF
Chemogenomics in the cloud: Is the sky the limit?
PDF
Quantifying Text Sentiment in R
PDF
PMML for QSAR Model Exchange
PDF
Smashing Molecules
Pharos - Face of the KMC
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
What can your library do for you?
So I have an SD File … What do I do next?
Characterization of Chemical Libraries Using Scaffolds and Network Models
From Data to Action : Bridging Chemistry and Biology with Informatics at NCATS
Robots, Small Molecules & R
Fingerprinting Chemical Structures
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
When the whole is better than the parts
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Pushing Chemical Biology Through the Pipes
Characterization and visualization of compound combination responses in a hig...
The BioAssay Research Database
Cloudy with a Touch of Cheminformatics
Chemical Data Mining: Open Source & Reproducible
Chemogenomics in the cloud: Is the sky the limit?
Quantifying Text Sentiment in R
PMML for QSAR Model Exchange
Smashing Molecules

Recently uploaded (20)

PPTX
perinatal infections 2-171220190027.pptx
PPT
veterinary parasitology ````````````.ppt
PPTX
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PDF
Social preventive and pharmacy. Pdf
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PPT
Computional quantum chemistry study .ppt
PPTX
gene cloning powerpoint for general biology 2
PPTX
Seminar Hypertension and Kidney diseases.pptx
PPT
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...
PPTX
TORCH INFECTIONS in pregnancy with toxoplasma
PPT
Animal tissues, epithelial, muscle, connective, nervous tissue
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PPTX
Hypertension_Training_materials_English_2024[1] (1).pptx
PPTX
Microbes in human welfare class 12 .pptx
PDF
Wound infection.pdfWound infection.pdf123
PDF
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
PPT
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
PPTX
Presentation1 INTRODUCTION TO ENZYMES.pptx
PPTX
limit test definition and all limit tests
perinatal infections 2-171220190027.pptx
veterinary parasitology ````````````.ppt
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
Social preventive and pharmacy. Pdf
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
Computional quantum chemistry study .ppt
gene cloning powerpoint for general biology 2
Seminar Hypertension and Kidney diseases.pptx
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...
TORCH INFECTIONS in pregnancy with toxoplasma
Animal tissues, epithelial, muscle, connective, nervous tissue
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
Hypertension_Training_materials_English_2024[1] (1).pptx
Microbes in human welfare class 12 .pptx
Wound infection.pdfWound infection.pdf123
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
Presentation1 INTRODUCTION TO ENZYMES.pptx
limit test definition and all limit tests

Pharos: Putting targets in context