Structure Identification Using High Resolution
Mass Spectrometry Data and the
EPA Chemistry Dashboard
Antony J. Williams†, Andrew McEachran, Jon Sobus,
Chris Grulke, Jennifer Smith, Michelle Krzyzanowski,
Jordan Foster and Jeff Edwards
National Center for Computational Toxicology
U.S. Environmental Protection Agency, RTP, NC
August 21-25, 2016
ACS Fall Meeting, Philadelphia, PA
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
http://www.orcid.org/0000-0002-2668-4821
@ChemConnector on Twitter
Comparing Analysis Approaches
• Targeted Analysis:
- We know exactly what we’re looking for
- 10s – 100s of chemicals
• Suspect Screening Analysis (SSA):
- We have chemicals of interest
- 100s – 1,000s of chemicals
• Non-Targeted Analysis (NTA):
- We have no preconceived lists
- 1,000s – 10,000s of chemicals
- In dust, soil, food, air, water, products,
plants, animals, and…us!!
General Goals of SSA/NTA
- 1 Dust Sample
- Negative Ionization Mode
- 300 Extracted “Molecular Features”
1) Prioritize “Molecular Features”
2) Correctly assign formulas
3) Correctly assign structures
4) Determine chemical sources
5) Predict chemical concentrations
[Figure: steps (1) to (5), with example labels C17H19NO3 and 12 µg/g, leading to EXPOSURE]
Non-targeted analysis challenges
• 3000-5000 molecular features in a given
sample
• Current technologies can identify up to 5% of these features
• How can we improve identification?
– Simple workflows
– Reliable formula prediction (Instrument)
– Accurate ranking of likelihood (Databases)
The General Approach
• Analytical Instruments
• Comp. Tools & Workflows
• Databases
Previous Work with Suspect-Screening
We’re on the Right Path…
• … but certainly room for improvement
• ~thousands of molecular features (not unique)
• 33 confirmed chemicals
• State-of-the-art SSA yields <5% confirmed IDs
• So what else is in these (and other) samples??
2012: Definitive Study of Known-Unknowns using ChemSpider
ChemSpider
• http://www.chemspider.com
• Grows daily with new depositors and annotations
• Example Data Sources
Our New Dashboard
https://comptox.epa.gov
Bisphenol A
Physicochemical Properties
Bioassay Screening Data
Functional Use and Composition
External Links
National Environmental Methods Index
Advanced Search
Formula Searching
Formulae matching Bisphenol A
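A formula search like the Bisphenol A example starts from a molecular formula, and a mass search starts from the corresponding monoisotopic mass. The sketch below shows that conversion for Bisphenol A (C15H16O2). It is illustrative only and is not the Dashboard's code: `parse_formula` and `monoisotopic_mass` are hypothetical helper names, the parser handles only simple formulas without parentheses or charges, and the element table is limited to C, H, N and O.

```python
import re

# Monoisotopic masses (u) of the most abundant isotope of each element.
# Only C, H, N and O are included in this sketch.
MONOISOTOPIC_MASS = {
    "C": 12.0,
    "H": 1.00782503223,
    "N": 14.00307400443,
    "O": 15.99491461957,
}


def parse_formula(formula):
    """Return {element: count} for a simple formula such as 'C15H16O2'."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def monoisotopic_mass(formula):
    """Sum the monoisotopic masses of all atoms in the formula."""
    return sum(MONOISOTOPIC_MASS[el] * n for el, n in parse_formula(formula).items())


bpa_mass = monoisotopic_mass("C15H16O2")        # Bisphenol A
feature_mass = monoisotopic_mass("C17H19NO3")   # formula shown on the SSA/NTA goals slide
print(f"{bpa_mass:.4f} {feature_mass:.4f}")
```

The computed mass is what would be compared against candidates in a mass search with a tolerance window.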
Formula Search Results
Download to Excel
Download as SDF file
SDF file opened in ChemFolder
Rank-ordering all of those hits??
• With so many hits, how do you rank-order based
on formulae? Or mass?
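One answer, used in the comparisons later in this deck, is to keep the candidates within a monoisotopic mass window (+/- 0.005 amu) and sort them by number of references (ChemSpider) or data sources (Dashboard). The sketch below is a minimal illustration of that idea under stated assumptions: the `Candidate` class, the function names and the three candidates with their counts are all invented for the example and are not taken from either database.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    monoisotopic_mass: float
    data_sources: int  # hypothetical count of data sources or references


def rank_candidates(candidates, query_mass, tolerance=0.005):
    """Keep candidates inside the mass window, most data sources first."""
    hits = [c for c in candidates
            if abs(c.monoisotopic_mass - query_mass) <= tolerance]
    return sorted(hits, key=lambda c: c.data_sources, reverse=True)


def rank_position(ranked, name):
    """1-based position of a named compound in the ranked list, or None."""
    for position, candidate in enumerate(ranked, start=1):
        if candidate.name == name:
            return position
    return None


# Invented candidates for illustration only.
candidates = [
    Candidate("obscure isomer", 228.1162, 2),
    Candidate("well-documented compound", 228.1150, 40),
    Candidate("outside mass window", 228.1250, 90),
]

ranked = rank_candidates(candidates, query_mass=228.1150)
print([c.name for c in ranked])
print(rank_position(ranked, "well-documented compound"))
```

The "rank position / total results" figures reported for each compound correspond to `rank_position(...)` and `len(ranked)` in this sketch.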
Comparing Performance
721k structures
Bisphenol A as an example
ChemSpider: 1564 Structures
Bisphenol A as an example
Dashboard: 215 Structures
A more pointed example…C15H15N3O2
6926 results
A more pointed example…C15H15N3O2
94 results
Particulate Matter
Antibiotics used in animal production
TYL = Tylosin
MON = Monensin
TC = Tetracycline
OTC = Oxytetracycline
CTC = Chlortetracycline
McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555
Antibiotics in beef commercial feed
Rank-ordering Comparisons

Mean Rank Position
                                              Mass-based Search        Formula-based Search
Agricultural Source              # Compounds  Dashboard  ChemSpider    Dashboard  ChemSpider
Wastewater land application [1]  34           1.3        1.8           1.1        1.1
Cattle Feedyard [2]              5            1.0        1.0           1.0        1.0

Rank Position/Total # Results
                   Mass-based Search        Formula Based Search
                   Dashboard  ChemSpider    Dashboard  ChemSpider
Tylosin            1/1        1/28          1/1        1/25
Monensin           1/1        1/39          1/1        1/24
Tetracycline       1/38       1/4008        1/11       1/355
Oxytetracycline    1/16       1/3271        1/3        1/110
Chlortetracycline  1/23       1/2545        1/3        1/77

[1] McEachran AD, Shea D, Bodnar W, Nichols EG. 2016. Pharmaceutical Occurrence in groundwater and surface waters in forests land-applied with municipal wastewater. Environ Toxicol Chem 35: 898-905. DOI: 10.1002/etc.3216
[2] McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555
Chemical Identification: Dashboard vs ChemSpider
Monoisotopic Mass (+/- 0.005 amu) Search
Sorted by number of references (ChemSpider) or data sources (Dashboard)

                                                                                         Position of compound sorted
Source of List              # of Compounds  Search Tool  Mean Position  Median Position  #1  #2  #3  #4  #5+
McEachran et al Wastewater  34              ChemSpider   1.8            1                28  5   0   0   1
                                            Dashboard    1.3            1                31  2   0   0   1
Misc. NTA Compounds         13              ChemSpider   2              1                7   5   0   0   1
                                            Dashboard    1.7            1                10  2   0   0   1
Bade et al (2016)           19              ChemSpider   2.1            1                11  2   5   0   1
                                            Dashboard    1.6            1                12  3   3   1   0
Rager et al (2016)          24              ChemSpider   2.25           1                15  2   1   2   4
                                            Dashboard    1.08           1                22  2   0   0   0
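The mean position values can be checked against the position counts for any row where no compound falls in the open-ended #5+ bin. The sketch below does this for the two Dashboard rows that meet that condition (Rager et al and Bade et al). The function name `summarize_positions` is an assumption made for this example. Rows with a non-zero #5+ count cannot be recomputed this way because the exact positions are not given.

```python
def summarize_positions(counts_by_position):
    """Return (mean position, fraction ranked #1) from {position: count}."""
    total = sum(counts_by_position.values())
    mean_position = sum(p * n for p, n in counts_by_position.items()) / total
    fraction_first = counts_by_position.get(1, 0) / total
    return mean_position, fraction_first


# Dashboard rows from the table above (counts at positions #1 to #4; #5+ is 0).
rager_dashboard = {1: 22, 2: 2, 3: 0, 4: 0}
bade_dashboard = {1: 12, 2: 3, 3: 3, 4: 1}

rager_mean, rager_first = summarize_positions(rager_dashboard)
bade_mean, bade_first = summarize_positions(bade_dashboard)

print(f"Rager: mean {rager_mean:.2f}, #1 in {rager_first:.0%} of cases")
print(f"Bade:  mean {bade_mean:.1f}, #1 in {bade_first:.0%} of cases")
```

Both recomputed means agree with the tabulated values of 1.08 and 1.6.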
Dashboard vs ChemSpider Ranking Summary

                             Mass-based Searching     Formula Based Searching
                             Dashboard  ChemSpider    Dashboard  ChemSpider
Cumulative Average Position  1.3        2.2           1.2        1.4
% in #1 Position             85%        70%           88%        80%

162 total individual chemicals in search
Functional Use to Sort Candidates
• Anti-cancer Drug
• Microbiological Indicator Dye
• Textile/Product Dye
Future Work
• Rank-ordering based on other criteria
• Already testing QSARs to build retention
time models for ranking
• External links to methods: e.g. CDC NIOSH
• Formula identification using isotope profiles
Engaging the MS Community
ToxCast Chemicals
What impurities/interaction products found?
Conclusions
• Our NTA research is focused on understanding
our exposure to chemicals
• New dashboard with focus on high-quality
data – no large database will be perfect!
• Specific searches/functionality are being
developed with Non-targeted Analysis in mind
• Dashboard outperforms ChemSpider, a
community standard database, in ranking
chemicals of environmental concern
• Early work on new rank-ordering approaches
shows that we can improve things even further.
Acknowledgements
EPA NCCT
Chris Grulke
Jeff Edwards
Ann Richard
Jordan Foster
Jennifer Smith
Andrew McEachran*
Michelle Krzyzanowski
EPA NERL
Kathie Dionisio
Katherine Phillips
Jon Sobus
Mark Strynar
Elin Ulrich
Seth Newton
* = ORISE Participant

Editor's Notes

  • Slide 33 (Dashboard vs ChemSpider Ranking Summary): For example, Rager was actually 33 confirmed; Bade was 25