SlideShare a Scribd company logo
Structure Identification Using High Resolution
Mass Spectrometry Data and the
EPA Chemistry Dashboard
Antony J. Williams†, Andrew McEachran, Jon Sobus,
Chris Grulke, Jennifer Smith, Michelle Krzyzanowski,
Jordan Foster and Jeff Edwards
National Center for Computational Toxicology
U.S. Environmental Protection Agency, RTP, NC
August 21-25, 2016
ACS Fall Meeting, Philadelphia, PA
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
orcid.org/0000-0003-1423-330X
Comparing Analysis Approaches
• Targeted Analysis:
- We know exactly what we’re looking for
- 10s – 100s of chemicals
• Suspect Screening Analysis (SSA):
- We have chemicals of interest
- 100s – 1,000s of chemicals
• Non-Targeted Analysis (NTA):
- We have no preconceived lists
- 1,000s – 10,000s of chemicals
- In dust, soil, food, air, water, products,
plants, animals, and…us!!
General Goals of SSA/NTA
- 1 Dust Sample
- Negative Ionization Mode
- 300 Extracted “Molecular Features”
1) Prioritize “Molecular Features”
2) Correctly assign formulas
3) Correctly assign structures
4) Determine chemical sources
5) Predict chemical concentrations
C17H19NO3 12 µg/g
(1)
(2) (3) (4) (5)
EXPOSURE
Non-targeted analysis challenges
• 3000-5000 molecular features in a given
sample
• Current technologies can identify up to 5%
• How can we improve identification???
– Simple workflows
– Reliable formula prediction (Instrument)
– Accurate ranking of likelihood (Databases)
McEachran, AD. Pharmaceutical and Personal Care Product (PPCP) Occurrence, Distribution, and Export
at a Forest-Water Reuse System. 2016. Dissertation, NC State University.
The General Approach
Analytical Instruments Comp. Tools & Workflows
Databases
Previous Work with Suspect-Screening
We’re on the Right Path…
• … but certainly room for improvement
• ~thousands of molecular features (not unique)
• 33 confirmed chemicals
• State-of-the-art SSA yields <5% confirmed IDs
• So what else is in these (and other) samples??
2012: Definitive Study of Known-
Unknowns using ChemSpider
7
2012: Definitive Study of Known-
Unknowns using ChemSpider
8
ChemSpider
• http://www.chemspider.com
• Grows daily with new depositors and
annotations. Example Data Sources
9
Our New Dashboard
https://comptox.epa.gov
10
Bisphenol A
11
Physicochemical Properties
12
Bioassay Screening Data
13
Functional Use and Composition
14
External Links
15
National Environmental Methods Index
16
Advanced Search
17
Formula Searching
Formulae matching Bisphenol A
18
Formula Search Results
19
Download to Excel
20
Download as SDF file
21
SDF file opened in ChemFolder
22
Rank-ordering all of those hits??
• With so many hits how do you rank order
based on formulae? Or mass??
23
Comparing Performance
24
721k structures
Bisphenol A as an example
ChemSpider: 1564 Structures
25
Bisphenol A as an example
Dashboard: 215 Structures
26
A more pointed example…C15H15N3O2
6926 results
27
A more pointed example…C15H15N3O2
94 results
28
Particulate
Matter
Antibiotics used in
animal production
TYL= Tylosin
MON= Monensin
TC= Tetracycline
OTC= Oxytetracycline
CTC= Chlortetracycline
McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes:
aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555
Antibiotics in beef commercial feed
Mass-based Search Formula-based Search
Agricultural
Source
# Compounds Dashboard ChemSpider Dashboard ChemSpider
Wastewater land
application1
34 1.3 1.8 1.1 1.1
Cattle Feedyard2 5 1.0 1.0 1.0 1.0
1McEachran AD, Shea D, Bodnar W, Nichols EG. 2016. Pharmaceutical Occurrence in groundwater and surface waters in
forests land-applied with municipal wastewater. Environ Toxicol Chem 35: 898-905. DOI: 10.1002/etc.3216
2McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic
resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343;
DOI:10.1289/EHP.1408555
Mass-based Search Formula Based Search
Dashboard ChemSpider Dashboard ChemSpider
Tylosin 1/1 1/28 1/1 1/25
Monensin 1/1 1/39 1/1 1/24
Tetracycline 1/ 38 1/4008 1/11 1/355
Oxytetracycline 1/16 1/3271 1/3 1/110
Chlortetracycline 1/23 1/2545 1/3 1/77
Rank Position/Total # Results
Mean Rank Position
Rank-ordering Comparisons
Chemical Identification
Dashboard vs ChemSpider
Sorted by number of
references (ChemSpider)
or data sources
(Dashboard)
Monoisotopic Mass (+/- 0.005 amu) Search
Position of compound sorted
Source of List # of
Compounds
Search Tool Mean
Position
Median
Position #1 #2 #3 #4 #5+
McEachran et al
Wastewater
34 ChemSpider 1.8 1 28 5 0 0 1
Dashboard 1.3 1 31 2 0 0 1
Misc. NTA Compounds 13 ChemSpider 2 1 7 5 0 0 1
Dashboard 1.7 1 10 2 0 0 1
Bade et al (2016) 19 ChemSpider 2.1 1 11 2 5 0 1
Dashboard 1.6 1 12 3 3 1 0
Rager et al (2016) 24 ChemSpider 2.25 1 15 2 1 2 4
Dashboard 1.08 1 22 2 0 0 0
Dashboard vs ChemSpider
Ranking Summary
Mass-based Searching Formula Based Searching
Dashboard ChemSpider Dashboard ChemSpider
Cumulative Average
Position 1.3 2.2 1.2 1.4
% in #1 Position 85% 70% 88% 80%
162 total individual chemicals in search
Functional Use to Sort Candidates
33
Anti-cancer Drug
Microbiological
Indicator Dye
Textile/Product Dye
Future Work
• Rank-ordering based on other criteria
• Already testing QSARs to build retention
time models for ranking
• External links to methods: e.g. CDC NIOSH
• Formula identification using isotope profiles
34
ToxCast
Chemicals
What impurities/
interaction products
found?
Engaging the MS Community
Conclusions
• Our NTA research is focused on understanding
our exposure to chemicals
• New dashboard with focus on high-quality
data – no large database will be perfect!
• Specific searches/functionality are being
developed with Non-targeted Analysis in mind
• Dashboard outperforms ChemSpider, a
community standard database, in ranking
chemicals of environmental concern
• Early work on new rank-ordering approaches
show that we can improve things even further.
36
Acknowledgements
EPA NCCT
Chris Grulke
Jeff Edwards
Ann Richard
Jordan Foster
Jennifer Smith
Andrew McEachran*
Michelle Krzyzanowski
EPA NERL
Kathie Dionisio
Katherine Phillips
Jon Sobus
Mark Strynar
Elin Ulrich
Seth Newton
* = ORISE Participant

More Related Content

PPTX
Structure Identification Using High Resolution Mass Spectrometry Data and the...
PPTX
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
PPTX
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
PPTX
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
PPTX
Structure Identification Using High Resolution Mass Spectrometry Data and the...
PDF
The influence of data curation on QSAR Modeling – examining issues of qualit...
PPTX
Delivering The Benefits of Chemical-Biological Integration in Computational T...
PPTX
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
The influence of data curation on QSAR Modeling – examining issues of qualit...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...

What's hot (20)

PPTX
Delivering The Benefits of Chemical-Biological Integration in Computational T...
PPTX
Structure Identification Using High Resolution Mass Spectrometry Data and the...
PPTX
The needs for chemistry standards, database tools and data curation at the ch...
PPTX
Non-targeted analysis supported by data and cheminformatics delivered via the...
PPTX
An examination of data quality on QSAR Modeling in regards to the environment...
PPTX
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
PPTX
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
PPTX
New developments in delivering public access to data from the National Center...
PPTX
Accessing information for chemicals in hydraulic fracturing fluids using the ...
PPT
Adding complex expert knowledge into chemical database and transforming surfa...
PPTX
Development of a Tool for Systematic Integration of Traditional and New Appro...
PPTX
Chemical identification of unknowns in high resolution mass spectrometry usin...
PPT
Chemistry Resources Science Teachers
PDF
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
PPTX
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
PPTX
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
PPTX
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
PPSX
ChemInform RxnFinder
PPT
Pubchem
PPTX
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
The needs for chemistry standards, database tools and data curation at the ch...
Non-targeted analysis supported by data and cheminformatics delivered via the...
An examination of data quality on QSAR Modeling in regards to the environment...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
New developments in delivering public access to data from the National Center...
Accessing information for chemicals in hydraulic fracturing fluids using the ...
Adding complex expert knowledge into chemical database and transforming surfa...
Development of a Tool for Systematic Integration of Traditional and New Appro...
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemistry Resources Science Teachers
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
ChemInform RxnFinder
Pubchem
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Ad

Viewers also liked (20)

DOCX
Ieee 2016 project titles for me computer science
DOCX
Ieee 2016 project titles for me m.tech in trichy
PDF
Syllabi Gate 2017
DOCX
Ieee 2016 2017 titles in network security for java and dotnet
DOC
DOCX
Ieee power system projects with abstract in trichy
PPTX
Speed - 8th Aug'16
ODP
juan jose gomez garcia 6b
DOCX
Ieee 2016 project titles in mobilecomputing for me m.tech
ODP
juan jose gomez garci 6b
DOCX
Ieee power system projects with abstract in trichy
DOCX
Ieee 2016 project titles in mobilecomputing for me m.tech
DOCX
Ieee 2016 2017 titles in network security for java and dotnet
DOCX
Ieee 2016 project titles for me m.tech in trichy
DOCX
Ieee 2016 project titles for me computer science
DOCX
Ieee 2016 project titles for me m.tech in mobilecomputing in trichy
PPTX
Gate 2017
DOCX
Ieee power system projects with abstract in trichy
DOCX
Ieee 2016 2017 titles in network security for java and dotnet
Ieee 2016 project titles for me computer science
Ieee 2016 project titles for me m.tech in trichy
Syllabi Gate 2017
Ieee 2016 2017 titles in network security for java and dotnet
Ieee power system projects with abstract in trichy
Speed - 8th Aug'16
juan jose gomez garcia 6b
Ieee 2016 project titles in mobilecomputing for me m.tech
juan jose gomez garci 6b
Ieee power system projects with abstract in trichy
Ieee 2016 project titles in mobilecomputing for me m.tech
Ieee 2016 2017 titles in network security for java and dotnet
Ieee 2016 project titles for me m.tech in trichy
Ieee 2016 project titles for me computer science
Ieee 2016 project titles for me m.tech in mobilecomputing in trichy
Gate 2017
Ieee power system projects with abstract in trichy
Ieee 2016 2017 titles in network security for java and dotnet
Ad

Similar to Structure identification using high resolution mass spectrometry data and the EPA's Chemistry Dashboard (20)

PDF
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
PPTX
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
PDF
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
PPTX
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
PPTX
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
PPTX
Non-targeted analysis supported by data and cheminformatics delivered via the...
PPTX
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
PPTX
Accessing data to support pesticide residue and emerging contaminant analysis...
PPTX
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
PPTX
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
PPTX
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
PPTX
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
PPTX
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
PPTX
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
PPT
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
PPTX
Accessing Data to Support Pesticide Residue and Emerging Contaminant Analysis...
PPTX
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
PPTX
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
PPTX
Collecting, Evaluating, and Communicating Ecotoxicology Data: ECOTOX Knowledg...
PPTX
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
Non-targeted analysis supported by data and cheminformatics delivered via the...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Accessing data to support pesticide residue and emerging contaminant analysis...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
Accessing Data to Support Pesticide Residue and Emerging Contaminant Analysis...
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
Collecting, Evaluating, and Communicating Ecotoxicology Data: ECOTOX Knowledg...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...

More from Andrew McEachran (8)

PDF
Consensus ranking and fragmentation prediction for identification of unknowns...
PDF
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
PDF
Developing tools for high resolution mass spectrometry-based screening via th...
PDF
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
PPTX
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
PDF
An open workflow to generate "MS-Ready" structures and improve non-targeted m...
PDF
A comparison of three chromatographic retention time prediction models
PPTX
Using the US EPA's CompTox Chemistry Dashboard to support identification and ...
Consensus ranking and fragmentation prediction for identification of unknowns...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
Developing tools for high resolution mass spectrometry-based screening via th...
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
An open workflow to generate "MS-Ready" structures and improve non-targeted m...
A comparison of three chromatographic retention time prediction models
Using the US EPA's CompTox Chemistry Dashboard to support identification and ...

Recently uploaded (20)

PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PPTX
2. Earth - The Living Planet earth and life
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PDF
. Radiology Case Scenariosssssssssssssss
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
Cell Membrane: Structure, Composition & Functions
PPTX
BIOMOLECULES PPT........................
PDF
The scientific heritage No 166 (166) (2025)
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
Sciences of Europe No 170 (2025)
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Biophysics 2.pdffffffffffffffffffffffffff
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
2. Earth - The Living Planet earth and life
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Classification Systems_TAXONOMY_SCIENCE8.pptx
. Radiology Case Scenariosssssssssssssss
POSITIONING IN OPERATION THEATRE ROOM.ppt
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
Derivatives of integument scales, beaks, horns,.pptx
Cell Membrane: Structure, Composition & Functions
BIOMOLECULES PPT........................
The scientific heritage No 166 (166) (2025)
INTRODUCTION TO EVS | Concept of sustainability
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Sciences of Europe No 170 (2025)

Structure identification using high resolution mass spectrometry data and the EPA's Chemistry Dashboard

  • 1. Structure Identification Using High Resolution Mass Spectrometry Data and the EPA Chemistry Dashboard Antony J. Williams†, Andrew McEachran, Jon Sobus, Chris Grulke, Jennifer Smith, Michelle Krzyzanowski, Jordan Foster and Jeff Edwards National Center for Computational Toxicology U.S. Environmental Protection Agency, RTP, NC August 21-25, 2016 ACS Fall Meeting, Philadelphia, PA The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA orcid.org/0000-0003-1423-330X
  • 2. Comparing Analysis Approaches • Targeted Analysis: - We know exactly what we’re looking for - 10s – 100s of chemicals • Suspect Screening Analysis (SSA): - We have chemicals of interest - 100s – 1,000s of chemicals • Non-Targeted Analysis (NTA): - We have no preconceived lists - 1,000s – 10,000s of chemicals - In dust, soil, food, air, water, products, plants, animals, and…us!!
  • 3. General Goals of SSA/NTA - 1 Dust Sample - Negative Ionization Mode - 300 Extracted “Molecular Features” 1) Prioritize “Molecular Features” 2) Correctly assign formulas 3) Correctly assign structures 4) Determine chemical sources 5) Predict chemical concentrations C17H19NO3 12 µg/g (1) (2) (3) (4) (5) EXPOSURE
  • 4. Non-targeted analysis challenges • 3000-5000 molecular features in a given sample • Current technologies can identify up to 5% • How can we improve identification??? – Simple workflows – Reliable formula prediction (Instrument) – Accurate ranking of likelihood (Databases) McEachran, AD. Pharmaceutical and Personal Care Product (PPCP) Occurrence, Distribution, and Export at a Forest-Water Reuse System. 2016. Dissertation, NC State University.
  • 5. The General Approach Analytical Instruments Comp. Tools & Workflows Databases
  • 6. Previous Work with Suspect-Screening
  • 7. We’re on the Right Path… • … but certainly room for improvement • ~thousands of molecular features (not unique) • 33 confirmed chemicals • State-of-the-art SSA yields <5% confirmed IDs • So what else is in these (and other) samples??
  • 8. 2012: Definitive Study of Known- Unknowns using ChemSpider 7
  • 9. 2012: Definitive Study of Known- Unknowns using ChemSpider 8
  • 10. ChemSpider • http://www.chemspider.com • Grows daily with new depositors and annotations. Example Data Sources 9
  • 15. Functional Use and Composition 14
  • 22. Download as SDF file 21
  • 23. SDF file opened in ChemFolder 22
  • 24. Rank-ordering all of those hits?? • With so many hits how do you rank order based on formulae? Or mass?? 23
  • 26. Bisphenol A as an example ChemSpider: 1564 Structures 25
  • 27. Bisphenol A as an example Dashboard: 215 Structures 26
  • 28. A more pointed example…C15H15N3O2 6926 results 27
  • 29. A more pointed example…C15H15N3O2 94 results 28
  • 30. Particulate Matter Antibiotics used in animal production TYL= Tylosin MON= Monensin TC= Tetracycline OTC= Oxytetracycline CTC= Chlortetracycline McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555 Antibiotics in beef commercial feed
  • 31. Mass-based Search Formula-based Search Agricultural Source # Compounds Dashboard ChemSpider Dashboard ChemSpider Wastewater land application1 34 1.3 1.8 1.1 1.1 Cattle Feedyard2 5 1.0 1.0 1.0 1.0 1McEachran AD, Shea D, Bodnar W, Nichols EG. 2016. Pharmaceutical Occurrence in groundwater and surface waters in forests land-applied with municipal wastewater. Environ Toxicol Chem 35: 898-905. DOI: 10.1002/etc.3216 2McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555 Mass-based Search Formula Based Search Dashboard ChemSpider Dashboard ChemSpider Tylosin 1/1 1/28 1/1 1/25 Monensin 1/1 1/39 1/1 1/24 Tetracycline 1/ 38 1/4008 1/11 1/355 Oxytetracycline 1/16 1/3271 1/3 1/110 Chlortetracycline 1/23 1/2545 1/3 1/77 Rank Position/Total # Results Mean Rank Position Rank-ordering Comparisons
  • 32. Chemical Identification Dashboard vs ChemSpider Sorted by number of references (ChemSpider) or data sources (Dashboard) Monoisotopic Mass (+/- 0.005 amu) Search Position of compound sorted Source of List # of Compounds Search Tool Mean Position Median Position #1 #2 #3 #4 #5+ McEachran et al Wastewater 34 ChemSpider 1.8 1 28 5 0 0 1 Dashboard 1.3 1 31 2 0 0 1 Misc. NTA Compounds 13 ChemSpider 2 1 7 5 0 0 1 Dashboard 1.7 1 10 2 0 0 1 Bade et al (2016) 19 ChemSpider 2.1 1 11 2 5 0 1 Dashboard 1.6 1 12 3 3 1 0 Rager et al (2016) 24 ChemSpider 2.25 1 15 2 1 2 4 Dashboard 1.08 1 22 2 0 0 0
  • 33. Dashboard vs ChemSpider Ranking Summary Mass-based Searching Formula Based Searching Dashboard ChemSpider Dashboard ChemSpider Cumulative Average Position 1.3 2.2 1.2 1.4 % in #1 Position 85% 70% 88% 80% 162 total individual chemicals in search
  • 34. Functional Use to Sort Candidates 33 Anti-cancer Drug Microbiological Indicator Dye Textile/Product Dye
  • 35. Future Work • Rank-ordering based on other criteria • Already testing QSARs to build retention time models for ranking • External links to methods: e.g. CDC NIOSH • Formula identification using isotope profiles 34
  • 37. Conclusions • Our NTA research is focused on understanding our exposure to chemicals • New dashboard with focus on high-quality data – no large database will be perfect! • Specific searches/functionality are being developed with Non-targeted Analysis in mind • Dashboard outperforms ChemSpider, a community standard database, in ranking chemicals of environmental concern • Early work on new rank-ordering approaches show that we can improve things even further. 36
  • 38. Acknowledgements EPA NCCT Chris Grulke Jeff Edwards Ann Richard Jordan Foster Jennifer Smith Andrew McEachran* Michelle Krzyzanowski EPA NERL Kathie Dionisio Katherine Phillips Jon Sobus Mark Strynar Elin Ulrich Seth Newton * = ORISE Participant

Editor's Notes

  • #33: For example- Rager was actually 33 confirmed; Bade was 25