SlideShare a Scribd company logo
Using online chemistry databases
to facilitate structure identification
in mass spectral data
Antony Williams,
Valery Tkachenko, Alexey Pshenichnov
ACS Denver, March 2015
Free and Easy
• Everything I will show in terms of ChemSpider
is available for free online today
• To make it easy to “take notes” these slides
will be available at:
www.slideshare.net/AntonyWilliams/
www.ChemSpider.com
ChemSpider
What will ChemSpider give us?
What will ChemSpider give us?
For Mass Spectrometrists
• Valuable searches for Mass Spec would be:
• Search the database by mass or formula for
structure identification
• Search subsets of data – e.g. “metabolism”,
pesticides etc
• Link structure-based data across the internet
• Provide “programming interfaces” to integrate
• Does ChemSpider provide value to Mass
Spectrometrists?
Pre-calculated data
Data Source Selection
• >34 million chemicals include
• Vendor collections
• Government databases
• Individual/Lab data
• Publication data
• All segregated allowing for data source
selection
Data Source Selection - Type
Data Source Selection -
Individual
Mass Spec Analysis
Jim Little, Eastman Chemical
ChemSpider Interface
1287 Hits Ranked by Defect
1287 Hits Ranked by # of
References
Top Ranked Hit
Tinuvin 328
What can I find on ChemSpider?
What can I find?
What can I find?
Source and Purchase…
What can I find on ChemSpider?
External Calculation Engines
What can I find on ChemSpider?
…and in the RSC Databases..
Linked to the Publisher
What can I find?
And out to Google Patents
What About the Entire Web?
The InChI Identifier
InChIStrings Hash to InChIKeys
Searching Internet by Structure
Extended Study
Sorting by references
Position sorted by references
Position 1 only
Web Services For Collaboration
• Many instrument vendors are using or
investigating our web-based services for
compound lookup
• Many academic sites integrating directly –
metabonomics, name lookup, mass-based
searching
Results of the ChemSpider Search
in the MarkerLynx Worksheet
Hit Details in ChemSpider
“REAL Spectral Data”
• Masses on ChemSpider are clearly valuable!
• We’d like to host “spectral curves”
• But we’re a publisher so what can we do?
Spectra: Cholesterol
ChemSpider ID 24528095 H1 NMR
ChemSpider ID 24528095 HHCOSY
Publications & “Real Spectra”
• We are turning text into spectra
• We are turning figures into spectra
ESI – Text Spectra
1H NMR (CDCl3, 400 MHz):
δ = 2.57 (m, 4H, Me, C(5a)H), 4.24 (d, 1H, J = 4.8 Hz, C(11b)H), 4.35 (t,
1H, Jb = 10.8 Hz, C(6)H), 4.47 (m, 2H, C(5)H), 4.57 (dd, 1H, J = 2.8 Hz,
C(6)H), 6.95 (d, 1H, J = 8.4 Hz, ArH), 7.18–7.94 (m, 11H, ArH)
“Where is the real data please?”
FIGURE
DATA
Future Developments
• We have extracted 100s of 1000s of text strings
from patents – next we go into our archive
• We estimate many 1000s of figures with spectral
data in our ESI and articles
• We are aiming for a million spectra online…
• But YOU can submit your data today and share it
We want this…we need YOU!
Data Mining – it’s mine, mine!
New Repository Architecture
doi: 10.1007/s10822-014-9784-5
Acknowledgments
• Jim Little, Eastman Chemical Company
• Daniel Lowe – NextMove Software
• Bill Brouwer – Plot2Txt Development
• Carlos Cobas and Stan Sykora– MestreLabs
• Patrick Wheeler - ACD/Labs
Thank you
Email: williamsa@rsc.org
ORCID: 0000-0002-2668-4821
Twitter: @ChemConnector
Personal Blog: www.chemconnector.com
SLIDES: www.slideshare.net/AntonyWilliams

More Related Content

PPTX
EPA CompTox chemicals dashboard: An online resource for environmental chemists
PPTX
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
PPT
Accessing chemical health and safety data online using Royal Society of Chemi...
PPT
The application of cloud computing to royal society of chemistry data platforms
PPT
Structure verification and elucidation using the ChemSpider database
PPT
Royal Society of Chemistry open source cheminformatics platforms and libraries
PPTX
The Role of Trust in Science at SLA 2011
PPTX
Open Notebook Science Web Services - ACS Spring 2011
EPA CompTox chemicals dashboard: An online resource for environmental chemists
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
Accessing chemical health and safety data online using Royal Society of Chemi...
The application of cloud computing to royal society of chemistry data platforms
Structure verification and elucidation using the ChemSpider database
Royal Society of Chemistry open source cheminformatics platforms and libraries
The Role of Trust in Science at SLA 2011
Open Notebook Science Web Services - ACS Spring 2011

What's hot (20)

PPTX
The collection, curation and modeling of Open Melting Point measurements
PPTX
Delivering The Benefits of Chemical-Biological Integration in Computational T...
PPT
ChemSpider as an integration hub for interlinked chemistry data
PPT
eScience Resources for the Chemistry Community from the Royal Society of Chem...
PPTX
The needs for chemistry standards, database tools and data curation at the ch...
PPT
ChemInfo 2011 class1
PPTX
MiniSymp2011 Bradley
PPTX
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
PPTX
Structure Identification Using High Resolution Mass Spectrometry Data and the...
PPT
The importance of standards for data exchange and interchange on the Royal So...
PPTX
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
PPTX
Bradley Opal 2011
PPTX
Delivering The Benefits of Chemical-Biological Integration in Computational T...
PPTX
US EPA CompTox Chemicals Dashboard Data Integration Hub to Support Environmen...
PPTX
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
PPT
NBCC Open Notebook Science Talk
PPT
The UK National Chemical Database Service – an integration of commercial and ...
PPT
Adding complex expert knowledge into chemical database and transforming surfa...
PPTX
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
The collection, curation and modeling of Open Melting Point measurements
Delivering The Benefits of Chemical-Biological Integration in Computational T...
ChemSpider as an integration hub for interlinked chemistry data
eScience Resources for the Chemistry Community from the Royal Society of Chem...
The needs for chemistry standards, database tools and data curation at the ch...
ChemInfo 2011 class1
MiniSymp2011 Bradley
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
The importance of standards for data exchange and interchange on the Royal So...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
Bradley Opal 2011
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US EPA CompTox Chemicals Dashboard Data Integration Hub to Support Environmen...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
NBCC Open Notebook Science Talk
The UK National Chemical Database Service – an integration of commercial and ...
Adding complex expert knowledge into chemical database and transforming surfa...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Ad

Viewers also liked (14)

PPTX
10 шанц практика
PDF
Veolia france arun lakhani
PPT
Narkotiki
PPT
Merp itp banking financing model_ukr_makovyak_17_mar2015_kyiv
PPTX
The Process of Discipleship
PPTX
7 8 заверико-програма кредитування енергозбереження осбб,девал 11.03.2015
PDF
положення про проведення дпа
PDF
EMT - Event Management Team 15 yaşında...
PPTX
“Energy Union”: a packaging? Yes… with a content | Rome March 2015
PDF
Presentation_โครงการไหม ดอกสวย
PDF
Книга в гостях у малышей". Неделя детской книги в первой младшей группе.
DOCX
PPTX
презентация группа кораблик
10 шанц практика
Veolia france arun lakhani
Narkotiki
Merp itp banking financing model_ukr_makovyak_17_mar2015_kyiv
The Process of Discipleship
7 8 заверико-програма кредитування енергозбереження осбб,девал 11.03.2015
положення про проведення дпа
EMT - Event Management Team 15 yaşında...
“Energy Union”: a packaging? Yes… with a content | Rome March 2015
Presentation_โครงการไหม ดอกสวย
Книга в гостях у малышей". Неделя детской книги в первой младшей группе.
презентация группа кораблик
Ad

Similar to Using online chemistry databases to facilitate structure identification in mass spectral data (20)

PPT
Dealing with the complex challenge of managing diverse analytical chemistry d...
PPT
Utilizing Online Databases for the Purpose of Structure Identification – Appr...
PPT
PPT
PDF
PPT
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
PPT
ChemSpider and How The Wisdom Of The Crowds Can Improve The Quality Of ...
PPT
PPT
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
PPT
PPT
ChemSpider, how a free community resource of data can support teaching NMR sp...
PPT
Taming The Wild West Of Internet Based Chemistry You Can Help
PPT
PPT
eScience at the Royal Society of Chemistry and our current initiatives
PPT
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
PPT
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
Dealing with the complex challenge of managing diverse analytical chemistry d...
Utilizing Online Databases for the Purpose of Structure Identification – Appr...
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
ChemSpider and How The Wisdom Of The Crowds Can Improve The Quality Of ...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider, how a free community resource of data can support teaching NMR sp...
Taming The Wild West Of Internet Based Chemistry You Can Help
eScience at the Royal Society of Chemistry and our current initiatives
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...

Recently uploaded (20)

PPTX
Application of enzymes in medicine (2).pptx
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPT
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
PPTX
Pharmacology of Autonomic nervous system
PDF
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
PDF
Sciences of Europe No 170 (2025)
PPTX
Hypertension_Training_materials_English_2024[1] (1).pptx
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPT
veterinary parasitology ````````````.ppt
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PPTX
BIOMOLECULES PPT........................
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
Microbes in human welfare class 12 .pptx
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Application of enzymes in medicine (2).pptx
Biophysics 2.pdffffffffffffffffffffffffff
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
Pharmacology of Autonomic nervous system
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
Sciences of Europe No 170 (2025)
Hypertension_Training_materials_English_2024[1] (1).pptx
7. General Toxicologyfor clinical phrmacy.pptx
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
veterinary parasitology ````````````.ppt
lecture 2026 of Sjogren's syndrome l .pdf
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
BIOMOLECULES PPT........................
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
. Radiology Case Scenariosssssssssssssss
Microbes in human welfare class 12 .pptx
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...

Using online chemistry databases to facilitate structure identification in mass spectral data