Crawling Across the Web of Chemistry Using ChemSpider
Citizen Scientists Enable the Web
Who is writing about chemical compounds on Wikipedia?
Who is writing critical reviews of chemistry online?
Who is blogging about chemistry on the web?
For Synthesis… TotallySynthetic.com
Org Prep Daily (Blog)
Molbank (Open Access Journal)
Synthetic Pages (Website)
Encyclopedic Articles (Wikipedia)
 
Chemistry online – An Overview
Encyclopedic articles (Wikipedia)
Chemical vendor databases
Metabolic pathway databases
Property databases
Chemical synthesis procedures
Scientific publications
Chemical vendors
Blogs
Wikis
Open Notebook Science
What and who do you trust?
Compounds and Identifiers
What is ChemSpider?
ChemSpider is:
Building a Structure Centric Community for Chemists
>23 million compounds, ca. 250 data sources
A deposition and curation platform
A publishing platform for the community
Grows daily – more depositions, more links, more data sources
Search Cholesterol
Linked across the internet
Link off a structure in ChemSpider
Chemical suppliers
Other publications
Analytical data
Related reactions
Wikipedia
Patents
“Everything”
Linked to Millions of Articles
Answering Questions for Chemists
Questions a chemist might ask…
What is the melting point of n-butanol?
What is the chemical structure of Xanax?
Chemically, what is phenolphthalein?
What are the stereocenters of cholesterol?
Where can I find publications about xylene?
What are the different trade names for Ketoconazole?
What is the NMR spectrum of Aspirin?
What are the safety handling issues for Thymol Blue?
What is the structure of Flibanserin?
Complex Data and Information
Various Searches
Structure searching
Substructure searching
Subset searching – choose from 200 data sources
Property searching
Searches are used in various ways by different types of chemists…
ChemSpider Searches
Caution! Question Everything!
Vancomycin
Who will curate?
PubChem is not resourced to clean these errors
How would you clean such a large dataset?
Vancomycin on ChemSpider
1 compound – discussions over 3 days
The EXPERTS must get it right?!
Wikipedia, C&E News, PubChem
C&E News (from ACS)
“Lathosterol”
“Lathosterol” Removed
 
“Lathosterol” on PubChem
Crowd-sourcing Chemistry Curation
Crowd-sourced curation: identify and tag errors, edit names and synonyms, identify records to deprecate
Citizen Scientists
Become a Data Source
 
Synthesis Procedures
Links to Data or Deposit Data
Your Blog Posted Online?
Upload Spectral Data, OPEN Data?
Data as DOIs
Primary Data for Chemistry Available for the First Time
… Thieme is the first publisher to make primary chemistry data accessible worldwide
Analytical data, from various experiments, is the foundation of research work and scientific papers
From now on, primary data will be registered and made available online using digital object recognition in the form of Digital Object Identifiers (DOI)
Linking Data By DOI
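As a rough illustration of linking data by DOI, the sketch below turns a DOI stored alongside a compound record into a resolvable link through the public doi.org resolver. The record layout, the function name `doi_to_url`, and the DOI value itself are hypothetical placeholders and are not taken from the slides or from any publisher's system.

```python
from urllib.parse import quote

# Public DOI resolver; any registered DOI appended to this prefix redirects
# to the landing page registered for that object.
DOI_RESOLVER = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Turn a bare DOI (or one prefixed with 'doi:') into a resolver URL."""
    cleaned = doi.strip()
    if cleaned.lower().startswith("doi:"):
        cleaned = cleaned[4:].strip()
    # Every DOI starts with the directory indicator "10." and has a suffix
    # separated from the prefix by a slash.
    if not cleaned.startswith("10.") or "/" not in cleaned:
        raise ValueError(f"not a DOI: {doi!r}")
    # Keep '/' literal; percent-encode any other reserved characters.
    return DOI_RESOLVER + quote(cleaned, safe="/")


# Hypothetical record: the compound name and DOI are placeholders.
record = {
    "compound": "example compound",
    "data_doi": "doi:10.1234/example-spectrum.001",
}
print(doi_to_url(record["data_doi"]))
```

Storing the bare DOI and building the URL on demand keeps the record independent of any one resolver address.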
Semantic Mark-up for Chemistry
Semantic mark-up for chemistry is here
RSC Project Prospect (structure linking, IUPAC Gold Book ontology and other ontologies). Based on the OSCAR system
ChemSpider Journal of Chemistry
Nature Publishing Group compound linking
ChemSpider and Publishing
Curation led to a set of validated dictionaries
Integrated entity extraction with validated name dictionaries
Additional dictionaries gave reactions, groups, families, hardware and software vendors, etc.
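To make the idea of entity extraction against a validated name dictionary concrete, here is a minimal sketch of dictionary-based matching. It is not the ChemSpider or RSC implementation: the dictionary contents, the record identifiers, and the function names are hypothetical, and a production system would need tokenisation, synonym handling and disambiguation well beyond this.

```python
import re

# Hypothetical validated dictionary: lower-cased name -> record identifier.
# The identifiers are placeholders, not real ChemSpider IDs.
NAME_DICTIONARY = {
    "cholesterol": "compound-A",
    "thymol blue": "compound-B",
    "aspirin": "compound-C",
}


def build_pattern(names):
    """Compile one case-insensitive pattern matching any dictionary name."""
    # Longest names first, so a multi-word name is preferred over any
    # shorter name that starts at the same position.
    ordered = sorted(names, key=len, reverse=True)
    alternation = "|".join(re.escape(name) for name in ordered)
    return re.compile(r"\b(" + alternation + r")\b", re.IGNORECASE)


def extract_entities(text, dictionary=NAME_DICTIONARY):
    """Return (offset, matched text, record identifier) for each hit."""
    pattern = build_pattern(dictionary)
    return [
        (m.start(), m.group(0), dictionary[m.group(0).lower()])
        for m in pattern.finditer(text)
    ]


sentence = "Aspirin and Thymol Blue were compared with cholesterol."
for offset, name, record_id in extract_entities(sentence):
    print(offset, name, record_id)
```

Because every match is looked up in a curated dictionary, the quality of the mark-up depends directly on the quality of the curation, which is the point the slide makes.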
ChemMantis and CJOC
Name-Structure Pairs
Deposit Structures
Species – linked to Wikipedia
Semantic Linking of Structures
What would you want to link off a structure?
Chemical suppliers
Other publications
Analytical data
Related reactions
Wikipedia
Patents
“Everything”
RSC’s Project Prospect
In Development: ChemSpider Synthesis
ChemSpider Synthesis will be a home for all things “synthetic”
An online resource for synthetic procedures from blogs, other online resources, RSC supplementary info, other publishers, etc.
Public peer-review and feedback for synthetic procedures
RSC Supplementary Info
Online Journals and Live Data
ChemSpider Everywhere: Embed
ChemSpider Everywhere: Spectral Game
ChemSpider Everywhere: Crowdsourced Curation of Spectra
ChemSpider Everywhere: ChemMobi
Building a Structure Centric Community for Chemists
ChemSpider Web Services
ChemSpider Everywhere
Linked from Wikipedia
Linked from Open Notebook Science sites
Linked from blogs using Structure/Spectra
Integrated into structure drawing packages such as ACD/ChemSketch, Symyx Draw, Open Source applets
 
Where is ChemSpider Lacking?
ChemSpider is limited to “defined chemicals”. No support for:
Polymers
Minerals
Markush structures
ChemSpider is very dependent on InChIs
Stereochemistry around non-carbon centers
Organometallics are not correctly represented
There are millions of errors on ChemSpider
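Since the slide stresses the dependence on InChIs, the sketch below shows how an InChI string splits into its slash-separated layers and how one might check whether any stereochemistry layer is present. It only inspects the text of the identifier; it does not validate chemistry and is not how ChemSpider processes InChIs. The function names are hypothetical, and the example string is the standard InChI for ethanol.

```python
# Layer prefixes that carry stereochemistry in an InChI string:
# b = double-bond stereo, t and m = tetrahedral stereo, s = stereo type.
STEREO_PREFIXES = {"b", "t", "m", "s"}


def split_inchi_layers(inchi):
    """Split an InChI into (version, [(layer prefix, layer body), ...])."""
    if not inchi.startswith("InChI="):
        raise ValueError(f"not an InChI string: {inchi!r}")
    version, *raw_layers = inchi[len("InChI="):].split("/")
    layers = []
    for index, layer in enumerate(raw_layers):
        if not layer:
            continue  # ignore empty segments rather than guess at them
        if index == 0 and not layer[0].islower():
            # The formula layer has no prefix letter of its own.
            layers.append(("formula", layer))
        else:
            layers.append((layer[0], layer[1:]))
    return version, layers


def has_stereo_layer(inchi):
    """True if the identifier contains any stereochemistry layer."""
    _, layers = split_inchi_layers(inchi)
    return any(prefix in STEREO_PREFIXES for prefix, _ in layers)


ethanol = "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3"
version, layers = split_inchi_layers(ethanol)
print(version)
for prefix, body in layers:
    print(prefix, body)
print(has_stereo_layer(ethanol))
```

A list of pairs is returned rather than a dictionary because the same prefix letter can occur more than once in a single identifier, for example after an isotopic layer.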
What’s next?
Keep cleaning and depositing data
Enable discovery via the semantic web (RDF)
Integrate software: Symyx Jdraw, NMRShiftDB
Integrate RSC content – a massive archive!
Integrate RSC publishing workflows and databases
Continue Building Community for Chemistry
Building a public ADME/Tox database
Delivering ChemSpider Synthetic Pages
Delivering ChemSpider Analytical Data
Delivering ChemSpider Education
Project Focus
People Make Change Happen
You are invited…
Curate ChemSpider data and link to us
Deposit your data with us
Structures
Spectra
Synthesis procedures
ChemSpider Synthesis is under development
People Make Change Happen
ChemSpider was a “hobby project”
Housed in a basement and running off three servers – one bought, two built
Sensitive to weather and power stability
Went live at ACS Spring 2007 in Chicago
ca. 6000 visitors a day, >50,000 transactions daily
Organizations Scale Innovation
There is a Downside…
Thank you
[email_address]
Twitter: ChemSpiderman
www.chemspider.com/blog
SLIDES: www.slideshare.net/AntonyWilliams
