SlideShare a Scribd company logo
Taming the Wild, Wild West of Chemistry on the Internet. Maybe YOU Can Help?
Citizen Scientists Enable the Web Who is writing about chemical compounds on Wikipedia? Who is writing critical reviews of Chemistry online? Who is blogging about chemistry on the web?
For Synthesis…TotallySynthetic.com
Org Prep Daily  (Blog)
Molbank (Open Access Journal)
Synthetic Pages (Website)
Encyclopedic Articles (Wikipedia)
 
Chemistry online – An Overview Encyclopedic articles (Wikipedia) Chemical vendor databases Metabolic pathway databases Property databases Chemical Synthesis procedures Scientific publications  Chemical vendors Blogs Wikis  Open Notebook Science
What and who do you trust?
Compounds and Identifiers
What is ChemSpider? ChemSpider is: Building a Structure Centric Community for Chemists >23 million compounds, ca. 250 data sources A deposition and curation platform A publishing platform for the community Grows daily – more depositions, more links, more data sources
Search Cholesterol
Search Cholesterol
Search Cholesterol
Search Cholesterol
Search Cholesterol
Linked across the internet
Link off a structure in ChemSpider Chemical suppliers Other publications Analytical Data Related Reactions Wikipedia Patents “ Everything”
Linked to Millions of Articles
Answering Questions for Chemists Questions a chemist might ask… What is the melting point of n-butanol?  What is the chemical structure of Xanax? Chemically, what is phenolphthalein? What are the stereocenters of cholesterol? Where can I find publications about xylene? What are the different trade names for Ketoconazole? What is the NMR spectrum of Aspirin? What are the safety handling issues for Thymol Blue?
What is the structure of Flibanserin?
What is the structure of Flibanserin?
Complex Data and Information
Various Searches  Structure searching Substructure searching Subset searching – choose from 200 data sources Property searching Searches are used in various ways by different types of chemists…
ChemSpider Searches
ChemSpider Searches
Antony Williams vs Identifiers Passport ID Dad, Tony, others SSN Green Card License 5 email addresses ChemSpiderman (blog, Twitter account, Facebook, Friendfeed) OpenID … .
Aspirin vs Chemical Identifiers
Aspirin names and synonyms Text searches depend on correct association 335  suggested identifiers for Aspirin just on PubChem! Disambiguation dictionaries are necessary
 
 
 
The Final Search Strategy
All Those Names, One Structure
Connections Can Lead Anywhere
The InChI Identifier
Multiple Layers
InChIStrings Hash to InChIKeys
Oleoylethanolamine
Search Engine Dependencies
Search Engine Dependencies
Vancomycin
Vancomycin Who will curate? How would you clean such a large dataset?
Chemistry on the Internet Much of the information is based on assertions and  User Beware! The Quality of information available is diverse and how does the user know what is and is not “correct”?
Caution! Question Everything!
Question Everything online: www.dhmo.org
Vancomycin on ChemSpider
Vancomycin
Vancomycin Search Molecular SKELETON Search Full Molecule
Full  Skeleton  Search: 104 Hits
Full  Molecule  Search: 4 Hits
The EXPERTS must get it right?!
Wikipedia, C&E News, PubChem C&E News (from ACS)
“ Lathosterol”
“ Lathosterol”
“ Lathosterol”
“ Lathosterol” Removed
 
“ Lathosterol” on PubChem
Crowd-sourcing Chemistry Curation Crowd-sourced curation: identify/tag errors, edit names, synonyms, identify records to deprecate
Citizen Scientists
Become a Data Source
 
Synthesis Procedures
Links to Data or Deposit Data
Your  Blog Posted Online?
Upload Spectral Data, OPEN Data?
Semantic Mark-up for Chemistry Semantic mark-up for chemistry is here RSC project prospect (structure linking, IUPAC Gold Book ontology and other ontologies). Based on the OSCAR system  ChemSpider Journal of Chemistry Nature publishing group compound linking
ChemMantis and CJOC
Name-Structure Pairs
Deposit Structures
Species – linked to Wikipedia
In Development  ChemSpider Synthesis ChemSpider Synthesis will be a home for all things “synthetic”  An online resource for synthetic procedures from blogs, other online resources, RSC supplementary info, other publishers etc. Public peer-review and feedback for synthetic procedures
Online Journals and Live Data
ChemSpider Everywhere : Embed
ChemSpider Everywhere: Spectral Game
ChemSpider Everywhere Crowdsourced Curation of Spectra
ChemSpider Everywhere ChemMobi Building a Structure Centric Community for Chemists
ChemSpider Everywhere Linked from Wikipedia Linked from Open Notebook Science sites  Linked from Blogs using Structure/Spectra Integrated into structure drawing packages such as ACD/ChemSketch, Symyx Draw, Open Source applets
Where is ChemSpider Lacking? ChemSpider is limited to “defined chemicals”. No support for: Polymers Minerals Markush structures  ChemSpider is very dependent on InChIs Stereochemistry around non-carbon centers Organometallics are not correctly represented There are  millions of errors  on ChemSpider
What’s next? Keep cleaning and depositing data Enable discovery via the semantic web (RDF) Integrate software: Symyx Jdraw, NMRShiftDB Integrate RSC content – a massive archive! Integrate RSC publishing workflows and databases
Continue Building Community for Chemistry Building a Public ADME/Tox database Delivering ChemSpider Synthetic Pages Delivering ChemSpider Analytical Data Delivering ChemSpider Education Project Focus
People  Make Change Happen You are invited.. Curate ChemSpider data and link to us Deposit your data with us Structures Spectra Synthesis procedures ChemSpider Synthesis is under development
People  Make Change Happen ChemSpider was a “hobby project”  Housed in a basement and running off three servers – one bought, two built Sensitive to weather and power stability Went live at ACS Spring 2007 in Chicago ca. 6000 visitors a day, >50,000 transactions daily
Organizations Scale Innovation
Thank you [email_address] Twitter: ChemSpiderman www.chemspider.com/blog SLIDES: www.slideshare.net/AntonyWilliams

More Related Content

PPT
PPT
Citizen Scientists and Their Contributions to Internet Based Chemistry
PPT
PPTX
RSC ChemSpider – Building An Internet Based Community For Chemists
PPT
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
PPT
Citizen Scientists and Their Contributions to Internet Based Chemistry
RSC ChemSpider – Building An Internet Based Community For Chemists
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn

What's hot (20)

PPT
PPT
Supporting the exploding dimensions of the chemical sciences via global netwo...
PPT
Text Mining for Chemistry and Building a Public Platform for Document Markup
PPT
How Internet Resources Are Providing a Collaborative Community for Chemistry
PPT
Connecting Chemists To The Internet Training at Burlington House 2010
PPT
Building a semantic chemistry platform with the royal society of chemistry
PPTX
Serving the medicinal chemistry community with Royal Society of Chemistry che...
PPT
Building an integrated system for chemistry markup and online publishing inte...
PDF
2013 CrossRef Annual Meeting CrossRef Overview Ed Pentz
PPT
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
PPT
A Presentation At Nature Publishing Group Crowdsourcing, Collaborations And T...
PPTX
Sci finder ppt
PPT
The royal society of chemistry and its adoption of semantic web technologies ...
PPT
Chem spider introduction spring 2011
PPTX
Scifinder scholar ppt
PPT
SciFinder Scholar
PPT
Enhancing Discoverability Across Royal Society Of Chemistry Content By Integr...
PPT
The application of cloud computing to royal society of chemistry data platforms
Supporting the exploding dimensions of the chemical sciences via global netwo...
Text Mining for Chemistry and Building a Public Platform for Document Markup
How Internet Resources Are Providing a Collaborative Community for Chemistry
Connecting Chemists To The Internet Training at Burlington House 2010
Building a semantic chemistry platform with the royal society of chemistry
Serving the medicinal chemistry community with Royal Society of Chemistry che...
Building an integrated system for chemistry markup and online publishing inte...
2013 CrossRef Annual Meeting CrossRef Overview Ed Pentz
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
A Presentation At Nature Publishing Group Crowdsourcing, Collaborations And T...
Sci finder ppt
The royal society of chemistry and its adoption of semantic web technologies ...
Chem spider introduction spring 2011
Scifinder scholar ppt
SciFinder Scholar
Enhancing Discoverability Across Royal Society Of Chemistry Content By Integr...
The application of cloud computing to royal society of chemistry data platforms
Ad

Viewers also liked (6)

PPT
Integrating Patents with Research Data
PPTX
Vitamin e
PPTX
กิจกรรมที่11.2
PDF
Learn BEM: CSS Naming Convention
PDF
10 Insightful Quotes On Designing A Better Customer Experience
PPTX
How to Build a Dynamic Social Media Plan
Integrating Patents with Research Data
Vitamin e
กิจกรรมที่11.2
Learn BEM: CSS Naming Convention
10 Insightful Quotes On Designing A Better Customer Experience
How to Build a Dynamic Social Media Plan
Ad

Similar to Taming The Wild West Of Internet Based Chemistry You Can Help (20)

PPT
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
PPT
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
PPT
ChemSpider and How The Wisdom Of The Crowds Can Improve The Quality Of ...
PPT
ChemSpider hosting linking and curating chemistry data for the community
PPT
Chemspider hosting linking and curating chemistry data for the community
PPT
Integrating and curating internet based chemistry resources to serve life sci...
PPT
ChemSpider – The Vision and Challenges Associated with Building a Free Online...
PPT
AZ of Chemspider February 2011
PPT
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
PPT
Using Text-Mining and Crowdsourced Curation to Build a Structure Centric Comm...
PPT
ChemSpider as a Platform for Crowd Participation in Curating Chemistry
PPT
Crowdsourcing, Collaborations And Text Mining In A World Of Open Chemistry
PDF
RSC ChemSpider for students – The Free Chemistry Database for the Community
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider and How The Wisdom Of The Crowds Can Improve The Quality Of ...
ChemSpider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the community
Integrating and curating internet based chemistry resources to serve life sci...
ChemSpider – The Vision and Challenges Associated with Building a Free Online...
AZ of Chemspider February 2011
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
Using Text-Mining and Crowdsourced Curation to Build a Structure Centric Comm...
ChemSpider as a Platform for Crowd Participation in Curating Chemistry
Crowdsourcing, Collaborations And Text Mining In A World Of Open Chemistry
RSC ChemSpider for students – The Free Chemistry Database for the Community

Recently uploaded (20)

PPTX
Chapter 5: Probability Theory and Statistics
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
Tartificialntelligence_presentation.pptx
PDF
Mushroom cultivation and it's methods.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
1. Introduction to Computer Programming.pptx
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Heart disease approach using modified random forest and particle swarm optimi...
Chapter 5: Probability Theory and Statistics
OMC Textile Division Presentation 2021.pptx
Tartificialntelligence_presentation.pptx
Mushroom cultivation and it's methods.pdf
Approach and Philosophy of On baking technology
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
A novel scalable deep ensemble learning framework for big data classification...
Programs and apps: productivity, graphics, security and other tools
Digital-Transformation-Roadmap-for-Companies.pptx
A Presentation on Artificial Intelligence
Assigned Numbers - 2025 - Bluetooth® Document
Building Integrated photovoltaic BIPV_UPV.pdf
Hindi spoken digit analysis for native and non-native speakers
TLE Review Electricity (Electricity).pptx
Zenith AI: Advanced Artificial Intelligence
1. Introduction to Computer Programming.pptx
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
cloud_computing_Infrastucture_as_cloud_p
Heart disease approach using modified random forest and particle swarm optimi...

Taming The Wild West Of Internet Based Chemistry You Can Help