Crowd-Sourcing to Build a Structure Centric Community for Chemists  Antony Williams Whitney Symposium 2008 - Networks
Social Networking for Chemists
Network Drug Discovery Tools www.curehunter.com
Beware the Networks!
Collaborative Authoring in Academia Group level collaboration via Wikis
Collaborative Authoring for Drug Discovery Pfizerpedia
Collaborative Knowledge Management  for Chemists – Wikipedia, Built by a Network
and biologists…WikiProteins
WikiProteins What  Is Tegafur?
Commonly Lacking… Approaches generally lack “structural intelligence” Structures have properties (Mw, MF, exp. & pred. properties) Collections of structures need to be searchable by structure Most data collections are “self-contained” and rarely connecting to other resources via “structure”
A Search Engine for Chemists Questions a chemist might ask… What is the melting point of n-butanol?  What is the chemical structure of Xanax? Chemically, what is viagra? What are the stereocenters of cholesterol? Where can I find publications about Taxol? What are the different trade names for Ketoconazole? What is the NMR spectrum of Aspirin? What are the safety handling issues for Thymol Blue? ChemSpider can answer all of these questions
ChemSpider Data Content Over 20 million unique chemical structures : Online Databases –PubChem, Drugbank, HMDB, Wikipedia Chemical Vendors – over 40 different vendors and growing Personal Depositions – individual contributions Journal Publishers  Content database vendors Analytical data collections Patents (9 MILLION Structures to search patents ) Web scraping Content is linked back to the original data sources
A Structure Centric Community for Chemists A FREE ACCESS platform for deposition, management, curation, annotation and extension of information associated with chemical structures Semantically connect to other sites providing access to knowledge, data and information of determined quality Search by alphanumeric text, chemical structure and substructure and combination searches Predict properties for submitted structures
Tell me about Aspirin
Tell me about Aspirin
Links out to KEGG Kyoto Encyclopedia of Genes and Genomes
Tell me about Aspirin
Tell me About Aspirin
Tell me about Aspirin
Tell me about Aspirin
Abstract Compounds? Is there any information about “Quesnoin”? Type in the name (and there may be many) or other identifier Paste a chemical structure Draw the structure
Example Search
Example Search
Example Search 2 What compounds have a mass of 300+/-0.001? or search a combination of intrinsic/predicted properties
Example Search 2
Complex Search
Search Open Access Journals – ChemSpider
Search PubMed – ChemSpider
The Quality of Data Online… Aggregating data opens up quality issues Structure-identifier associations are “dirty” Structures are COMMONLY incorrect – stereochem issues Manual curation of small databases is enough work – what about millions of structures? Structures are far from perfect. What is a “correct structure”? Full stereochemistry?  Historical timeline of structure? Who is the authority?
Who holds THE Quality Authority? Chemical Abstracts Service is the structural authority today. 1400 (?) employees, world standard in chemistry information 101 years of knowledge, process and expertise. MANUAL curation is key. Robotic curation is enabling How can an online, free access system peacefully co-exist with the authority?
Quality is a Major Issue- Search Butanol
Crowd-sourcing Database Compilation
Wikipedia – Crowdsourcing Chemistry
Wikipedia Chemistry Curation project Only ca. 5000 organic structures, 7000 total structures MONTHS of work so far for a team of 6 people Many errors removed in the process. Curation process is a daily event for users/depositors Slow and torturous process for stereo molecules.
Thymol Blue on ChemSpider Data online includes: UV-vis spectrum Measured experimental properties Link to Wikipedia article Links to chromatography details Multiple identifiers/trade names etc. Links to vendors/suppliers/other databases Safety information
Differences between ChemSpider/Wikipedia No Analytical Data Active editors – about 50 (?) Active depositors/curators – 30  No Prediction of properties ???? 5000 people/day; 1100 registered Detailed compound monographs Compound monographs linked Text Complex queries – Properties, Text, structure/substructure, OA publishers, Data  Sources, … ~5000 organics, 2000 others >20 million unique structures Wikipedia ChemSpider
Differences between Wikipedia/ChemSpider Growing reputation as focused on quality Worldwide reputation as quality source Chemistry is the focus of ‘Spider Chemistry is a subset of the ‘Pedia Mixed “licensing” GFL licensing for everything Growing team of WP:Chem advocates, curators and admins Strong team of WP:Chem advocates, curators and admins “ Out of a basement” on three servers and 5 volunteers Established infrastructure and Wikipedia Foundation Team Primarily  Microsoft .NET technologies with OS components  Supported by tried and tested Media-Wiki platform. ChemSpider Wikipedia
Crowd-sourcing Curation How to curate data for millions of structures?  Robot processes can clean up depositions Search for Chloride and check molecular formula for Cl Check for stereochemistry and remove names with stereo  Provide a simple-to-use platform to curate, annotate and tag data  Provide curator administration to prevent vandalism (Veropedia)
Multi-level Curation and Approval
Post Comments Anyone can “Post Comments” associated with a structure. To curate data we require login to track
Crowd-sourcing Chemistry Crowd-sourced curation: identify and tag errors, edit names, synonyms, identify records for deprecation ALSO Crowd-sourced deposition: anyone can deposit data (structures, text, images, analytical data)
But, when registered and logged in… Ability to curate and add to the database Add structures “ Clean” structures Add data (spectra, CIFs, images)  Add links to other pages (URLs) Add publication details
Adding to the Database - Structure
Adding New Text Data Add Publication Add Identifier Add URL
Adding Supplementary Info to a Structure
Can ChemSpider Enable Discovery? Yes, chemists can search by text, structure, substructure or properties to look at relationships and probe drug discovery
ChemSpider – Research in Progress Supporting Open Notebook Science as a repository – JC Bradley at Drexel University For the purpose of online virtual screening  Applying descriptors of various types to filter a database of 20 million compounds In progress: Utilizing SimBioSys’ LASSO Descriptor Collaboration based on NISS’ ChemModLab
LASSO  Ligand Activity by Surface Similarity Order
LASSO Descriptors on ChemSpider SEMANTIC WEB in action
LASSO Searching Method 1 Ask the question “What are the top 1000 molecules with similar LASSO descriptors to the actives for the Estrogen Receptor”
It WORKS - Enrichment Plot 60% of the actives were recovered in the top 1% of the database. “ Environmental binders” are weak binders  The top ranked compounds may well be active ER binders Likely candidates for experimental investigation
Tipping Point Tipping point  - the  point  at which a slow gradual change becomes irreversible and then proceeds with gathering pace
ChemSpider Forums/Blogs Forum.chemspider.com www.chemspider.com/blog
ChemSpider TouchGraph
What would we most like to do? Enable “Collaborative Science”. What would that look like? Access to chemical supplies when people need them Awareness of available literature, patents, databases of curated content – whether Open Access or not. Transaction fees (or not) are between user and provider Host Open Notebook Science exchanges
“ ChemSpider Inside” Instrument vendors integrated ChemSpider to their metabolism ID project – ChemSpider linked to all Mass Spec Intruments doing Metabolite ID? Wikipedia roundtrip linking to ChemSpider Google indexing ChemSpider at “fixed rate” Integration to desktop drawing packages Members of Microsoft BioIT Alliance Discussions on Taverna’s Workflow Sourceforge group Hosting Open Access articles shortly…
Where to from here?  Short term Integrated text and structure/substructure searching of the Open Access literature is in development Web-based scraping of structure-based information – examples in place Enhanced web services layer to integrate searches Deposit updated Patent Database (9 million structures) Reaction handling and deposition
Where to from here?  Mid-term Spidering for Chemistry – extract data from articles, webpages and data sources AND stay within copyright WiChempedia project – wiki-layers on top of ChemSpider, alongside Wikipedia curation project.  Deeper integration to text-based searching and conversion of chemical names to structures for online structure searching:  Improved integration with NCBI Entrez system Deliver “dedicated websites” for specific publishers
Where to from here?  Mid-Term An extensible datamodel “on the fly” allows us to easily expand to integrate abstract data to structures  Data mine and curate “parameters” – physicochemical and physiological parameters to enable QSAR analysis, data modeling and provision of models online (UNC-Chapel Hill, NISS)
Our Challenges There are “no employees” ChemSpider is non-funded System is hyper-dependent on ISP, power and limited compute power We are upsetting a lot of people – evangelists, cheminformatics system vendors, publishers, data content providers
Acknowledgments  The ChemSpider team of volunteer developers ChemSpider Advisory Group Our curators, depositors and users Suppliers of commercial software – Microsoft, ACD/Labs, OpenEye, ChemAxon, SimBioSys SureChem – Structure Based Online Patent Searching
Further reading www.chemspider.com/blog Internet-based tools for communication and collaboration in chemistry, Drug Discovery Today, Volume 13, Numbers 11/12, June 2008 502-506,  doi:10.1016/j.drudis.2008.03.015 A perspective of publicly accessible/open-access chemistry databases, Drug Discovery Today, Volume 13, Numbers 11/12, June 2008, 495-501,  doi:10.1016/j.drudis.2008.03.017

More Related Content

PPT
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
PPT
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
PPT
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
PPT
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
PPT
ChemSpider and How The Wisdom Of The Crowds Can Improve The Quality Of ...
PPT
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
PPT
Integrating and curating internet based chemistry resources to serve life sci...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
ChemSpider and How The Wisdom Of The Crowds Can Improve The Quality Of ...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
Integrating and curating internet based chemistry resources to serve life sci...

What's hot (20)

PPT
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
PPT
PPT
Hosting a compound centric community resource for chemistry data
PPT
PPT
RSC ChemSpider is the online chemistry database where community contributions...
PPTX
ACS 248th Paper 71 ChAMP Project
PDF
Acs collaborative computational technologies for biomedical research an enabl...
PDF
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
PPT
PPT
ChemSpider – The Vision and Challenges Associated with Building a Free Online...
PPT
PPT
eScience Resources for the Chemistry Community from the Royal Society of Chem...
PPT
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
PPT
The Great Promise of Online Data for Chemistry and the Life Sciences
PPT
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
PPT
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
PPT
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
PPT
Structure verification and elucidation using the ChemSpider database
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
Hosting a compound centric community resource for chemistry data
RSC ChemSpider is the online chemistry database where community contributions...
ACS 248th Paper 71 ChAMP Project
Acs collaborative computational technologies for biomedical research an enabl...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
ChemSpider – The Vision and Challenges Associated with Building a Free Online...
eScience Resources for the Chemistry Community from the Royal Society of Chem...
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
The Great Promise of Online Data for Chemistry and the Life Sciences
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Structure verification and elucidation using the ChemSpider database
Ad

Viewers also liked (7)

PPT
Saber Y Hacer
PPT
Poteo
PPT
Inspire - Jack Heath
PPT
Curso De Supply Chain Management
PPS
Leuke Foto S
Saber Y Hacer
Poteo
Inspire - Jack Heath
Curso De Supply Chain Management
Leuke Foto S
Ad

Similar to Whitney Symposium Lecture June 2008 (18)

PPT
Chemspider hosting linking and curating chemistry data for the community
PPT
ChemSpider hosting linking and curating chemistry data for the community
PPT
Taming The Wild West Of Internet Based Chemistry You Can Help
PPT
How the web has weaved a web of interlinked chemistry data final
PPT
PDF
A Presentation at Nature Publishing Group Crowdsourcing, Collaborations and T...
PPT
A Presentation At Nature Publishing Group Crowdsourcing, Collaborations And T...
PPT
AZ of Chemspider February 2011
PPT
PPT
Whitney Symposium Lecturejune 2008 1220331644496491 9
PPT
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
PPT
Crowdsourcing, Collaborations And Text Mining In A World Of Open Chemistry
PPT
How Community Crowdsourcing and Social Networking is Helping to Build a Quali...
PPT
ChemSpider Overview Presentation at Special Libraries Association
Chemspider hosting linking and curating chemistry data for the community
ChemSpider hosting linking and curating chemistry data for the community
Taming The Wild West Of Internet Based Chemistry You Can Help
How the web has weaved a web of interlinked chemistry data final
A Presentation at Nature Publishing Group Crowdsourcing, Collaborations and T...
A Presentation At Nature Publishing Group Crowdsourcing, Collaborations And T...
AZ of Chemspider February 2011
Whitney Symposium Lecturejune 2008 1220331644496491 9
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
Crowdsourcing, Collaborations And Text Mining In A World Of Open Chemistry
How Community Crowdsourcing and Social Networking is Helping to Build a Quali...
ChemSpider Overview Presentation at Special Libraries Association

Recently uploaded (20)

PDF
HVAC Specification 2024 according to central public works department
PDF
advance database management system book.pdf
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PPTX
TNA_Presentation-1-Final(SAVE)) (1).pptx
PPTX
Virtual and Augmented Reality in Current Scenario
PDF
Hazard Identification & Risk Assessment .pdf
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PDF
Trump Administration's workforce development strategy
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
IGGE1 Understanding the Self1234567891011
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PPTX
20th Century Theater, Methods, History.pptx
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
HVAC Specification 2024 according to central public works department
advance database management system book.pdf
Unit 4 Computer Architecture Multicore Processor.pptx
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
What if we spent less time fighting change, and more time building what’s rig...
Practical Manual AGRO-233 Principles and Practices of Natural Farming
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
TNA_Presentation-1-Final(SAVE)) (1).pptx
Virtual and Augmented Reality in Current Scenario
Hazard Identification & Risk Assessment .pdf
Share_Module_2_Power_conflict_and_negotiation.pptx
Trump Administration's workforce development strategy
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Chinmaya Tiranga quiz Grand Finale.pdf
IGGE1 Understanding the Self1234567891011
LDMMIA Reiki Yoga Finals Review Spring Summer
20th Century Theater, Methods, History.pptx
A powerpoint presentation on the Revised K-10 Science Shaping Paper

Whitney Symposium Lecture June 2008

  • 1. Crowd-Sourcing to Build a Structure Centric Community for Chemists Antony Williams Whitney Symposium 2008 - Networks
  • 3. Network Drug Discovery Tools www.curehunter.com
  • 5. Collaborative Authoring in Academia Group level collaboration via Wikis
  • 6. Collaborative Authoring for Drug Discovery Pfizerpedia
  • 7. Collaborative Knowledge Management for Chemists – Wikipedia, Built by a Network
  • 9. WikiProteins What Is Tegafur?
  • 10. Commonly Lacking… Approaches generally lack “structural intelligence” Structures have properties (Mw, MF, exp. & pred. properties) Collections of structures need to be searchable by structure Most data collections are “self-contained” and rarely connecting to other resources via “structure”
  • 11. A Search Engine for Chemists Questions a chemist might ask… What is the melting point of n-butanol? What is the chemical structure of Xanax? Chemically, what is viagra? What are the stereocenters of cholesterol? Where can I find publications about Taxol? What are the different trade names for Ketoconazole? What is the NMR spectrum of Aspirin? What are the safety handling issues for Thymol Blue? ChemSpider can answer all of these questions
  • 12. ChemSpider Data Content Over 20 million unique chemical structures : Online Databases –PubChem, Drugbank, HMDB, Wikipedia Chemical Vendors – over 40 different vendors and growing Personal Depositions – individual contributions Journal Publishers Content database vendors Analytical data collections Patents (9 MILLION Structures to search patents ) Web scraping Content is linked back to the original data sources
  • 13. A Structure Centric Community for Chemists A FREE ACCESS platform for deposition, management, curation, annotation and extension of information associated with chemical structures Semantically connect to other sites providing access to knowledge, data and information of determined quality Search by alphanumeric text, chemical structure and substructure and combination searches Predict properties for submitted structures
  • 14. Tell me about Aspirin
  • 15. Tell me about Aspirin
  • 16. Links out to KEGG Kyoto Encyclopedia of Genes and Genomes
  • 17. Tell me about Aspirin
  • 18. Tell me About Aspirin
  • 19. Tell me about Aspirin
  • 20. Tell me about Aspirin
  • 21. Abstract Compounds? Is there any information about “Quesnoin”? Type in the name (and there may be many) or other identifier Paste a chemical structure Draw the structure
  • 24. Example Search 2 What compounds have a mass of 300+/-0.001? or search a combination of intrinsic/predicted properties
  • 27. Search Open Access Journals – ChemSpider
  • 28. Search PubMed – ChemSpider
  • 29. The Quality of Data Online… Aggregating data opens up quality issues Structure-identifier associations are “dirty” Structures are COMMONLY incorrect – stereochem issues Manual curation of small databases is enough work – what about millions of structures? Structures are far from perfect. What is a “correct structure”? Full stereochemistry? Historical timeline of structure? Who is the authority?
  • 30. Who holds THE Quality Authority? Chemical Abstracts Service is the structural authority today. 1400 (?) employees, world standard in chemistry information 101 years of knowledge, process and expertise. MANUAL curation is key. Robotic curation is enabling How can an online, free access system peacefully co-exist with the authority?
  • 31. Quality is a Major Issue- Search Butanol
  • 34. Wikipedia Chemistry Curation project Only ca. 5000 organic structures, 7000 total structures MONTHS of work so far for a team of 6 people Many errors removed in the process. Curation process is a daily event for users/depositors Slow and torturous process for stereo molecules.
  • 35. Thymol Blue on ChemSpider Data online includes: UV-vis spectrum Measured experimental properties Link to Wikipedia article Links to chromatography details Multiple identifiers/trade names etc. Links to vendors/suppliers/other databases Safety information
  • 36. Differences between ChemSpider/Wikipedia No Analytical Data Active editors – about 50 (?) Active depositors/curators – 30 No Prediction of properties ???? 5000 people/day; 1100 registered Detailed compound monographs Compound monographs linked Text Complex queries – Properties, Text, structure/substructure, OA publishers, Data Sources, … ~5000 organics, 2000 others >20 million unique structures Wikipedia ChemSpider
  • 37. Differences between Wikipedia/ChemSpider Growing reputation as focused on quality Worldwide reputation as quality source Chemistry is the focus of ‘Spider Chemistry is a subset of the ‘Pedia Mixed “licensing” GFL licensing for everything Growing team of WP:Chem advocates, curators and admins Strong team of WP:Chem advocates, curators and admins “ Out of a basement” on three servers and 5 volunteers Established infrastructure and Wikipedia Foundation Team Primarily Microsoft .NET technologies with OS components Supported by tried and tested Media-Wiki platform. ChemSpider Wikipedia
  • 38. Crowd-sourcing Curation How to curate data for millions of structures? Robot processes can clean up depositions Search for Chloride and check molecular formula for Cl Check for stereochemistry and remove names with stereo Provide a simple-to-use platform to curate, annotate and tag data Provide curator administration to prevent vandalism (Veropedia)
  • 40. Post Comments Anyone can “Post Comments” associated with a structure. To curate data we require login to track
  • 41. Crowd-sourcing Chemistry Crowd-sourced curation: identify and tag errors, edit names, synonyms, identify records for deprecation ALSO Crowd-sourced deposition: anyone can deposit data (structures, text, images, analytical data)
  • 42. But, when registered and logged in… Ability to curate and add to the database Add structures “ Clean” structures Add data (spectra, CIFs, images) Add links to other pages (URLs) Add publication details
  • 43. Adding to the Database - Structure
  • 44. Adding New Text Data Add Publication Add Identifier Add URL
  • 45. Adding Supplementary Info to a Structure
  • 46. Can ChemSpider Enable Discovery? Yes, chemists can search by text, structure, substructure or properties to look at relationships and probe drug discovery
  • 47. ChemSpider – Research in Progress Supporting Open Notebook Science as a repository – JC Bradley at Drexel University For the purpose of online virtual screening Applying descriptors of various types to filter a database of 20 million compounds In progress: Utilizing SimBioSys’ LASSO Descriptor Collaboration based on NISS’ ChemModLab
  • 48. LASSO Ligand Activity by Surface Similarity Order
  • 49. LASSO Descriptors on ChemSpider SEMANTIC WEB in action
  • 50. LASSO Searching Method 1 Ask the question “What are the top 1000 molecules with similar LASSO descriptors to the actives for the Estrogen Receptor”
  • 51. It WORKS - Enrichment Plot 60% of the actives were recovered in the top 1% of the database. “ Environmental binders” are weak binders The top ranked compounds may well be active ER binders Likely candidates for experimental investigation
  • 52. Tipping Point Tipping point - the point at which a slow gradual change becomes irreversible and then proceeds with gathering pace
  • 55. What would we most like to do? Enable “Collaborative Science”. What would that look like? Access to chemical supplies when people need them Awareness of available literature, patents, databases of curated content – whether Open Access or not. Transaction fees (or not) are between user and provider Host Open Notebook Science exchanges
  • 56. “ ChemSpider Inside” Instrument vendors integrated ChemSpider to their metabolism ID project – ChemSpider linked to all Mass Spec Intruments doing Metabolite ID? Wikipedia roundtrip linking to ChemSpider Google indexing ChemSpider at “fixed rate” Integration to desktop drawing packages Members of Microsoft BioIT Alliance Discussions on Taverna’s Workflow Sourceforge group Hosting Open Access articles shortly…
  • 57. Where to from here? Short term Integrated text and structure/substructure searching of the Open Access literature is in development Web-based scraping of structure-based information – examples in place Enhanced web services layer to integrate searches Deposit updated Patent Database (9 million structures) Reaction handling and deposition
  • 58. Where to from here? Mid-term Spidering for Chemistry – extract data from articles, webpages and data sources AND stay within copyright WiChempedia project – wiki-layers on top of ChemSpider, alongside Wikipedia curation project. Deeper integration to text-based searching and conversion of chemical names to structures for online structure searching: Improved integration with NCBI Entrez system Deliver “dedicated websites” for specific publishers
  • 59. Where to from here? Mid-Term An extensible datamodel “on the fly” allows us to easily expand to integrate abstract data to structures Data mine and curate “parameters” – physicochemical and physiological parameters to enable QSAR analysis, data modeling and provision of models online (UNC-Chapel Hill, NISS)
  • 60. Our Challenges There are “no employees” ChemSpider is non-funded System is hyper-dependent on ISP, power and limited compute power We are upsetting a lot of people – evangelists, cheminformatics system vendors, publishers, data content providers
  • 61. Acknowledgments The ChemSpider team of volunteer developers ChemSpider Advisory Group Our curators, depositors and users Suppliers of commercial software – Microsoft, ACD/Labs, OpenEye, ChemAxon, SimBioSys SureChem – Structure Based Online Patent Searching
  • 62. Further reading www.chemspider.com/blog Internet-based tools for communication and collaboration in chemistry, Drug Discovery Today, Volume 13, Numbers 11/12, June 2008 502-506, doi:10.1016/j.drudis.2008.03.015 A perspective of publicly accessible/open-access chemistry databases, Drug Discovery Today, Volume 13, Numbers 11/12, June 2008, 495-501, doi:10.1016/j.drudis.2008.03.017