Web resources for the Carbohydrate Chemist   Roland Stenutz Stockholm University [email_address]
Content Technical considerations Resources by topic
Resources with static content
  GlycoWord – Encyclopedia: www.glycoforum.gr.jp/science/word/wordE.html
  IUPAC-IUBMB Nomenclature: http://www.chem.qmw.ac.uk/iupac/
  Structural analysis – Laboratory manual: www.casper.organ.su.se/sop/

Resources with static content
  Content is indexed by search engines, e.g. Google (www.google.com) and Altavista (www.altavista.com).
  Full-text searches work best.
  Searching for compounds can be very difficult; some tricks are needed to get useful results.
  Compare the number of hits when searching for “glucose” (3 × 10^6), “glucopyranose” (8 × 10^3) or “50-99-7” (2 × 10^3), the CAS Registry Number of D-glucose.

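A registry number such as “50-99-7” narrows a search sharply, but only if it is typed correctly. As a minimal sketch (not part of the original slides), the check below uses the standard CAS check-digit rule: the digits before the final one are weighted 1, 2, 3, ... from right to left, and the sum modulo 10 must equal the last digit. The function name `is_valid_cas` is an illustrative choice.

```python
import re


def is_valid_cas(cas: str) -> bool:
    """Return True if `cas` looks like a CAS Registry Number with a matching check digit."""
    match = re.fullmatch(r"(\d{2,7})-(\d{2})-(\d)", cas)
    if match is None:
        return False
    body = match.group(1) + match.group(2)
    check_digit = int(match.group(3))
    # Weight the body digits 1, 2, 3, ... starting from the rightmost one.
    total = sum(
        weight * int(digit)
        for weight, digit in enumerate(reversed(body), start=1)
    )
    return total % 10 == check_digit


if __name__ == "__main__":
    for term in ("50-99-7", "50-99-8", "glucose"):
        print(term, is_valid_cas(term))
```

Running such a check before pasting a number into a search engine catches most single-digit typing errors; it says nothing about whether the number is actually assigned to a compound.
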
Databases
  glycoSCIENCES.DE: www.glycosciences.de
  CAZy – Enzymes: afmb.cnrs-mrs.fr/CAZY/acc.html
  PDB – Crystal structures: www.rcsb.org/pdb/

Databases
  Large collections (100-100000 records) of related data.
  Searches can be complex, e.g. for (sub)structures.
  The meaning of a value is implied by the context; e.g. 5.15 might be an NMR chemical shift but not a price.
  There is very little garbage and redundancy in the databases.
  They can be difficult to find using search engines, since they contain little text that can be indexed.

Applications
  Most applications can be thought of as databases with an unlimited number of records.
  They require relatively complex interfaces, since a request must contain all the information needed to generate the data.
  They create content on the fly.

Interfaces - trivial
Interfaces
Interfaces - complex
Structured answers – easy to transfer
  ***** Hit 2 *****
  CC: CCSD:3436
  AU: Adeyeye A; Jansson PE; Lindberg B; Abaas S; Svenson SB
  TI: Structural studies of the Escherichia coli O-149 O-antigen polysaccharide
  CT: Carbohydr Res (1988) 176: 231-236
  FC: 014fe513
  AM: 1H-NMR
  BS: (GS) Escherichia coli, (GT) O149
  SB: Jansson PE
  DA: 01-08-1990
  MT: LPS
  AN: O-antigen
  SI: CBank:6914
  ----------------
  structure: Repeat-4)-b-D-GlcpNAc-(1-3)-b-D-GlcpNAc4,6Py-(1-3)-b-L-Rhap-(1-
  ================end of record

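A record in this tagged form is easy to transfer because every field can be split mechanically into a tag and a value. The sketch below is illustrative only: it assumes one `TAG: value` pair per line, as in the hit shown on the slide, and the names `parse_record` and `RECORD` are hypothetical. Lines without a `: ` separator (the hit banner and the rulers) are skipped.

```python
RECORD = """\
***** Hit 2 *****
CC: CCSD:3436
AU: Adeyeye A; Jansson PE; Lindberg B; Abaas S; Svenson SB
TI: Structural studies of the Escherichia coli O-149 O-antigen polysaccharide
CT: Carbohydr Res (1988) 176: 231-236
AM: 1H-NMR
MT: LPS
----------------
structure: Repeat-4)-b-D-GlcpNAc-(1-3)-b-D-GlcpNAc4,6Py-(1-3)-b-L-Rhap-(1-
================end of record
"""


def parse_record(text: str) -> dict:
    """Split a tagged record into a {tag: value} dictionary."""
    fields = {}
    for line in text.splitlines():
        # Split on the first ': ' only, so values may contain colons.
        tag, separator, value = line.partition(": ")
        if separator and tag.strip():
            fields[tag.strip()] = value.strip()
    return fields


if __name__ == "__main__":
    record = parse_record(RECORD)
    print(record["CC"])
    print(record["AU"].split("; "))
    print(record["structure"])
```

Splitting on the first `: ` only keeps values such as `CCSD:3436` and `176: 231-236` intact.
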
Content Technical considerations Resources by topic
Structure
  Complex Carbohydrate Structure Database (CarbBank): www.boc.chem.uu.nl/sugabase/databases.html (also at glycosciences.de)
  ECDB – E. coli O-antigen structures and NMR: www.casper.organ.su.se/ECDB/
  GlycoBase of USTL – oligosaccharides from amphibians: ustl.univ-lille1.fr/glycobase/

Conformation
  PDB – Protein Data Bank, “Brookhaven DB”. Protein structures, incl. glycoproteins: www.rcsb.org/pdb
  GlycoMaps Database, SWEET-II etc. Conformational databases and applications for oligosaccharides: www.glycosciences.de
  Disaccharide Database. Conformational maps for some disaccharides: www.cermav.cnrs.fr/databank/disaccharides

Spectroscopy
  SugaBase – NMR database, mainly 1H, often incomplete: www.boc.chem.uu.nl/sugabase/databases.html
  CASPER – NMR from structure & structure from NMR: www.casper.organ.su.se/casper
  GlycoFragments – MS fragmentation from structure: www.glycosciences.de

Enzymes, Lectins and Glycoproteins
  CAZy – Carbohydrate Active Enzymes: afmb.cnrs-mrs.fr/CAZY/
  3D Lectin Database: www.cermav.cnrs.fr/lectines/
  BPGD – Bacterial Polysaccharide Gene Database: www.microbio.usyd.edu.au/BPGD/default.htm

Using Internet resources
  The question must be chosen with care!
  Ask the same question in different ways.
  Ask different search engines and databases the same question and compare the results.
  Always verify the results!

Searching
  Even if you know exactly what information you want, it can be very difficult to find.
  Information is spread out over different locations, and the question may need paraphrasing.
  It is very difficult to get a complete answer, but you often get a hint about how to proceed. For example, you might not find the data sought but instead a reference to a paper that contains the data.

Portals
  One interface, several databases: glycoSCIENCES.DE
  Searchable by structure/substructure, bibliographic information, NMR and MS.
  Contains CarbBank, SugaBase and applications (3D structure).

Future directions
  Consortium for Functional Glycomics. Carbohydrate-protein interactions. Glycosylation disorders in knock-out mice: web.mit.edu/glycomics/consortium/
  Japanese Consortium for Glycobiology & Glycotechnology. Everything, and then some: www.jcgg.jp
  EuroCarbDB. Structure (primary & 3D) & spectroscopy (NMR, MS)
  Russian initiative. CarbBank/NMR (structure & NMR)

Future directions
  Cross-linking between resources – makes it easy to find related information.
  Portals – one interface to different resources.
  Better interfaces – current interfaces are often too complex.
  XML – allows data to be transferred directly to local applications.

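To illustrate the XML point, the sketch below writes a few fields of a database record to XML and reads them back with Python's standard `xml.etree.ElementTree` module. The element names `record` and `field` and the function names are hypothetical choices for this example; they are not a schema used by any of the resources listed here.

```python
import xml.etree.ElementTree as ET


def record_to_xml(fields: dict) -> str:
    """Serialise a {tag: value} record as a small XML document (hypothetical layout)."""
    root = ET.Element("record")
    for tag, value in fields.items():
        child = ET.SubElement(root, "field", name=tag)
        child.text = value
    return ET.tostring(root, encoding="unicode")


def xml_to_record(xml_text: str) -> dict:
    """Read the same layout back into a dictionary."""
    root = ET.fromstring(xml_text)
    return {child.get("name"): child.text for child in root}


if __name__ == "__main__":
    fields = {"CC": "CCSD:3436", "AM": "1H-NMR", "MT": "LPS"}
    xml_text = record_to_xml(fields)
    print(xml_text)
    print(xml_to_record(xml_text) == fields)
```

Because both sides agree on the markup, a local application can read the data without screen-scraping a web page.
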
Conclusion
  There is a wide range of carbohydrate-related resources available on the WWW.
  Many provide useful information, but all are rather limited in scope.
  There are problems transferring data between databases.
  The interfaces are difficult to use.
  Manuals or instructions are often missing.

WWW (Glibs workshop)
