SlideShare a Scribd company logo
Intro       GC      Protein detection          Nitrilases   NHases   PKS   THM




        Computational analysis of metagenomic data: delineation of
         compositional features and screens for desirable enzymes


                                   Konrad U. F¨rstner
                                              o

                                 Bork Group, EMBL
                                Promotionskolloquium



                                        04. Februar 2009
Intro          GC          Protein detection   Nitrilases   NHases   PKS   THM

 Table of content


        1   Introduction

        2   GC content analysis

        3   Protein detection workflow

        4   Nitrilases

        5   Nitril hydratases

        6   Polyketide synthases I

        7   Take home messages
Intro          GC          Protein detection   Nitrilases   NHases   PKS   THM

 Table of content


        1   Introduction

        2   GC content analysis

        3   Protein detection workflow

        4   Nitrilases

        5   Nitril hydratases

        6   Polyketide synthases I

        7   Take home messages
Intro         GC       Protein detection   Nitrilases   NHases        PKS       THM




        For the microbial ecologist, what can be cultured is the basis of his
        conception of what exists. This is exactly like learning about
        animals from visiting zoos.

                                                                 Carl Woese
Intro   GC   Protein detection   Nitrilases   NHases   PKS   THM
Intro   GC   Protein detection   Nitrilases   NHases   PKS   THM
Intro   GC   Protein detection   Nitrilases   NHases   PKS   THM
Intro   GC   Protein detection   Nitrilases   NHases   PKS   THM
Intro        GC       Protein detection   Nitrilases   NHases   PKS   THM




        Great plate count anomaly
        Less than 1% of the microbes can be cultured under standard
        conditions.
Intro   GC   Protein detection   Nitrilases   NHases   PKS   THM




                          Metagenomics
                                =
                 culture independent approaches
Intro   GC   Protein detection   Nitrilases   NHases   PKS   THM

 Workflow of metagenomics sequencing
Intro   GC    Protein detection   Nitrilases   NHases   PKS   THM

 Selected metagenomic data sets
Intro   GC       Protein detection   Nitrilases   NHases   PKS   THM

 Challenges




        Usually a low coverage
        Dominant species
        Short sequences
        Data size
         ⇒ storage/memory/CPU intensive
         ⇒ software not developed for that
        No standard protocols
         ⇒ hard to compare
Intro          GC          Protein detection   Nitrilases   NHases   PKS   THM

 Table of content


        1   Introduction

        2   GC content analysis

        3   Protein detection workflow

        4   Nitrilases

        5   Nitril hydratases

        6   Polyketide synthases I

        7   Take home messages
Intro    GC        Protein detection   Nitrilases   NHases   PKS   THM

 GC analysis




        GC content = percentage of
        Guanine-Cytosine bp in the
        DNA/RNA
        influences a.o.
              Melting temperature of DNA/RNA
              Codon usage
Intro   GC    Protein detection             Nitrilases     NHases   PKS   THM

 GC analysis - huge difference between soil and ocean water




                                  Foerstner et al., 2005
Intro   GC    Protein detection          Nitrilases   NHases   PKS   THM

 GC analysis - further data confirms statement




                                  Raes et al., 2007
Intro   GC        Protein detection   Nitrilases   NHases   PKS   THM

 GC analysis - possible influencing factors




        Nitrogen availability
        Genome size
        Ultraviolet light exposure and
        repair mechanism
        Codon usage of pioneers
Intro          GC          Protein detection   Nitrilases   NHases   PKS   THM

 Table of content


        1   Introduction

        2   GC content analysis

        3   Protein detection workflow

        4   Nitrilases

        5   Nitril hydratases

        6   Polyketide synthases I

        7   Take home messages
Intro   GC        Protein detection   Nitrilases   NHases   PKS   THM

 Metagenomics data sets as resources of biotech enzymes




        Many microbial enzymes are
        essential tools in e.g. the chemical,
        pharma and food industries
        Searching in metagenomic data
        sets might reveal new potent
        members of known enzymes classes
Intro   GC     Protein detection   Nitrilases   NHases   PKS   THM

 Protein detection and classification workflow
Intro    GC         Protein detection    Nitrilases    NHases    PKS   THM

 Nitrilases




        Nitrile + water          carboxylic acids + ammonia
        One protein
        Application in the chemical industry
              Stereo- and regio-specific conversion of nitriles
Intro   GC    Protein detection          Nitrilases   NHases   PKS   THM

 Nitrilases - new members and subfamilies found




                                  Raes et al., 2007
Intro          GC          Protein detection   Nitrilases   NHases   PKS   THM

 Table of content


        1   Introduction

        2   GC content analysis

        3   Protein detection workflow

        4   Nitrilases

        5   Nitril hydratases

        6   Polyketide synthases I

        7   Take home messages
Intro    GC        Protein detection    Nitrilases   NHases   PKS   THM

 NHases




        Nitril hydratases (NHases)
        Nitrile + water         amide
        Two domains
        Application in the chemical industry
              Acrylamide >30,000 tons/year
              Nicotinamide >3500 tons/year
        Waste water treatment
Intro   GC    Protein detection             Nitrilases     NHases   PKS   THM

 NHases - tree of the α domain




                                  Foerstner et al., 2008
Intro   GC    Protein detection   Nitrilases   NHases   PKS   THM

 NHases - Monosiga brevicollis’ taxomony
Intro   GC     Protein detection             Nitrilases     NHases   PKS   THM

 NHases - in Monosiga brevicollis




                                   Foerstner et al., 2008
Intro          GC          Protein detection   Nitrilases   NHases   PKS   THM

 Table of content


        1   Introduction

        2   GC content analysis

        3   Protein detection workflow

        4   Nitrilases

        5   Nitril hydratases

        6   Polyketide synthases I

        7   Take home messages
Intro     GC       Protein detection   Nitrilases   NHases      PKS   THM

 PKS I




         Polyketide synthases (PKS) create a heterogeneous group of
         secondary metabolites
         The synthesis is similar to the fatty acid synthesis
         Multiple domains
         We focused on polyketide synthases type I (PKS I)
Intro       GC        Protein detection              Nitrilases       NHases   PKS   THM

 PKS I - polyketide synthesis steps




        This picture of this slide is removed due to copyright restriction.
                                          Jenke-Kodama et al., 2005
Intro       GC         Protein detection       Nitrilases   NHases       PKS       THM

 PKS I - examples of polyketides




        Erythromycin                       Oleandomycin              Aflatoxin B1
Intro   GC    Protein detection             Nitrilases     NHases   PKS   THM

 PKS I - tree of the AT domain HMM hits




                                  Foerstner et al., 2008
Intro   GC     Protein detection             Nitrilases     NHases   PKS   THM

 PKS I - tree overview




                                   Foerstner et al., 2008
Intro   GC      Protein detection             Nitrilases     NHases   PKS   THM

 PKS I - hit distribution




                                    Foerstner et al., 2008
Intro   GC   Protein detection             Nitrilases     NHases   PKS   THM

 PKS I - PKSs per genome




                                 Foerstner et al., 2008
Intro          GC          Protein detection   Nitrilases   NHases   PKS   THM

 Table of content


        1   Introduction

        2   GC content analysis

        3   Protein detection workflow

        4   Nitrilases

        5   Nitril hydratases

        6   Polyketide synthases I

        7   Take home messages
Intro        GC        Protein detection   Nitrilases   NHases   PKS       THM

 Take home messages




        Metagenomics ...
            ... might help us to explore the complete microbial world
            ... still has many technical challenges
            ... can reveal the environmetal influence on genomic features
            ... can help discover new enzymes
Intro   GC          Protein detection   Nitrilases   NHases   PKS   THM

 Acknowledgements



        Peer Bork
        Thomas Dandekar
        Lars Steinmetz
        Toby Gibson
        The whole Bork group esp. Jeroen
        Raes and Takuji Yamada
        Christian von Mering
        Melly
        My friends and family
Intro    GC              Protein detection            Nitrilases            NHases              PKS            THM

 Image sources/attribution - part 1/2

        Orangutan Houston Zoo http://flickr.com/photos/billtex48/2178056762/ by (Bill and Mavis) -
        B&M
        Opel Zoo 07.07.2007 http://flickr.com/photos/lamberty/754218458 by frijolito75
        Giraffe http://flickr.com/photos/abelle/280246250/ by A.Bell
        Snuggling http://flickr.com/photos/buckwoo/2421562192/ by Ken W!
        Delicious Dead Bee and Hungry Ants http://flickr.com/photos/hamed/176176998/ by Hamed Saber
        hundreds of fish swarm a soft coral head http://flickr.com/photos/g-na/370131126/ by g-na
        hunt is on http://flickr.com/photos/doug88888/2930690305/ by doug88888
        Long-billed Curlew http://flickr.com/photos/mikebaird/3011987508/ by mikebaird
        145ps 01087.jpg http://flickr.com/photos/ricephotos/2679758872/ by IRRI Images
        Polymicrobic biofilm epifluorescence
        http://commons.wikimedia.org/wiki/File:Polymicrobic_biofilm_epifluorescence.jpg
        The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific
        Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, et al. PLoS Biology Vol. 5, No. 3, e77
        doi:10.1371/journal.pbio.0050077
        green farm http://flickr.com/photos/nakae/204037619/ by nakae
        Acid Mine Drainage http://flickr.com/photos/savethewildup/400614071/ by savethewildup
        blue ocean http://flickr.com/photos/coolskipper/27242821/ by coolskipper
        Digestive system http://commons.wikimedia.org/wiki/File:Digestive_system_whitout_labels.svg
        by Mariana Ruiz Villarreal
        Pg166 bioreactor http://commons.wikimedia.org/wiki/File:Pg166_bioreactor.jpg
Intro    GC            Protein detection           Nitrilases         NHases           PKS            THM

 Image sources/attribution - part 2/2



        Big Drop-Off [...] http://flickr.com/photos/ctsnow/113339176/ by ctsnow
        Sphaeroeca-colony http://commons.wikimedia.org/wiki/File:Sphaeroeca-colony.jpg by Dhzanette
        Ocean view http://flickr.com/photos/provoost/399669002/ by Sjors Provoost
        The hurdles http://flickr.com/photos/29621494N02/3060466344/ by paula fisher
        Erythromycin http://de.wikipedia.org/w/index.php?title=Datei:Erythrommycin_A_B_C.svg by
        Yikrazuul
        Aflatoxin B1 http://de.wikipedia.org/w/index.php?title=Datei:
        Aflatoxin_B1.svg&filetimestamp=20070113042046 by Bryan Derksen
        Oleandomycin http://en.wikipedia.org/wiki/File:Oleandomycin.png by Edgar181
        Tool rack http://en.wikipedia.org/wiki/File:Oleandomycin.png by L. Marie
        Collaboration http://flickr.com/photos/fncll/145149313/ ChrisL AK
        Base pair AT http://commons.wikimedia.org/wiki/File:Base_pair_AT.svg
        Base pair GC http://commons.wikimedia.org/wiki/File:Base_pair_GC.svg
Intro   GC   Protein detection            Nitrilases           NHases       PKS   THM

 About this document



                        A
             Created in L TEX using the beamer class, TeX Live and Emacs.

                         All these programs run on OpenBSD.

                          http://www.latex-project.org
                       http://latex-beamer.sourceforge.net
                           http://www.tug.org/texlive/
                        http://www.gnu.org/software/emacs
                               http://www.gimp.org/
                              http://www.openbsd.org



             Published under the Creative Commons Attribution 3.0 License

                  http://creativecommons.org/licenses/by/3.0/

                          Document version 1.0 2009/02/04

More Related Content

PPTX
Alberto Kornblihtt-Enfermedades raras de la piel
DOC
CHEM3204_PRAC_Manual_2016
PPTX
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
PPTX
LAMP (Loop Mediated Isothermal Amplification)
PPTX
Abbott_Kinase
PPT
Molecular methods of diagnosing infectious disease
PPT
PDF
Automated Nucleic Acid Purification from Diverse Sample types using dedicated...
Alberto Kornblihtt-Enfermedades raras de la piel
CHEM3204_PRAC_Manual_2016
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
LAMP (Loop Mediated Isothermal Amplification)
Abbott_Kinase
Molecular methods of diagnosing infectious disease
Automated Nucleic Acid Purification from Diverse Sample types using dedicated...

What's hot (20)

PPTX
Pyrosequencing
PPTX
Technique of polymerase chain reaction (pcr) experimental biotechnology
PDF
Identification of perfect housekeeping genes for gene expression studies in p...
PDF
Rapid DNA isolation from diverse plant material for use in Next Generation Se...
PDF
RNA editing as a drug target in tryp assay dev woods_hole2_0411 acs v2
PDF
Why you would want a powerful hot-start DNA polymerase for your PCR
PDF
DNA Analysis - Basic Research : A Case Study
PDF
Cox1998-Automated_RNA_selection.
PDF
Maximizing PCR and RT-PCR Success - Download the Brochure
PPTX
19_21Translation
PDF
A real time RT-LAMP portable turbidimeter
PDF
Stable 16 year storage of DNA purified with the QIAamp® DNA Blood mini kit - ...
PDF
Next Generation Sequencing
PPTX
Pyrosequencing
PPTX
Pcr, rapd dan rflp
PDF
Chigot poster2007
PDF
Gene 151_119 (1994) [SDM of dsDNA]
PDF
suraj_jaladanki_examining_Malaclemys_terrapin_genome_scaffolds
PPT
Amplificationtechniquesinmicrobiology 090609205257-phpapp02
PPTX
PCR, Real Time PCR
Pyrosequencing
Technique of polymerase chain reaction (pcr) experimental biotechnology
Identification of perfect housekeeping genes for gene expression studies in p...
Rapid DNA isolation from diverse plant material for use in Next Generation Se...
RNA editing as a drug target in tryp assay dev woods_hole2_0411 acs v2
Why you would want a powerful hot-start DNA polymerase for your PCR
DNA Analysis - Basic Research : A Case Study
Cox1998-Automated_RNA_selection.
Maximizing PCR and RT-PCR Success - Download the Brochure
19_21Translation
A real time RT-LAMP portable turbidimeter
Stable 16 year storage of DNA purified with the QIAamp® DNA Blood mini kit - ...
Next Generation Sequencing
Pyrosequencing
Pcr, rapd dan rflp
Chigot poster2007
Gene 151_119 (1994) [SDM of dsDNA]
suraj_jaladanki_examining_Malaclemys_terrapin_genome_scaffolds
Amplificationtechniquesinmicrobiology 090609205257-phpapp02
PCR, Real Time PCR
Ad

Viewers also liked (20)

PPTX
Future of metagenomics
PPTX
Metagenomics newer approach in understanding Microbes
PDF
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
PPT
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
PPT
Microbial Metagenomics Drives a New Cyberinfrastructure
PDF
Phylogeny Driven Approaches to Genomic and Metagenomic Studies
PPT
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PDF
Dr. Ben Hause - Pathogen Discovery Using Metagenomic Sequencing
PPTX
Metagenomics
PPT
Advancing the Metagenomics Revolution
PPT
Metagenomic
PPTX
Parks kmer metagenomics
PPTX
Viral Metagenomics (CABBIO 20150629 Buenos Aires)
PDF
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
PDF
Multiple kernel learning applied to the integration of Tara oceans datasets
PPTX
introduction to metagenomics
PPT
2009 hattori metagenomics
PPTX
Metagenomics and it’s applications
PPTX
metagenomics
Future of metagenomics
Metagenomics newer approach in understanding Microbes
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
Microbial Metagenomics Drives a New Cyberinfrastructure
Phylogeny Driven Approaches to Genomic and Metagenomic Studies
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
Dr. Ben Hause - Pathogen Discovery Using Metagenomic Sequencing
Metagenomics
Advancing the Metagenomics Revolution
Metagenomic
Parks kmer metagenomics
Viral Metagenomics (CABBIO 20150629 Buenos Aires)
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Multiple kernel learning applied to the integration of Tara oceans datasets
introduction to metagenomics
2009 hattori metagenomics
Metagenomics and it’s applications
metagenomics
Ad

Similar to Computational analysis of metagenomic data: delineation of compositional features and screens for desirable enzymes (20)

PDF
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...
PPTX
Valerie de Anda at #ICG12: A new multi-genomic approach for the study of biog...
PDF
Structure-Activity Relationships and Networks: A Generalized Approach to Expl...
PDF
The Era of the Microbiome - Talk by Jonathan Eisen
PPTX
6. Exploration of Microbialdiversity. Dr Thirunahari Ugandharpptx
PDF
Use of bio-informatic tools in bacterial genetics
PDF
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
PPT
Mks chemotaxonomy
PDF
Julio Peironcely @ ICCS 2011
PDF
Tyler functional annotation thurs 1120
PPTX
Metagenomics , Applications, Techniques And Limitations .pptx
PPTX
Metagenomics .pptx
PPTX
Molecular detection of food borne pathogens-presentation
PPT
BioMinds Poster!!!!!!!!
PDF
Rta Ifpac 2012 Melamine Pesticides Sers
PDF
Environment, food and industrial micro lecture for exam 3
PDF
Diversity Diversity Diversity Diversity ....
PDF
Understanding and classifying metabolite space and metabolite likeness
PPTX
Metagenomics
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...
Valerie de Anda at #ICG12: A new multi-genomic approach for the study of biog...
Structure-Activity Relationships and Networks: A Generalized Approach to Expl...
The Era of the Microbiome - Talk by Jonathan Eisen
6. Exploration of Microbialdiversity. Dr Thirunahari Ugandharpptx
Use of bio-informatic tools in bacterial genetics
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
Mks chemotaxonomy
Julio Peironcely @ ICCS 2011
Tyler functional annotation thurs 1120
Metagenomics , Applications, Techniques And Limitations .pptx
Metagenomics .pptx
Molecular detection of food borne pathogens-presentation
BioMinds Poster!!!!!!!!
Rta Ifpac 2012 Melamine Pesticides Sers
Environment, food and industrial micro lecture for exam 3
Diversity Diversity Diversity Diversity ....
Understanding and classifying metabolite space and metabolite likeness
Metagenomics

More from Konrad Förstner (11)

PDF
Collaborative platforms for streamlining workflows in Open Science
PDF
OpenGov on the city level - A call for action
PDF
OpenGov und OpenData in Darmstadt - Eine Diskussionsgrundlage
PDF
Spannungsfeld Online-Identität
PDF
Advanced Wiki training course
PDF
General thoughts about how to fix the world
PDF
Wiki training course
PDF
Principles and tools of the social web - An introductory seminar
PDF
Revolutionizing scientific communication and collaboration
PDF
The What, Why and How of openness in science
PDF
A quick trip through openness, freedom and transparency
Collaborative platforms for streamlining workflows in Open Science
OpenGov on the city level - A call for action
OpenGov und OpenData in Darmstadt - Eine Diskussionsgrundlage
Spannungsfeld Online-Identität
Advanced Wiki training course
General thoughts about how to fix the world
Wiki training course
Principles and tools of the social web - An introductory seminar
Revolutionizing scientific communication and collaboration
The What, Why and How of openness in science
A quick trip through openness, freedom and transparency

Recently uploaded (20)

PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
sap open course for s4hana steps from ECC to s4
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Approach and Philosophy of On baking technology
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Big Data Technologies - Introduction.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
KodekX | Application Modernization Development
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
Spectroscopy.pptx food analysis technology
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Spectral efficient network and resource selection model in 5G networks
sap open course for s4hana steps from ECC to s4
“AI and Expert System Decision Support & Business Intelligence Systems”
Approach and Philosophy of On baking technology
Digital-Transformation-Roadmap-for-Companies.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation_ Review paper, used for researhc scholars
Big Data Technologies - Introduction.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
MIND Revenue Release Quarter 2 2025 Press Release
KodekX | Application Modernization Development
The Rise and Fall of 3GPP – Time for a Sabbatical?
Chapter 3 Spatial Domain Image Processing.pdf
Network Security Unit 5.pdf for BCA BBA.
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Spectroscopy.pptx food analysis technology
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows

Computational analysis of metagenomic data: delineation of compositional features and screens for desirable enzymes

  • 1. Intro GC Protein detection Nitrilases NHases PKS THM Computational analysis of metagenomic data: delineation of compositional features and screens for desirable enzymes Konrad U. F¨rstner o Bork Group, EMBL Promotionskolloquium 04. Februar 2009
  • 2. Intro GC Protein detection Nitrilases NHases PKS THM Table of content 1 Introduction 2 GC content analysis 3 Protein detection workflow 4 Nitrilases 5 Nitril hydratases 6 Polyketide synthases I 7 Take home messages
  • 3. Intro GC Protein detection Nitrilases NHases PKS THM Table of content 1 Introduction 2 GC content analysis 3 Protein detection workflow 4 Nitrilases 5 Nitril hydratases 6 Polyketide synthases I 7 Take home messages
  • 4. Intro GC Protein detection Nitrilases NHases PKS THM For the microbial ecologist, what can be cultured is the basis of his conception of what exists. This is exactly like learning about animals from visiting zoos. Carl Woese
  • 5. Intro GC Protein detection Nitrilases NHases PKS THM
  • 6. Intro GC Protein detection Nitrilases NHases PKS THM
  • 7. Intro GC Protein detection Nitrilases NHases PKS THM
  • 8. Intro GC Protein detection Nitrilases NHases PKS THM
  • 9. Intro GC Protein detection Nitrilases NHases PKS THM Great plate count anomaly Less than 1% of the microbes can be cultured under standard conditions.
  • 10. Intro GC Protein detection Nitrilases NHases PKS THM Metagenomics = culture independent approaches
  • 11. Intro GC Protein detection Nitrilases NHases PKS THM Workflow of metagenomics sequencing
  • 12. Intro GC Protein detection Nitrilases NHases PKS THM Selected metagenomic data sets
  • 13. Intro GC Protein detection Nitrilases NHases PKS THM Challenges Usually a low coverage Dominant species Short sequences Data size ⇒ storage/memory/CPU intensive ⇒ software not developed for that No standard protocols ⇒ hard to compare
  • 14. Intro GC Protein detection Nitrilases NHases PKS THM Table of content 1 Introduction 2 GC content analysis 3 Protein detection workflow 4 Nitrilases 5 Nitril hydratases 6 Polyketide synthases I 7 Take home messages
  • 15. Intro GC Protein detection Nitrilases NHases PKS THM GC analysis GC content = percentage of Guanine-Cytosine bp in the DNA/RNA influences a.o. Melting temperature of DNA/RNA Codon usage
  • 16. Intro GC Protein detection Nitrilases NHases PKS THM GC analysis - huge difference between soil and ocean water Foerstner et al., 2005
  • 17. Intro GC Protein detection Nitrilases NHases PKS THM GC analysis - further data confirms statement Raes et al., 2007
  • 18. Intro GC Protein detection Nitrilases NHases PKS THM GC analysis - possible influencing factors Nitrogen availability Genome size Ultraviolet light exposure and repair mechanism Codon usage of pioneers
  • 19. Intro GC Protein detection Nitrilases NHases PKS THM Table of content 1 Introduction 2 GC content analysis 3 Protein detection workflow 4 Nitrilases 5 Nitril hydratases 6 Polyketide synthases I 7 Take home messages
  • 20. Intro GC Protein detection Nitrilases NHases PKS THM Metagenomics data sets as resources of biotech enzymes Many microbial enzymes are essential tools in e.g. the chemical, pharma and food industries Searching in metagenomic data sets might reveal new potent members of known enzymes classes
  • 21. Intro GC Protein detection Nitrilases NHases PKS THM Protein detection and classification workflow
  • 22. Intro GC Protein detection Nitrilases NHases PKS THM Nitrilases Nitrile + water carboxylic acids + ammonia One protein Application in the chemical industry Stereo- and regio-specific conversion of nitriles
  • 23. Intro GC Protein detection Nitrilases NHases PKS THM Nitrilases - new members and subfamilies found Raes et al., 2007
  • 24. Intro GC Protein detection Nitrilases NHases PKS THM Table of content 1 Introduction 2 GC content analysis 3 Protein detection workflow 4 Nitrilases 5 Nitril hydratases 6 Polyketide synthases I 7 Take home messages
  • 25. Intro GC Protein detection Nitrilases NHases PKS THM NHases Nitril hydratases (NHases) Nitrile + water amide Two domains Application in the chemical industry Acrylamide >30,000 tons/year Nicotinamide >3500 tons/year Waste water treatment
  • 26. Intro GC Protein detection Nitrilases NHases PKS THM NHases - tree of the α domain Foerstner et al., 2008
  • 27. Intro GC Protein detection Nitrilases NHases PKS THM NHases - Monosiga brevicollis’ taxomony
  • 28. Intro GC Protein detection Nitrilases NHases PKS THM NHases - in Monosiga brevicollis Foerstner et al., 2008
  • 29. Intro GC Protein detection Nitrilases NHases PKS THM Table of content 1 Introduction 2 GC content analysis 3 Protein detection workflow 4 Nitrilases 5 Nitril hydratases 6 Polyketide synthases I 7 Take home messages
  • 30. Intro GC Protein detection Nitrilases NHases PKS THM PKS I Polyketide synthases (PKS) create a heterogeneous group of secondary metabolites The synthesis is similar to the fatty acid synthesis Multiple domains We focused on polyketide synthases type I (PKS I)
  • 31. Intro GC Protein detection Nitrilases NHases PKS THM PKS I - polyketide synthesis steps This picture of this slide is removed due to copyright restriction. Jenke-Kodama et al., 2005
  • 32. Intro GC Protein detection Nitrilases NHases PKS THM PKS I - examples of polyketides Erythromycin Oleandomycin Aflatoxin B1
  • 33. Intro GC Protein detection Nitrilases NHases PKS THM PKS I - tree of the AT domain HMM hits Foerstner et al., 2008
  • 34. Intro GC Protein detection Nitrilases NHases PKS THM PKS I - tree overview Foerstner et al., 2008
  • 35. Intro GC Protein detection Nitrilases NHases PKS THM PKS I - hit distribution Foerstner et al., 2008
  • 36. Intro GC Protein detection Nitrilases NHases PKS THM PKS I - PKSs per genome Foerstner et al., 2008
  • 37. Intro GC Protein detection Nitrilases NHases PKS THM Table of content 1 Introduction 2 GC content analysis 3 Protein detection workflow 4 Nitrilases 5 Nitril hydratases 6 Polyketide synthases I 7 Take home messages
  • 38. Intro GC Protein detection Nitrilases NHases PKS THM Take home messages Metagenomics ... ... might help us to explore the complete microbial world ... still has many technical challenges ... can reveal the environmetal influence on genomic features ... can help discover new enzymes
  • 39. Intro GC Protein detection Nitrilases NHases PKS THM Acknowledgements Peer Bork Thomas Dandekar Lars Steinmetz Toby Gibson The whole Bork group esp. Jeroen Raes and Takuji Yamada Christian von Mering Melly My friends and family
  • 40. Intro GC Protein detection Nitrilases NHases PKS THM Image sources/attribution - part 1/2 Orangutan Houston Zoo http://flickr.com/photos/billtex48/2178056762/ by (Bill and Mavis) - B&M Opel Zoo 07.07.2007 http://flickr.com/photos/lamberty/754218458 by frijolito75 Giraffe http://flickr.com/photos/abelle/280246250/ by A.Bell Snuggling http://flickr.com/photos/buckwoo/2421562192/ by Ken W! Delicious Dead Bee and Hungry Ants http://flickr.com/photos/hamed/176176998/ by Hamed Saber hundreds of fish swarm a soft coral head http://flickr.com/photos/g-na/370131126/ by g-na hunt is on http://flickr.com/photos/doug88888/2930690305/ by doug88888 Long-billed Curlew http://flickr.com/photos/mikebaird/3011987508/ by mikebaird 145ps 01087.jpg http://flickr.com/photos/ricephotos/2679758872/ by IRRI Images Polymicrobic biofilm epifluorescence http://commons.wikimedia.org/wiki/File:Polymicrobic_biofilm_epifluorescence.jpg The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, et al. PLoS Biology Vol. 5, No. 3, e77 doi:10.1371/journal.pbio.0050077 green farm http://flickr.com/photos/nakae/204037619/ by nakae Acid Mine Drainage http://flickr.com/photos/savethewildup/400614071/ by savethewildup blue ocean http://flickr.com/photos/coolskipper/27242821/ by coolskipper Digestive system http://commons.wikimedia.org/wiki/File:Digestive_system_whitout_labels.svg by Mariana Ruiz Villarreal Pg166 bioreactor http://commons.wikimedia.org/wiki/File:Pg166_bioreactor.jpg
  • 41. Intro GC Protein detection Nitrilases NHases PKS THM Image sources/attribution - part 2/2 Big Drop-Off [...] http://flickr.com/photos/ctsnow/113339176/ by ctsnow Sphaeroeca-colony http://commons.wikimedia.org/wiki/File:Sphaeroeca-colony.jpg by Dhzanette Ocean view http://flickr.com/photos/provoost/399669002/ by Sjors Provoost The hurdles http://flickr.com/photos/29621494N02/3060466344/ by paula fisher Erythromycin http://de.wikipedia.org/w/index.php?title=Datei:Erythrommycin_A_B_C.svg by Yikrazuul Aflatoxin B1 http://de.wikipedia.org/w/index.php?title=Datei: Aflatoxin_B1.svg&filetimestamp=20070113042046 by Bryan Derksen Oleandomycin http://en.wikipedia.org/wiki/File:Oleandomycin.png by Edgar181 Tool rack http://en.wikipedia.org/wiki/File:Oleandomycin.png by L. Marie Collaboration http://flickr.com/photos/fncll/145149313/ ChrisL AK Base pair AT http://commons.wikimedia.org/wiki/File:Base_pair_AT.svg Base pair GC http://commons.wikimedia.org/wiki/File:Base_pair_GC.svg
  • 42. Intro GC Protein detection Nitrilases NHases PKS THM About this document A Created in L TEX using the beamer class, TeX Live and Emacs. All these programs run on OpenBSD. http://www.latex-project.org http://latex-beamer.sourceforge.net http://www.tug.org/texlive/ http://www.gnu.org/software/emacs http://www.gimp.org/ http://www.openbsd.org Published under the Creative Commons Attribution 3.0 License http://creativecommons.org/licenses/by/3.0/ Document version 1.0 2009/02/04