Stem Cells and Bioinformatics
Michelle Previtera, Zhenhong Bao

Induced Pluripotent Stem Cell Lines Derived from Human Somatic Cells
OCT4, SOX2, NANOG, LIN28

OCT4 Exercise 1

Do a PubMed search for OCT4.
1. Refine your search using the Limits tab to include only studies involving humans. What other tools do you see that are useful for refining your search?
2. What is the function of OCT4 in humans according to the published literature?
3. What is the difference in the number of articles found with and without the limit?

Do a Google Scholar search for OCT4.
4. Do the advanced search options offer the same options as the Limits tab on PubMed?
5. What can you do in Google Scholar to refine your results?

Answer
OCT4 is a transcription factor involved in maintaining pluripotency in ES cells.
Number of articles found: 358 without the limit, 154 with the limit.
Google Scholar and PubMed do not have the same refinement tools. In Google Scholar, use:
  "+" operator: makes sure your results include common words, letters or numbers that Google's search technology generally ignores
  "-" operator: excludes all results that include this search term
  phrase search: only returns results that include this exact phrase
  "OR" operator: returns results that include either of your search terms
  "intitle:" operator: only returns results that include your search term in the document's title
From http://scholar.google.com/intl/en/scholar/refinesearch.html

SOX2: Using NCBI Entrez Gene
1. What are two features that make SOX2 unique? HINT: Look in the summary paragraph in Entrez Gene.
2. What does SOX2 interact with, and what is the function of this interaction?
3. Is there a structure for SOX2 in CDD? If so, what is interesting about the structure when you open it in Cn3D?
4. What is the structure aligned to?
5. What do the # marks mean on the Feature 1 row?

Answer
1. SOX2 is an intronless gene, and it lies within an intron of another gene called SOX2 overlapping transcript (SOX2OT).
2. SOX2 interacts with OCT4 to establish the first three lineages in ES cells.
3. Yes. The structure shows the DNA-binding site; those residues are blue and conserved.
4. Chain D, Crystal Structure of a POU/HMG/DNA Ternary Complex.
5. The '#' marks pinpoint features.

NANOG: Illustration using the EMBOSS Suite
Exercise:
1. Obtain the mRNA and genomic sequences of human NANOG in FASTA format from NCBI Entrez.
2. According to Infoseq, what are the length, GC content and accession number of the human NANOG mRNA?
3. Align the mRNA sequence to the genomic sequence using dottup and show the results.
4. Do a local alignment of the human mRNA sequence to the mouse sequence using Water.
5. Use getorf/plotorf to obtain the ORF of the human NANOG mRNA sequence.
6. Translate the mRNA sequence to a protein sequence with Transeq.
7. Show the hydropathy plot of the above protein sequence using Pepinfo.

NCBI Entrez
Obtain sequence files from NCBI.

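The slides retrieve the sequences through the Entrez web pages. As an alternative, the sketch below builds a request URL for NCBI's E-utilities `efetch` endpoint, which can return a nucleotide record as FASTA text. The helper name `efetch_fasta_url` is hypothetical, and the accession is the NANOG mRNA accession reported on the Infoseq slide; actually downloading the file (for example with `urllib.request.urlopen`) is left out so the example runs without network access.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities efetch service.
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_fasta_url(accession: str, db: str = "nuccore") -> str:
    """Build an efetch URL that asks for one record as plain-text FASTA.

    efetch_fasta_url is a hypothetical helper name used for illustration.
    """
    params = {"db": db, "id": accession, "rettype": "fasta", "retmode": "text"}
    return f"{EFETCH}?{urlencode(params)}"


# Accession taken from the Infoseq slide (human NANOG mRNA).
url = efetch_fasta_url("NM_024865.2")
print(url)
```

Saving the response body of this URL to a file would give a FASTA file suitable as input for the EMBOSS programs in the following slides.
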
Infoseq
Infoseq reports name, accession, type, GI, length, GC %, etc.
Accession No: NM_024865.2
Length: 2098 nt
GC content: 45.28%
Description: Homo sapiens Nanog homeobox (NANOG), mRNA

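To make the length and GC % columns concrete, here is a minimal sketch of how those two values can be computed from a FASTA record. It is not Infoseq's implementation; the function names and the toy sequence are assumptions made for illustration, and the toy record is not the NANOG mRNA.

```python
def read_fasta(text):
    """Return (header, sequence) for the first record in FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                break  # stop at the second record
            header = line[1:]
        else:
            chunks.append(line.upper())
    return header, "".join(chunks)


def gc_percent(seq):
    """Percentage of bases in seq that are G or C."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy record (hypothetical data, not a real NCBI entry).
toy = ">toy example\nATGCGC\nAATT\n"
name, seq = read_fasta(toy)
print(name, len(seq), f"{gc_percent(seq):.2f}%")
```

Running the same two functions on the downloaded NANOG mRNA FASTA file should give values comparable to the length and GC content that Infoseq reports.
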
dottup
Dottup looks for exact matches between sequences.
Word sizes shown: 10 and 20.

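The idea behind an exact-match dot plot can be sketched in a few lines: every position where a word of the chosen size occurs in both sequences becomes one dot. The code below is an illustrative sketch, not dottup's source; `word_matches` and the toy sequences are hypothetical.

```python
from collections import defaultdict


def word_matches(seq_a, seq_b, word_size):
    """Return (i, j) pairs where seq_a[i:i+word_size] == seq_b[j:j+word_size]."""
    # Index every word of seq_b by its start position.
    index = defaultdict(list)
    for j in range(len(seq_b) - word_size + 1):
        index[seq_b[j:j + word_size]].append(j)
    # Look up every word of seq_a in that index.
    hits = []
    for i in range(len(seq_a) - word_size + 1):
        for j in index.get(seq_a[i:i + word_size], ()):
            hits.append((i, j))
    return hits


# Toy sequences (hypothetical), compared at two word sizes.
a, b = "ACGTACGT", "TTACGTAA"
print(word_matches(a, b, 4))  # [(0, 2), (1, 3), (3, 1), (4, 2)]
print(word_matches(a, b, 6))  # []
```

A larger word size demands longer exact matches, so it yields fewer dots and less background noise, which is why the two word sizes on the slide give different-looking plots.
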
Water
Local alignment, as performed by water, searches for regions of local similarity and need not include the entire length of the sequences.
Results for NANOG mRNA between Homo sapiens and Mus musculus.

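Local alignment of this kind is classically computed with the Smith-Waterman dynamic-programming algorithm. The sketch below is a simplified version with a single linear gap penalty and made-up scores; a production tool such as water would typically use a substitution matrix and separate gap-open and gap-extend penalties, so the numbers here are assumptions for illustration only.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Return (best_score, aligned_a, aligned_b) for the best local alignment."""
    rows, cols = len(a) + 1, len(b) + 1
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # A cell never drops below 0: that is what makes the alignment local.
            H[i][j] = max(0, H[i - 1][j - 1] + sub, H[i - 1][j] + gap, H[i][j - 1] + gap)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)
    # Trace back from the best cell until a zero-score cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and H[i][j] > 0:
        sub = match if a[i - 1] == b[j - 1] else mismatch
        if H[i][j] == H[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif H[i][j] == H[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


# Toy sequences (hypothetical): only the shared ACGT core aligns.
score, top, bottom = smith_waterman("TTTACGTGGG", "CCACGTCC")
print(score, top, bottom)  # 8 ACGT ACGT
```

The flanking bases are left out of the result because extending into them would lower the score, which illustrates why a local alignment need not cover the entire length of either sequence.
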
Plotorf/getorf
Find and plot potential open reading frames (ORFs).
In plotorf, an ORF is defined as a region between a START and a STOP codon.

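The START-to-STOP definition can be sketched as a scan over the three forward reading frames. This is an illustrative sketch rather than the EMBOSS implementation: the function name, the `min_len` threshold and the toy sequence are assumptions, reverse-strand frames are ignored, and reported coordinates are 1-based and exclude the stop codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_len=30):
    """Return (start, end, frame) tuples for ATG-to-stop ORFs on the forward strand.

    start and end are 1-based and inclusive; end is the last base before the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for pos in range(frame, len(seq) - 2, 3):
            codon = seq[pos:pos + 3]
            if start is None and codon == "ATG":
                start = pos  # first START codon opens an ORF
            elif start is not None and codon in STOP_CODONS:
                if pos - start >= min_len:
                    orfs.append((start + 1, pos, frame + 1))
                start = None  # look for the next ORF in this frame
    return orfs


# Toy sequence (hypothetical): ATG AAA TTT GGG TAA, offset by two bases.
print(find_orfs("CCATGAAATTTGGGTAACC", min_len=9))  # [(3, 14, 3)]
```

Plotting the returned intervals per frame would give a picture similar in spirit to the plotorf output.
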
Transeq
Transeq translates an mRNA sequence to a protein sequence.
The ORF was obtained from plotorf/getorf: human NANOG mRNA ORF, 217 to 1131.
Output: translated protein sequence.

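Translation of a chosen region can be sketched with the standard genetic code. The code below is an illustration, not Transeq itself; `translate` and its 1-based inclusive `start`/`end` arguments are hypothetical names chosen to mirror the way the ORF range is quoted on the slide, and the toy sequence is not NANOG.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, start=1, end=None):
    """Translate seq[start..end] (1-based, inclusive) in the frame set by start."""
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA letters
    region = seq[start - 1:end]
    protein = []
    for pos in range(0, len(region) - 2, 3):
        # Codons with ambiguous bases are reported as X.
        protein.append(CODON_TABLE.get(region[pos:pos + 3], "X"))
    return "".join(protein)


# Toy sequence (hypothetical); * marks a stop codon.
print(translate("ATGAAATTTGGGTAA"))                       # MKFG*
print(translate("CCATGAAATTTGGGTAACC", start=3, end=14))  # MKFG
```

With the NANOG mRNA loaded as a string, a call such as `translate(mrna, start=217, end=1131)` would translate the ORF range quoted above.
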
Pepinfo
Pepinfo produces information on amino acid properties (size, polarity, aromaticity, charge, etc.).

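A hydropathy plot is usually a sliding-window average of per-residue hydropathy values. The sketch below uses the Kyte-Doolittle scale, one commonly used scale; the window size of 9 and the function name are assumptions for illustration and may not match the settings used to produce the plot in the slides.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=9):
    """Return the mean hydropathy of each full window along the protein."""
    values = [KYTE_DOOLITTLE[aa] for aa in protein.upper()]
    profile = []
    for i in range(len(values) - window + 1):
        profile.append(sum(values[i:i + window]) / window)
    return profile


# Toy peptide (hypothetical): a hydrophobic stretch followed by a charged one.
profile = hydropathy_profile("AAAAARRRRR", window=5)
print([round(v, 2) for v in profile])
```

Plotting `profile` against window position gives the hydropathy curve: peaks mark hydrophobic stretches and troughs mark hydrophilic ones.
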
LIN28: Illustration using Blast
Exercise:
1. Obtain the human protein sequence of LIN28 from NCBI.
2. Search a nucleotide database using a protein query (tblastn).
3. Search the Conserved Domain Database (CDD) for conserved domains.
4. From the taxonomy report, find the best match for the mouse homolog.
5. Compare the conserved domains from the human and mouse proteins.

Blast
tblastn: search a translated nucleotide database using a protein query.
Search result: homologs of human LIN28.

Conserved Domain Database (CDD)
Result for LIN28, Homo sapiens:
CSD, Cold-shock DNA-binding domain: 67 aa, 95.5% aligned
AIR1, Arginine methyltransferase-interacting protein: 190 aa, 34.7% aligned

Taxonomy Report
Blast results are categorized by species.
Best match in Mus musculus: protein accession number NP_665832, E value 2e-103

CD from mouse protein
Do the same search in CDD with the mouse LIN28 protein, then compare with human LIN28.

Domain   Length   Mouse LIN28 aligned   Human LIN28 aligned
CSD      67 aa    95.5%                 95.5%
AIR1     190 aa   26.84%                34.7%

Literature
Gene functions related to pluripotency:
Oct4 is required to maintain the undifferentiated stem cell state, and differentiation to trophectoderm occurs in its absence.
NANOG plays a crucial role in maintaining the pluripotent state of primate embryonic stem cells.
…
