SlideShare a Scribd company logo
Introduction to proteomics
Why proteomics?
● Most of the stuff are done by proteins
● Valuable publicly available resources
● Post translational modifications may influence epigenetics
● Many recent applications of machine learning in
○ Preprocessing raw files
○ Analysis
○ Interpreting
● Proteogenomics
○ Proteoepigenomics?
How does a peptide look like
How does a peptide look like
How is proteomics data generated?
Image from: http://www.premierbiosoft.com/tech_notes/mass-spectrometry.html
Using trypsin to limit possibilities
Trypsin cleaves peptide chains mainly at the carboxyl side of the amino acids
lysine or arginine, except when either is followed by proline.
1st dimension of data: time
Other dimensions
Slide from Andrei Drabovich
Possible precursor fragmentation
Slide from Thomas Kislinger
In summary
Slide from Thomas Kislinger
How are proteomics
experiments designed?
A. Labeling Cysteine after
obtaining peptide mixture
B. Labeling with heavy amino
acids while growing cells or
animal
C. Reaction with iTRAQ tags
Slide from Thomas Kislinger
What are the features for each peptide?
1. Time of elution
2. m/z and intensity of precursor ion (trypsinated peptides)
3. m/z and intensity of fragmented ions (mono-oligo peptides) from each
precursor ion
How to analyze the data?
1. Possible expectations from a FASTA file
2. Publicly available data from known proteins
3. De novo sequencing:
a. What are the possible peptides for a given precursion ion m/z
b. Which of these peptides are better supported with m/z of obtained b/y ions?
How to analyze the data?
Zolg, Daniel P., et al. "Building ProteomeTools based on a complete synthetic human proteome." Nature methods 14.3 (2017): 259.
How to infer post-translational modifications
Chick, Joel M., et al. "A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides."
Nature biotechnology 33.7 (2015): 743.
Bioconductor
packages
Publicly available cancer proteomics resources
● The Clinical Proteomic Tumor Analysis Consortium (CPTAC)
○ Breast cancer
○ Ovarian cancer
○ Colorectal cancer
○ Rectal adenocarcinoma
○ Skin carcinoma
○ Oral squamous cell carcinoma
○ Treated and resistant triple negative breast cancer xenografts

More Related Content

PDF
Genomics and proteomics II
PPTX
Genes, Genomics and Proteomics
PDF
PPT
Genomics and bioinformatics
PPTX
Introduction to Proteogenomics
PPT
Bioinformatics A Biased Overview
PPTX
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...
Genomics and proteomics II
Genes, Genomics and Proteomics
Genomics and bioinformatics
Introduction to Proteogenomics
Bioinformatics A Biased Overview
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...

What's hot (20)

PPTX
Introduction to Bioinformatics
PPT
Protein–protein interaction networks
PPTX
Introduction to the Proteomics Bioinformatics Course 2016
PPT
Protein interaction networks
PPT
Biological databases
PPTX
Comparative genomics and proteomics
PPTX
Genomics and Bioinformatics
PPTX
Proteomics and its applications in phytopathology
PPT
OMICS tecnology
PPT
PPTX
Introduction to the Proteomics Bioinformatics Course 2017
PPTX
Bioinformatics, comparative genemics and proteomics
PPT
Human genome project
PPTX
Genomics and proteomics ppt
PPT
Biodatabases 101220022654-phpapp02
PDF
Systems biology
PPTX
Bioinformatics and functional genomics
PPT
Intro bioinfo
PPTX
Regulation of Gene Expression
Introduction to Bioinformatics
Protein–protein interaction networks
Introduction to the Proteomics Bioinformatics Course 2016
Protein interaction networks
Biological databases
Comparative genomics and proteomics
Genomics and Bioinformatics
Proteomics and its applications in phytopathology
OMICS tecnology
Introduction to the Proteomics Bioinformatics Course 2017
Bioinformatics, comparative genemics and proteomics
Human genome project
Genomics and proteomics ppt
Biodatabases 101220022654-phpapp02
Systems biology
Bioinformatics and functional genomics
Intro bioinfo
Regulation of Gene Expression
Ad

Similar to Introduction to proteomics (20)

PPTX
Introduction to bioinformatics
PPT
2015 03 13_puurs_v_public
PPT
Bms 2010
PPTX
Introduction to bioinformatics and databases .pptx
PDF
Bioinformatics
PPT
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
PPTX
MLS 5321 MOLECULAR BIOLOGY II TECHNIQUES AND APPLICATIONS POWER POINT.pptx
PPTX
MOLECULAR BIOLOGY TECHNIQUES AND APPLICATIONS
DOCX
Bioinformatics
PPTX
Bioinformatica 29-09-2011-t1-bioinformatics
PPTX
ISB nov 2014
PPTX
Bioinformatics final
PPT
2013 10 23_dna_for_dummies_v_presented
PDF
Marzillier_09052014.pdf
PDF
ICSB 2013 - Visits Abroad Report
PDF
Bioinformatics issues and challanges presentation at s p college
PPTX
Is it feasible to identify novel biomarkers by mining public proteomics data?
PPTX
Presentation Bioinformatics Introduction.pptx
PDF
Proteomics_in_nutrition_research_Principles_techno.pdf
PPTX
genomics proteomics metbolomics.pptx
Introduction to bioinformatics
2015 03 13_puurs_v_public
Bms 2010
Introduction to bioinformatics and databases .pptx
Bioinformatics
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
MLS 5321 MOLECULAR BIOLOGY II TECHNIQUES AND APPLICATIONS POWER POINT.pptx
MOLECULAR BIOLOGY TECHNIQUES AND APPLICATIONS
Bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
ISB nov 2014
Bioinformatics final
2013 10 23_dna_for_dummies_v_presented
Marzillier_09052014.pdf
ICSB 2013 - Visits Abroad Report
Bioinformatics issues and challanges presentation at s p college
Is it feasible to identify novel biomarkers by mining public proteomics data?
Presentation Bioinformatics Introduction.pptx
Proteomics_in_nutrition_research_Principles_techno.pdf
genomics proteomics metbolomics.pptx
Ad

More from Hoffman Lab (20)

PPTX
Miller: A command-line tool for querying, shaping, and reformatting data files
PDF
GNU Parallel: Lab meeting—technical talk
PDF
TCRpower
PPTX
Efficient querying of genomic reference databases with gget
PPTX
WashU Epigenome Browser
PPTX
Wireguard: A Virtual Private Network Tunnel
PPTX
Plotting heatmap with matplotlib/seaborn
PPTX
Go Get Data (GGD)
PPTX
fastp: the FASTQ pre-processor
PPTX
R markdown and Rmdformats
PPTX
File searching tools
PPTX
Better BibTeX (BBT) for Zotero
PPTX
Awk primer and Bioawk
PPTX
Terminals and Shells
PPTX
BioRender & Glossary/Acronym
PPTX
Linters in R
PPTX
BioSyntax: syntax highlighting for computational biology
PPTX
Get Good With Git
PDF
Tech Talk: UCSC Genome Browser
PDF
MultiQC: summarize analysis results for multiple tools and samples in a singl...
Miller: A command-line tool for querying, shaping, and reformatting data files
GNU Parallel: Lab meeting—technical talk
TCRpower
Efficient querying of genomic reference databases with gget
WashU Epigenome Browser
Wireguard: A Virtual Private Network Tunnel
Plotting heatmap with matplotlib/seaborn
Go Get Data (GGD)
fastp: the FASTQ pre-processor
R markdown and Rmdformats
File searching tools
Better BibTeX (BBT) for Zotero
Awk primer and Bioawk
Terminals and Shells
BioRender & Glossary/Acronym
Linters in R
BioSyntax: syntax highlighting for computational biology
Get Good With Git
Tech Talk: UCSC Genome Browser
MultiQC: summarize analysis results for multiple tools and samples in a singl...

Recently uploaded (20)

PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
cuic standard and advanced reporting.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
1. Introduction to Computer Programming.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Tartificialntelligence_presentation.pptx
PPTX
Machine Learning_overview_presentation.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Spectral efficient network and resource selection model in 5G networks
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
cuic standard and advanced reporting.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Machine learning based COVID-19 study performance prediction
Diabetes mellitus diagnosis method based random forest with bat algorithm
Assigned Numbers - 2025 - Bluetooth® Document
Programs and apps: productivity, graphics, security and other tools
1. Introduction to Computer Programming.pptx
Network Security Unit 5.pdf for BCA BBA.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Unlocking AI with Model Context Protocol (MCP)
Per capita expenditure prediction using model stacking based on satellite ima...
MYSQL Presentation for SQL database connectivity
Encapsulation_ Review paper, used for researhc scholars
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Tartificialntelligence_presentation.pptx
Machine Learning_overview_presentation.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
NewMind AI Weekly Chronicles - August'25-Week II
Spectral efficient network and resource selection model in 5G networks

Introduction to proteomics

  • 2. Why proteomics? ● Most of the stuff are done by proteins ● Valuable publicly available resources ● Post translational modifications may influence epigenetics ● Many recent applications of machine learning in ○ Preprocessing raw files ○ Analysis ○ Interpreting ● Proteogenomics ○ Proteoepigenomics?
  • 3. How does a peptide look like
  • 4. How does a peptide look like
  • 5. How is proteomics data generated? Image from: http://www.premierbiosoft.com/tech_notes/mass-spectrometry.html
  • 6. Using trypsin to limit possibilities Trypsin cleaves peptide chains mainly at the carboxyl side of the amino acids lysine or arginine, except when either is followed by proline.
  • 7. 1st dimension of data: time
  • 8. Other dimensions Slide from Andrei Drabovich
  • 10. In summary Slide from Thomas Kislinger
  • 11. How are proteomics experiments designed? A. Labeling Cysteine after obtaining peptide mixture B. Labeling with heavy amino acids while growing cells or animal C. Reaction with iTRAQ tags Slide from Thomas Kislinger
  • 12. What are the features for each peptide? 1. Time of elution 2. m/z and intensity of precursor ion (trypsinated peptides) 3. m/z and intensity of fragmented ions (mono-oligo peptides) from each precursor ion
  • 13. How to analyze the data? 1. Possible expectations from a FASTA file 2. Publicly available data from known proteins 3. De novo sequencing: a. What are the possible peptides for a given precursion ion m/z b. Which of these peptides are better supported with m/z of obtained b/y ions?
  • 14. How to analyze the data? Zolg, Daniel P., et al. "Building ProteomeTools based on a complete synthetic human proteome." Nature methods 14.3 (2017): 259.
  • 15. How to infer post-translational modifications Chick, Joel M., et al. "A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides." Nature biotechnology 33.7 (2015): 743.
  • 17. Publicly available cancer proteomics resources ● The Clinical Proteomic Tumor Analysis Consortium (CPTAC) ○ Breast cancer ○ Ovarian cancer ○ Colorectal cancer ○ Rectal adenocarcinoma ○ Skin carcinoma ○ Oral squamous cell carcinoma ○ Treated and resistant triple negative breast cancer xenografts

Editor's Notes

  • #6: Trypsin cleaves peptide chains mainly at the carboxyl side of the amino acids lysine or arginine, except when either is followed by proline.
  • #8: SCX: Strong cation exchange C18: Hydrophobic column HPLC: High pressure liquid chromatography Acetonitrile: ACN a polar aprotic solvent Blue: reverse phase gradient Green: salt pulse and reverse phase
  • #9: SCX: Strong cation exchange C18: Hydrophobic column HPLC: High pressure liquid chromatography Acetonitrile: ACN a polar aprotic solvent Blue: reverse phase gradient Green: salt pulse and reverse phase
  • #12: ICAT - Isotope coded affinity tags SILAC - Stable isotope labeling by amino acids in cell culture iTRAQ - Isotope tags for relative and absolute quantification