This document discusses how to study protein structure and function computationally, starting from the amino acid sequence. It begins by defining proteins as polypeptides: chains of amino acids that fold into three-dimensional structures built from elements such as helices and sheets. It then outlines the steps for analyzing a protein computationally: obtaining the sequence from a database in FASTA format, predicting physicochemical properties with tools like ProtParam, secondary structure with PsiPred, signal peptides with SignalP, transmembrane regions with TMHMM, and domains with InterPro. It describes homology-based tools, which take advantage of the conservation of structure among related proteins, and then discusses the challenges of full 3D structure prediction. Overall, it lays out a computational workflow that goes from a protein's amino acid sequence to an analysis of its structure and function.
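
The first two steps of this workflow, reading a FASTA record and computing basic sequence properties, can be sketched in a few lines of Python. This is a minimal illustration and not the method used by ProtParam or any other tool named above: the function names (`parse_fasta`, `composition`, `gravy`) and the example record are hypothetical, and the only property computed beyond length and composition is the grand average of hydropathy (GRAVY), taken as the mean of the standard Kyte-Doolittle hydropathy values.

```python
from collections import Counter

# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            # Sequence lines may be wrapped; join them into one string.
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def composition(seq):
    """Count how many times each residue letter occurs in the sequence."""
    return Counter(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over the standard residues in seq."""
    scores = [KYTE_DOOLITTLE[aa] for aa in seq if aa in KYTE_DOOLITTLE]
    if not scores:
        raise ValueError("sequence contains no standard amino acids")
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical record, used only to exercise the functions.
    example = ">example_protein hypothetical record\nMKVLA\nGDE\n"
    for header, seq in parse_fasta(example):
        print(header)
        print("length:", len(seq))
        print("composition:", dict(composition(seq)))
        print("GRAVY:", round(gravy(seq), 3))
```

A positive GRAVY value suggests a more hydrophobic sequence and a negative value a more hydrophilic one. Non-standard letters such as `X` are counted in the composition but skipped when averaging hydropathy, which is a simplification chosen for this sketch.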