This document discusses open source bioinformatics tools and resources for data scientists working in drug discovery. It provides an overview of recent projects involving druggability prediction, protein structure and function prediction, and identification of new targets for cancer. It also summarizes key steps in the drug discovery process and some of the main challenges, including drug resistance and tumor heterogeneity. Resources mentioned include databases of protein structures, drug data, gene expression and pathways involved in DNA damage response.