The document provides an overview of using text-mining techniques like named entity recognition and information extraction to identify disease genes from biomedical abstracts. It describes retrieving over 170,000 abstracts related to prostate cancer and schizophrenia and using tools like grep, cut, sort, and uniq to analyze the text and identify human genes mentioned in the abstracts as well as their co-occurrence between diseases. The goal is to leverage these methods to help prioritize candidate genes for further study.