The document discusses the integration of diverse large-scale datasets to build comprehensive protein-protein interaction networks. It describes challenges with data from different sources having different identifiers, evidence types and quality. It also discusses methods used by STRING and other databases to combine data from curated databases, literature mining, primary datasets and transfer of interactions based on orthology. Examples are given of cell cycle studies in yeast that have analyzed periodically expressed genes and protein interactions.