The document discusses the process of preparing a chemical database for virtual screening or compound acquisition. It begins with assembling collections from in-house and external databases. The collection is then cleaned by removing invalid structures and standardizing structure representations. Property filtering is used to focus on lead-like compounds. Known active molecules are searched for structural similarity. Alternative structures like stereoisomers are explored. Representatives are selected from clustered structures using descriptors and similarity metrics. 3D structures are generated and a final list of compounds is assembled for screening, with some random additions, completing the preparation.