TY - DATA T1 - Chemically Standardized Dataset of 512 Kinases for Statistical Modeling PY - 2020/02/13 AU - Lindsey Burggraaff UR - https://data.4tu.nl/articles/dataset/Chemically_Standardized_Dataset_of_512_Kinases_for_Statistical_Modeling/12702779/1 DO - 10.4121/uuid:6af1d9de-281f-4221-b7e1-e7c01b90dfe0 KW - Cheminformatics KW - Compound dataset KW - Drug discovery KW - Kinase KW - Machine learning N2 - Compound dataset consisting of structures and bioactivity data (classes) for 512 kinases. Chemical structures are available as InChIKey and bioactivity data as either active (pChEMBL >= 6.5) or inactive (pChEMBL < 6.5) (the meaning of the pChEMBL value can be found on: https://www.ebi.ac.uk/chembl/). The compound structures are chemically standardised by neutralising charges, removing salts, and keeping the largest fragment. The dataset was used in training and validation of statistical models (QSAR and PCM). ER -