Statistics, Department of
Document Type
Article
Date of this Version
6-4-2024
Citation
Nature Communications | ( 2024) 15:5072. https://doi.org/10.1038/s41467-024-49372-0
Abstract
Quantitative structure-activity relationship (QSAR)modeling is a powerful tool for drug discovery, yet the lack of interpretability of commonly used QSAR models hinders their application inmolecular design.We propose a similaritybased regression framework, topological regression (TR), that offers a statistically grounded, computationally fast, and interpretable technique to predict drug responses. We compare the predictive performance of TR on 530 ChEMBL human target activity datasets against the predictive performance of deep-learning-based QSAR models. Our results suggest that our sparse TR model can achieve equal, if not better, performance than the deep learningbased QSAR models and provide better intuitive interpretation by extracting an approximate isometry between the chemical space of the drugs and their activity space.
Comments
Open access.