Substituting random forest for multiple linear regression improves binding affinity prediction of scoring functions: Cyscore as a case study.
Auteurs
Li H, Leung KS, Wong MH, Ballester PJ
Résumé
State-of-the-art protein-ligand docking methods are generally limited by the traditionally low accuracy of their scoring functions, which are used to predict binding affinity and thus vital for discriminating between active and inactive compounds. Despite intensive research over the years, classical scoring functions have reached a plateau in their predictive performance. These assume a predetermined additive functional form for some sophisticated numerical features, and use standard multivariate linear regression (MLR) on experimental data to derive the coefficients.
Lire l‘article