Gregory M. Banik 
Advances in Consensus Modeling for ADME/Tox Prediction
Echeminfo
 

Advances in Consensus Modeling for ADME/Tox Prediction 

The attrition rate of drug candidates in pharmaceutical R&D is a well known problem. In silico ADME/Tox prediction has become an increasing important method to help rank order the drug candidates in the hopes of improving the odds of success earlier in the R&D process. A number of publications and conference proceedings have extolled the virtues of consensus modeling for ADME/Tox, which heretofore has been a manual, laborious process. Consensus modeling involves the use of multiple models simultaneously to improve the result over any of the individual models in the prediction. A system is described that automates the creation and validation of consensus models for both real variable predictors (for example, log P) as well as Boolean variable or classification predictors (for example, mutagenicity).

An integrated N-fold cross-validation process is included for real variable consensus models, and an integrated randomization cross-check process is included for Boolean variables. Both provide statistics that allow the quality of the consensus prediction to be assessed. In addition, two new concepts for consensus modeling will be demonstrated. The first, substructure-localized consensus models (LOCOMOs), allows consensus models to be trained for specific classes of compounds, improving the overall accuracy of prediction. The second, model normalization, involves the use of scaling factors and constants to better map a model to a specific set of experimental data. In all cases, once a consensus model is trained and validated, it can be used with a higher degree of confidence to screen large libraries of compounds.