
Side effects are inherently unpredictable, often first detected in clinical trials or even by patients after approval. What if drug developers could predict side effects before the drug goes to trial? Researchers at the University of Texas San Antonio have developed an ensemble-based classification algorithm to predict side effects associated with different drugs – and so far it has outperformed previous methods and standard classifiers (based on evaluation results of 1385 side effects for 888 FDA-approved drugs). The team have also applied their method to a number of uncharacterized drug molecules in the DrugBank database to predict their side effects, which have subsequently been validated using literature mining.
The software works by assessing the chemical structure of a drug molecule and then determining whether there are any key sub-structures that are known to cause side effects in other drugs. “Some previous studies used ordinary canonical correlation analysis (OCCA) and sparse canonical correlation analysis (SCCA) to predict side effects,” says Jamiul Jahid one of the authors on the research paper (1). “We compared our results thoroughly with the SSCA method in our manuscript and found that our method significantly outperformed SCCA.” “One unique feature of our new approach is that it identifies hard to predict rare side effects which can be easily ignored by other approaches. Our model uses chemical sub-structure to identify a small subset of similar drugs to build prediction models for specific drugs, which makes it more robust for those hard-to-predict, rare side effect cases,” says Jianhua Ruan, group leader of the project. The model developed by Ruan and his team uses a technique called “ensemble classification”. The researchers identified a set of similar drugs for each drug based on their chemical substructures and then constructed a number of base classifiers – these can be any machine learning classifiers, such as a decision tree, support vector machine or a random forest. “Our method could also be used to identify chemical sub-structures related to different side effects, which means that pharma companies may be able to investigate the underlying mechanism that causes the side effect. This potential could make predictive models like ours crucial in guiding the development of new drugs,” says Ruan.
References
- M.J.Jahid and J. Ruan, “Structure-based prediction of drug side effects using a novel classification algorithm,” Int. J. Comput. Biol. Drug Des., 9, 1/2, 87 (2016).