A "molecular" compass shows the way to reducing animal testing
Scientists develop intelligent software tool for assessing chemical risks
Sergey Sosnin
For many decades, new drugs and agricultural chemicals were mainly tested on animals. These tests are expensive, raise ethical concerns and often fail to accurately predict side effects in humans. As part of the RISK-HUNT3R project supported by the European Union, scientists from the University of Vienna are working on the development of the next generation of methods for animal-free risk assessment of new substances. Computer-aided methods now make it possible to fully assess the toxicological and ecological risks of new chemicals by computer without having to synthesize and test the chemical compounds. But one question remains: How trustworthy are these computer models?
It's all about reliable predictions
To investigate this problem in more detail, Sergey Sosnin, Senior Scientist in the Pharmacoinformatics Research Group at the University of Vienna, focused on binary classification. Here, a machine learning model provides a probability from 0 % to 100 % indicating whether a chemical compound is active or not (e.g. toxic or non-toxic, bioaccumulative or non-bioaccumulative, a binder or non-binder to a specific human protein). This probability reflects the confidence of the model in its prediction. Ideally, the model should only give values close to 0% (safely inactive) or 100% (safely active) for correct predictions. If the model is uncertain and gives a confidence score of e.g. 51%, these predictions should be discarded and alternative methods of risk assessment used. However, a problem arises when the model gives incorrect predictions with high probabilities.
"This is the real nightmare scenario for toxicologists," says Sergey Sosnin. "If a model predicts that a compound is not toxic with 99% certainty, but the compound is actually toxic, there is no way of knowing that something has gone wrong." The only solution is to identify in advance those areas of chemical space - i.e. possible classes of organic compounds - where the model has "blind spots" and avoid them. This requires researchers evaluating the model to check the predicted results for thousands of chemical compounds individually - a tedious and error-prone task.
Overcoming this significant obstacle
"To support these researchers," continues Sosnin, "we developed interactive graphical tools that project chemical compounds on a 2D plane, similar to geographical maps. Using colors, we highlight the compounds that were predicted incorrectly with high confidence, allowing users to identify them as clusters of red dots. The map is interactive and allows users to examine the chemical space and explore areas of concern."
The methodology was tested using a model of binding to the estrogen receptor. After visually analyzing the chemical space, it became clear that the model works well for e.g. steroids and polychlorinated biphenyls, but fails completely for small, non-cyclic compounds and should therefore not be used for these.
The software developed in this project is freely available to the scientific community on GitHub. Sergey Sosnin hopes that MolCompass will help chemists and toxicologists to better understand the limitations of computer models. This study is a step towards a future where animal testing will no longer be necessary and the only workplace for toxicologists will be a desk with a computer.
Note: This article has been translated using a computer system without human intervention. LUMITOS offers these automatic translations to present a wider range of current news. Since this article has been translated with automatic translation, it is possible that it contains errors in vocabulary, syntax or grammar. The original article in German can be found here.