Constructive Machine Learning and Hierarchical Multi-label Classification for Molecules Design.

Rodney Renato de Souza Silva,Ricardo Cerri

BRACIS (2)(2023)

Cited 0|Views2
No score
Abstract
Constructive Machine Learning (CML) is a research field that uses algorithms to generate new instances, similar but not identical to existing ones. It has been widely used to assist the discovery of new drug-like molecules. This is very challenging, given that the search space is discrete, unstructured and enormous. In this work we use CML to learn the intrinsic rules of datasets of molecules to generate novel ones. The chosen CML methods can be divided in two sub groups, text-based and graph oriented. Considering different possibilities to evaluate the methods and the generated molecules, we propose classifying generated molecules in a taxonomy, using a hierarchical multi-label classifier previously trained in a dataset of molecules with known taxonomy information. In this way, it is possible to predict properties and verify the relevance of the generated molecules to existing taxonomies. We also propose a hierarchical diversity measure to compare groups of molecules based on their taxonomy information. The measure showed coherent results and is faster to calculate than the commonly used external diversity measures.
More
Translated text
Key words
molecules,classification,machine learning,multi-label
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined