当前位置：网站首页>Chapter5 : Deep Learning and Computational Chemistry

Chapter5 : Deep Learning and Computational Chemistry

2022-07-25 12:22:00 【UniversalNature】

reading notes of《Artificial Intelligence in Drug Design》

文章目录

1.Introduction
- 1.1.A Brief History of AI
2.Deep Learning Applications in Computational Chemistry
- 2.1.QSAR
- 2.2.Generative Modeling

1.Introduction

In the classic Corwin Hansch articleit was illustrated that, in general, biological activity for a group of “congeneric” chemicals can be described by a comprehensive model: $Log\ 1/C_{50}=a\pi+b\varepsilon+cS+d$
Deep learning is a particular kind of machine learning that overcomes these difficulties by representing the world as a nested hierarchy of concepts, with each concept defined in relation to simpler concepts, and more abstract representations computed in terms of less abstract ones.

请添加图片描述

1.1.A Brief History of AI

The origins of the field can be traced as far back as the 1940s with the advent of the McCulloch-Pitts neuron as an early model of brain function.
In the 1950s, the perceptron became the first model that could learn its weights from the input.
The inherent limitations of the early methods eventually become apparent—expectations were raised far beyond the reality and progress was much slower than anticipated. This lead to the first “AI winter”—a period in which general interest in the field decreased dramatically, as did the funding.
An approach termed connectionism gathered speed in the 1980s. The central idea in connectionism is that a large number of simple computational units can achieve intelligent behavior when networked together.
The second wave of neural network research lasted until the mid-1990s. Expectations were, once again, raised too high and unrealistically ambitious claims were made while seeking investment. When AI research did not fulfil these expectations, investors and the public were disappointed. At the same time, other fields of machine learning made advances. Kernel machines and graphical models both achieved good results on many important tasks. These two factors led to a decline in the popularity of neural networks that lasted until the first decade of the twenty-first century.
The third wave of neural network research began with a break- through in 2006.
That deep learning models can achieve good performance has been known for some time and such models have been successfully used in commercial applications since the 1990s.
As more and more of our activities take place on computers, greater volumes of data are recorded every day leading to the current age of “big data.”

请添加图片描述

Since the introduction of hidden layers artificial neural networks have doubled in size every 2.4 years.
Chellapilla et al. proposed three novel methods to speed up deep convolutional neural networks: unrolling convolutions, using basic linear algebra software subroutines and using graphics processing units (GPUs).
The recent rise of deep learning has also been greatly facilitated by various algorithmic developments.
Last but not least, the increased availability and usability of software and documentation for training neural networks is another reason for the rapid adoption of deep learning in recent years.

2.Deep Learning Applications in Computational Chemistry

2.1.QSAR

Perhaps the first application of deep learning to QSAR was during the Merck challenge in 2012.
Random forest models were preferred in this case due to lower computational cost of training and the increased model interpretability.
When trained on the same datasets and descriptors DNN predictions are frequently similar to those of other methods in terms of their practical utility. This observation is not surprising—while algorithmic improvements may result in slightly better statistics the overall quality ofany model is still bound to the existence of an actual relationship between the modeled property and the features used to describe the molecules.
It has further been suggested that the improvement relies on the training sets for the activities sharing similar compounds and features, and there being significant correlations between those activities.
The (2-dimensional) structure of a molecule naturally forms a graph, which makes a class of deep learning techniques known as graph convolutional neural networks (GCNNs) a logical method choice in chemistry.

2.2.Generative Modeling

Genetic algorithms are a popular choice for global optimization and have been applied in the chemistry domain.
The three main deep learning approaches revolve around variational autoencoders (VAEs), reinforcement learning (RL) and generative adversarial networks (GANs). Recently, graph convolutional networks have also been applied to this problem.
One of the seminal demonstrations of this method is the work of Go ́mezBombarelli et al. ,which used an autoencoder with a latent space that was optimized by an additional network to reflect a particular property.
Convergence for GANs is not straightforward and can suffer from several issues, including mode collapse and overwhelming of the generator by the discriminator during training.
Recently, DeepSMILES and SELFIES representations have also been developed in order to overcome some of the limitations of the SMILES syntax in the context of deep learning.
RL considers the generator as an agent that must learn how to take actions (add characters) within an environment or task (SMILES generation) to maximize some notion of reward (properties).
Generally speaking, VAEs and RNN approaches require large volumes of data to train as they model distributions. Usually ChEMBL or ZINC, both containing more than a million small molecules, are used to derive those models.
To complicate the matter further, as experimental data is expensive and time consuming to gather the fitness of the generated molecules is usually assessed via QSAR models. A recent study used as many as 11 QSAR models to score and optimize compounds. Building good quality QSAR models is not trivial and requires the availability of high-quality experimental data. To circumvent that, most published work uses easily calculable properties like AlogP, molecular weight, etc. While demonstrating that the models work in principle, those experiments have little practical application.