TY - CHAP
T1 - Kernel Methods for Quantum Chemistry
AU - Pronobis, Wiktor
AU - Müller, Klaus Robert
PY - 2020
Y1 - 2020
N2 - Kernel ridge regression (KRR) is one of the most popular methods of non-linear regression analysis in quantum chemistry. One of the main ingredients of KRR is the representation of the underlying physical system which mainly determines the performance of predicting quantum-mechanical properties based on KRR. Several such representations have been developed for both, solids and molecules; all of them with different advantages and limitations. These descriptors correspond to a similarity measure between two chemical compounds which is represented by the kernel. As recent approaches define the kernel directly from the underlying physical system, it is important to understand the properties of kernels and how these kernel properties can be used to improve the performance of machine learning models for quantum chemistry. After reviewing key representations of molecules, we provide an intuition on how the choice of the kernel affects the model. This is followed by a more practical guide of two complementary kernel methods, one for supervised and one for unsupervised learning, respectively. Finally, we present a way to gain an understanding about the model complexity by estimating the effective dimensionality induced by the data, the representation, and the kernel.
AB - Kernel ridge regression (KRR) is one of the most popular methods of non-linear regression analysis in quantum chemistry. One of the main ingredients of KRR is the representation of the underlying physical system which mainly determines the performance of predicting quantum-mechanical properties based on KRR. Several such representations have been developed for both, solids and molecules; all of them with different advantages and limitations. These descriptors correspond to a similarity measure between two chemical compounds which is represented by the kernel. As recent approaches define the kernel directly from the underlying physical system, it is important to understand the properties of kernels and how these kernel properties can be used to improve the performance of machine learning models for quantum chemistry. After reviewing key representations of molecules, we provide an intuition on how the choice of the kernel affects the model. This is followed by a more practical guide of two complementary kernel methods, one for supervised and one for unsupervised learning, respectively. Finally, we present a way to gain an understanding about the model complexity by estimating the effective dimensionality induced by the data, the representation, and the kernel.
UR - http://www.scopus.com/inward/record.url?scp=85086085490&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85086085490&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-40245-7_3
DO - 10.1007/978-3-030-40245-7_3
M3 - Chapter
AN - SCOPUS:85086085490
T3 - Lecture Notes in Physics
SP - 25
EP - 36
BT - Lecture Notes in Physics
PB - Springer
ER -