SlideShare a Scribd company logo
Operators in Machine Learning:
Response Properties in Chemical Space
Anders S. Christensen <anders.christensen@unibas.ch>
University of Basel
October 3, 2018
0/ 16
1/ 16
Introduction
Machine learning:
Training set of structures and
properties
Train model, identify similarities
New predictions: Interpolation
between training instances
Figure by Felix Faber
Better predictions?
More training data
Better models
1/ 16
Introduction
Machine learning:
Training set of structures and
properties
Train model, identify similarities
New predictions: Interpolation
between training instances
Figure by Felix Faber
Better predictions?
More training data
Better models
2/ 16
Learning curves
Learning curves follow a power law:
Error ≈ a
Nb
log (Error) ≈ log (a) − b log (N)
Vapnik, The Nature of Statistical Learning Theory, Springer (1995)
3/ 16
Kernel ridge regression and representations
Prediction of a property, y for a molecule ˜x:
y (˜x) =
∑
i
k (˜x, xi) αi
Kernel: The similarity between two representations.
Non-linear transformation that makes the problem linear
E.g. Gaussian kernel:
k (xi, xj) = exp
(
−
∥xi − xj∥2
2
2σ2
)
Simply a number between 1 and 0.
Analytical solution:
α = K−1
y
Better representations ⇒ better kernel-based ML models!
3/ 16
Kernel ridge regression and representations
Prediction of a property, y for a molecule ˜x:
y (˜x) =
∑
i
k (˜x, xi) αi
Kernel: The similarity between two representations.
Non-linear transformation that makes the problem linear
E.g. Gaussian kernel:
k (xi, xj) = exp
(
−
∥xi − xj∥2
2
2σ2
)
Simply a number between 1 and 0.
Analytical solution:
α = K−1
y
Better representations ⇒ better kernel-based ML models!
3/ 16
Kernel ridge regression and representations
Prediction of a property, y for a molecule ˜x:
y (˜x) =
∑
i
k (˜x, xi) αi
Kernel: The similarity between two representations.
Non-linear transformation that makes the problem linear
E.g. Gaussian kernel:
k (xi, xj) = exp
(
−
∥xi − xj∥2
2
2σ2
)
Simply a number between 1 and 0.
Analytical solution:
α = K−1
y
Better representations ⇒ better kernel-based ML models!
3/ 16
Kernel ridge regression and representations
Prediction of a property, y for a molecule ˜x:
y (˜x) =
∑
i
k (˜x, xi) αi
Kernel: The similarity between two representations.
Non-linear transformation that makes the problem linear
E.g. Gaussian kernel:
k (xi, xj) = exp
(
−
∥xi − xj∥2
2
2σ2
)
Simply a number between 1 and 0.
Analytical solution:
α = K−1
y
Better representations ⇒ better kernel-based ML models!
4/ 16
Kernel principal component analysis
Kernel matrix for 1000 small organic molecules, elements: HCNO
4/ 16
Kernel principal component analysis
Kernel matrix for 1000 small organic molecules, elements: HCNO
5/ 16
Our latest representation (FCHL)
Atom I represented by a many-body expansion distribution of its environment,
AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)}
L2 distance: ∥AM (I) − AM (J)∥2
2 ≡
∑M
m=0
∫
(Am(I) − Am(J))2
d⃗Zd⃗r
6/ 16
Kernel principal component analysis
Kernel matrix for 1000 small organic molecules, elements: HCNO
7/ 16
The “QM9” benchmark set
134K small organic
molecules
DFT atomization
energy
8/ 16
Learning properties
Figure: FA Faber, et al. (2017) JCTC, collaboration with Google Brain
8/ 16
Learning properties
Figure: FA Faber, et al. (2017) JCTC, collaboration with Google Brain
9/ 16
Energy and response properties
E.g. a force component is the first-order energy response wrt. x:
Fx = −
∂
∂x
U
Electric dipole moment:
⃗µ = −
∂
∂ ⃗E
U
Energy in kernel-based regression models:
U = K α
Any response property:
ω ≡ O [U] = O [K] α
Analytical solution; minimize the following Lagrangian:
J(α) = ∥ωtraining
− O[K]α∥2
2
Normal-equation solution:
α =
[ ∑
γ
Oγ[K]T
Oγ[K]
]−1[ ∑
γ
(
ωtraining
γ
)T
Oγ[K]
]
9/ 16
Energy and response properties
E.g. a force component is the first-order energy response wrt. x:
Fx = −
∂
∂x
U
Electric dipole moment:
⃗µ = −
∂
∂ ⃗E
U
Energy in kernel-based regression models:
U = K α
Any response property:
ω ≡ O [U] = O [K] α
Analytical solution; minimize the following Lagrangian:
J(α) = ∥ωtraining
− O[K]α∥2
2
Normal-equation solution:
α =
[ ∑
γ
Oγ[K]T
Oγ[K]
]−1[ ∑
γ
(
ωtraining
γ
)T
Oγ[K]
]
9/ 16
Energy and response properties
E.g. a force component is the first-order energy response wrt. x:
Fx = −
∂
∂x
U
Electric dipole moment:
⃗µ = −
∂
∂ ⃗E
U
Energy in kernel-based regression models:
U = K α
Any response property:
ω ≡ O [U] = O [K] α
Analytical solution; minimize the following Lagrangian:
J(α) = ∥ωtraining
− O[K]α∥2
2
Normal-equation solution:
α =
[ ∑
γ
Oγ[K]T
Oγ[K]
]−1[ ∑
γ
(
ωtraining
γ
)T
Oγ[K]
]
9/ 16
Energy and response properties
E.g. a force component is the first-order energy response wrt. x:
Fx = −
∂
∂x
U
Electric dipole moment:
⃗µ = −
∂
∂ ⃗E
U
Energy in kernel-based regression models:
U = K α
Any response property:
ω ≡ O [U] = O [K] α
Analytical solution; minimize the following Lagrangian:
J(α) = ∥ωtraining
− O[K]α∥2
2
Normal-equation solution:
α =
[ ∑
γ
Oγ[K]T
Oγ[K]
]−1[ ∑
γ
(
ωtraining
γ
)T
Oγ[K]
]
9/ 16
Energy and response properties
E.g. a force component is the first-order energy response wrt. x:
Fx = −
∂
∂x
U
Electric dipole moment:
⃗µ = −
∂
∂ ⃗E
U
Energy in kernel-based regression models:
U = K α
Any response property:
ω ≡ O [U] = O [K] α
Analytical solution; minimize the following Lagrangian:
J(α) = ∥ωtraining
− O[K]α∥2
2
Normal-equation solution:
α =
[ ∑
γ
Oγ[K]T
Oγ[K]
]−1[ ∑
γ
(
ωtraining
γ
)T
Oγ[K]
]
9/ 16
Energy and response properties
E.g. a force component is the first-order energy response wrt. x:
Fx = −
∂
∂x
U
Electric dipole moment:
⃗µ = −
∂
∂ ⃗E
U
Energy in kernel-based regression models:
U = K α
Any response property:
ω ≡ O [U] = O [K] α
Analytical solution; minimize the following Lagrangian:
J(α) = ∥ωtraining
− O[K]α∥2
2
Normal-equation solution:
α =
[ ∑
γ
Oγ[K]T
Oγ[K]
]−1[ ∑
γ
(
ωtraining
γ
)T
Oγ[K]
]
10/ 16
Conventional kernel ridge regression:
E = K αE
ω = K αω
Same kernel matrix
Different regression coefficients
New model for every property
Response operator kernel-based regression:
E = K α
ω = O [K] α
Same kernel matrix (but different operator)
Same regression coefficients
Same model for multiple response properties
AS Christensen, FA Faber, OA von Lilienfeld (2018) “Operators in Machine Learning: Response Properties in Chemical
Space” arXiv:1807.08811
10/ 16
Conventional kernel ridge regression:
E = K αE
ω = K αω
Same kernel matrix
Different regression coefficients
New model for every property
Response operator kernel-based regression:
E = K α
ω = O [K] α
Same kernel matrix (but different operator)
Same regression coefficients
Same model for multiple response properties
AS Christensen, FA Faber, OA von Lilienfeld (2018) “Operators in Machine Learning: Response Properties in Chemical
Space” arXiv:1807.08811
11/ 16
Electric field first-order response (dipole moment)
Representation:
AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)}
Place fictitious partial charges
on atom centers.
Weight terms by resulting
dipole.
Weight for two-body terms:
ξ∗IJ
2 = ξIJ
2 − ϵ(⃗µIJ · ⃗E)
Weight for three-body terms:
ξ∗IJK
3 = ξIJK
3 − ϵ(⃗µIJK · ⃗E)
11/ 16
Electric field first-order response (dipole moment)
Representation:
AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)}
Place fictitious partial charges
on atom centers.
Weight terms by resulting
dipole.
Weight for two-body terms:
ξ∗IJ
2 = ξIJ
2 − ϵ(⃗µIJ · ⃗E)
Weight for three-body terms:
ξ∗IJK
3 = ξIJK
3 − ϵ(⃗µIJK · ⃗E)
11/ 16
Electric field first-order response (dipole moment)
Representation:
AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)}
Place fictitious partial charges
on atom centers.
Weight terms by resulting
dipole.
Weight for two-body terms:
ξ∗IJ
2 = ξIJ
2 − ϵ(⃗µIJ · ⃗E)
Weight for three-body terms:
ξ∗IJK
3 = ξIJK
3 − ϵ(⃗µIJK · ⃗E)
180 120 60 0 60 120 180
Angle [degrees]
0.5
0.0
0.5
1.0
1.5
2.0
E[mHa]
ML Test (w/ dipole)
ML Test (w/o dipole)
MP2
ML Training
H-F molecule in electric field of 0.001 a.u.
AS Christensen et al. (2018) arXiv:1807.08811
12/ 16
Learning dipole moments
The QM9 dataset (again)
Red: Learn the dipole norm
Conventional KRR
Blue: Learn the energy
derivative
Using response operators
100 1000 2500 500010000
N
1.5
1.0
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
MAE||[D]
20x less data
Norm
Response
Learning curve for QM9
AS Christensen et al. (2018) arXiv:1807.08811
13/ 16
Learning forces
0.05
0.1
0.2
0.4
0.6
MAEE
[kcal/mol]
GDML
FCHL*
SchNet
200 500 1000
0.2
0.3
0.4
0.6
0.8
1.0
2.0
MAEFx
[kcal/mol/Å]
500 1000 500 1000 500 1000 500 1000 500 1000 500 1000 500 1000
N
First derivative wrt. nuclear coordinates
“MD17” benchmark set
QM-MD trajectory, DFT forces and energies
S Chmiela et al. (2017) Sci. Advances, e1603015
AS Christensen et al. (2018) arXiv:1807.08811
14/ 16
Vibrational normal modes
Normal modes for C6N3H7
Training set: non-equilibirum
geometries of fragments
N = number of samples in the
training set
“Rank” = total number of
energies and force
components in training set
0.0
0.5
Mode:1
N = 1; Rank: 214
0.0
0.5
Mode:5
0.1 0.0 0.1
0.0
0.5
Mode:15
N = 4; Rank: 856
0.0 0.1
N = 32; Rank: 6848
0.0 0.1
E[kcal/mol]
RMSD displacement along normal mode [Å]
15/ 16
Mixed derivatives and IR spectra
IR intensity:
Mixed derivative wrt. nuclear
coordinates and external electric field.
Vibrational analysis using interface to
Gaussian09
Model trained on distorted geometries of
dichloromethane. Training includes:
Energy
Dipole moments
Forces
Many other operators to try out!
0 1000 2000 3000 4000 5000
[cm
1
]
MP2
N = 100
FCHL* MAE: 2.4 cm 1
N = 50
FCHL* MAE: 5.7 cm 1
N = 25
FCHL* MAE: 25.6 cm 1
N = 10
FCHL* MAE: 126.1 cm 1
N = 5
FCHL* MAE: 340.4 cm 1
AS Christensen et al. (2018) arXiv:1807.08811
15/ 16
Mixed derivatives and IR spectra
IR intensity:
Mixed derivative wrt. nuclear
coordinates and external electric field.
Vibrational analysis using interface to
Gaussian09
Model trained on distorted geometries of
dichloromethane. Training includes:
Energy
Dipole moments
Forces
Many other operators to try out!
0 1000 2000 3000 4000 5000
[cm
1
]
MP2
N = 100
FCHL* MAE: 2.4 cm 1
N = 50
FCHL* MAE: 5.7 cm 1
N = 25
FCHL* MAE: 25.6 cm 1
N = 10
FCHL* MAE: 126.1 cm 1
N = 5
FCHL* MAE: 340.4 cm 1
AS Christensen et al. (2018) arXiv:1807.08811
16/ 16
QML Code
Python2/3 compatible toolkit
Open Source (MIT License)
Manual with examples
pip install qml
Website: qmlcode.org
GitHub: github.com/qmlcode/qml
Follow me on Twitter:
@AndersSChristen

More Related Content

PDF
NANO266 - Lecture 4 - Introduction to DFT
PPTX
Blaha krakow 2004
PDF
Wien2k getting started
PDF
Real power loss reduction by arctic char algorithm
PDF
Introduction to DFT Part 2
PPT
Electrycity 2 p c r o 1
PPTX
Methods available in WIEN2k for the treatment of exchange and correlation ef...
PDF
Dft calculation by vasp
NANO266 - Lecture 4 - Introduction to DFT
Blaha krakow 2004
Wien2k getting started
Real power loss reduction by arctic char algorithm
Introduction to DFT Part 2
Electrycity 2 p c r o 1
Methods available in WIEN2k for the treatment of exchange and correlation ef...
Dft calculation by vasp

What's hot (20)

PPTX
Localized Electrons with Wien2k
PDF
The all-electron GW method based on WIEN2k: Implementation and applications.
PPT
Electricity for physic
PDF
NANO266 - Lecture 10 - Temperature
PPTX
Exact Exchange in Density Functional Theory
PDF
NANO266 - Lecture 6 - Molecule Properties from Quantum Mechanical Modeling
PDF
Teoría Cuántica de la Radiacion
PDF
D. Vulcanov: Symbolic Computation Methods in Cosmology and General Relativity...
PDF
Phonons & Phonopy: Pro Tips (2014)
PDF
Tersoff Potential:Inter-atomic Potential for Semi-conductors
PDF
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...
PDF
SPECTRAL SYNTHESIS PROBLEM FOR FOURIER ALGEBRAS
PDF
Materials Modelling: From theory to solar cells (Lecture 1)
PDF
EH1 - Reduced-order modelling for vibration energy harvesting
PDF
Quantum enveloping algebras with von Neumann regular Cartan-like generators a...
PDF
NANO266 - Lecture 14 - Transition state modeling
PDF
VPrasad_DAEBRNSHEPDec2014talk
PDF
Antwoordenboek statica
PDF
Application of Gauss' Law
DOC
1 potential &amp; capacity 09
Localized Electrons with Wien2k
The all-electron GW method based on WIEN2k: Implementation and applications.
Electricity for physic
NANO266 - Lecture 10 - Temperature
Exact Exchange in Density Functional Theory
NANO266 - Lecture 6 - Molecule Properties from Quantum Mechanical Modeling
Teoría Cuántica de la Radiacion
D. Vulcanov: Symbolic Computation Methods in Cosmology and General Relativity...
Phonons & Phonopy: Pro Tips (2014)
Tersoff Potential:Inter-atomic Potential for Semi-conductors
Density functional theory (DFT) and the concepts of the augmented-plane-wave ...
SPECTRAL SYNTHESIS PROBLEM FOR FOURIER ALGEBRAS
Materials Modelling: From theory to solar cells (Lecture 1)
EH1 - Reduced-order modelling for vibration energy harvesting
Quantum enveloping algebras with von Neumann regular Cartan-like generators a...
NANO266 - Lecture 14 - Transition state modeling
VPrasad_DAEBRNSHEPDec2014talk
Antwoordenboek statica
Application of Gauss' Law
1 potential &amp; capacity 09
Ad

Similar to Operators in Machine Learning: Response Properties in Chemical Space (20)

PDF
PDF
Paper_Sound-LineConstraints_CompositePanel
PDF
Problem and solution i ph o 28
PDF
Lecture: Interatomic Potentials Enabled by Machine Learning
PPT
Lecture 06 maxwell eqn
PPTX
Electrostatic Force, Electric Field.pptx
PDF
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
PDF
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
PDF
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
PDF
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
PDF
ANALYTICAL, NUMERICAL AND EXPERIMENTAL VALIDATION OF COIL VOLTAGE IN INDUCTIO...
PDF
ANALYTICAL, NUMERICAL AND EXPERIMENTAL VALIDATION OF COIL VOLTAGE IN INDUCTIO...
PPTX
MAR_Comprehensive exam on density functional theorypptx
PPTX
Shuaib Y-basedComprehensive mahmudj.pptx
PDF
Starobinsky astana 2017
PDF
NANO266 - Lecture 7 - QM Modeling of Periodic Structures
PDF
A chaotic particle swarm optimization (cpso) algorithm for solving optimal re...
PDF
A chaotic particle swarm optimization (cpso) algorithm for solving optimal re...
PDF
Alexei Starobinsky - Inflation: the present status
PDF
final_report
Paper_Sound-LineConstraints_CompositePanel
Problem and solution i ph o 28
Lecture: Interatomic Potentials Enabled by Machine Learning
Lecture 06 maxwell eqn
Electrostatic Force, Electric Field.pptx
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
Analytical, Numerical and Experimental Validation of Coil Voltage in Inductio...
ANALYTICAL, NUMERICAL AND EXPERIMENTAL VALIDATION OF COIL VOLTAGE IN INDUCTIO...
ANALYTICAL, NUMERICAL AND EXPERIMENTAL VALIDATION OF COIL VOLTAGE IN INDUCTIO...
MAR_Comprehensive exam on density functional theorypptx
Shuaib Y-basedComprehensive mahmudj.pptx
Starobinsky astana 2017
NANO266 - Lecture 7 - QM Modeling of Periodic Structures
A chaotic particle swarm optimization (cpso) algorithm for solving optimal re...
A chaotic particle swarm optimization (cpso) algorithm for solving optimal re...
Alexei Starobinsky - Inflation: the present status
final_report
Ad

Recently uploaded (20)

PDF
Placing the Near-Earth Object Impact Probability in Context
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
famous lake in india and its disturibution and importance
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
Microbiology with diagram medical studies .pptx
PPTX
Cell Membrane: Structure, Composition & Functions
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
An interstellar mission to test astrophysical black holes
PPTX
2Systematics of Living Organisms t-.pptx
PPTX
neck nodes and dissection types and lymph nodes levels
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
. Radiology Case Scenariosssssssssssssss
Placing the Near-Earth Object Impact Probability in Context
Derivatives of integument scales, beaks, horns,.pptx
Classification Systems_TAXONOMY_SCIENCE8.pptx
2. Earth - The Living Planet Module 2ELS
famous lake in india and its disturibution and importance
HPLC-PPT.docx high performance liquid chromatography
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
TOTAL hIP ARTHROPLASTY Presentation.pptx
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
Microbiology with diagram medical studies .pptx
Cell Membrane: Structure, Composition & Functions
POSITIONING IN OPERATION THEATRE ROOM.ppt
bbec55_b34400a7914c42429908233dbd381773.pdf
The KM-GBF monitoring framework – status & key messages.pptx
An interstellar mission to test astrophysical black holes
2Systematics of Living Organisms t-.pptx
neck nodes and dissection types and lymph nodes levels
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
. Radiology Case Scenariosssssssssssssss

Operators in Machine Learning: Response Properties in Chemical Space

  • 1. Operators in Machine Learning: Response Properties in Chemical Space Anders S. Christensen <anders.christensen@unibas.ch> University of Basel October 3, 2018 0/ 16
  • 2. 1/ 16 Introduction Machine learning: Training set of structures and properties Train model, identify similarities New predictions: Interpolation between training instances Figure by Felix Faber Better predictions? More training data Better models
  • 3. 1/ 16 Introduction Machine learning: Training set of structures and properties Train model, identify similarities New predictions: Interpolation between training instances Figure by Felix Faber Better predictions? More training data Better models
  • 4. 2/ 16 Learning curves Learning curves follow a power law: Error ≈ a Nb log (Error) ≈ log (a) − b log (N) Vapnik, The Nature of Statistical Learning Theory, Springer (1995)
  • 5. 3/ 16 Kernel ridge regression and representations Prediction of a property, y for a molecule ˜x: y (˜x) = ∑ i k (˜x, xi) αi Kernel: The similarity between two representations. Non-linear transformation that makes the problem linear E.g. Gaussian kernel: k (xi, xj) = exp ( − ∥xi − xj∥2 2 2σ2 ) Simply a number between 1 and 0. Analytical solution: α = K−1 y Better representations ⇒ better kernel-based ML models!
  • 6. 3/ 16 Kernel ridge regression and representations Prediction of a property, y for a molecule ˜x: y (˜x) = ∑ i k (˜x, xi) αi Kernel: The similarity between two representations. Non-linear transformation that makes the problem linear E.g. Gaussian kernel: k (xi, xj) = exp ( − ∥xi − xj∥2 2 2σ2 ) Simply a number between 1 and 0. Analytical solution: α = K−1 y Better representations ⇒ better kernel-based ML models!
  • 7. 3/ 16 Kernel ridge regression and representations Prediction of a property, y for a molecule ˜x: y (˜x) = ∑ i k (˜x, xi) αi Kernel: The similarity between two representations. Non-linear transformation that makes the problem linear E.g. Gaussian kernel: k (xi, xj) = exp ( − ∥xi − xj∥2 2 2σ2 ) Simply a number between 1 and 0. Analytical solution: α = K−1 y Better representations ⇒ better kernel-based ML models!
  • 8. 3/ 16 Kernel ridge regression and representations Prediction of a property, y for a molecule ˜x: y (˜x) = ∑ i k (˜x, xi) αi Kernel: The similarity between two representations. Non-linear transformation that makes the problem linear E.g. Gaussian kernel: k (xi, xj) = exp ( − ∥xi − xj∥2 2 2σ2 ) Simply a number between 1 and 0. Analytical solution: α = K−1 y Better representations ⇒ better kernel-based ML models!
  • 9. 4/ 16 Kernel principal component analysis Kernel matrix for 1000 small organic molecules, elements: HCNO
  • 10. 4/ 16 Kernel principal component analysis Kernel matrix for 1000 small organic molecules, elements: HCNO
  • 11. 5/ 16 Our latest representation (FCHL) Atom I represented by a many-body expansion distribution of its environment, AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)} L2 distance: ∥AM (I) − AM (J)∥2 2 ≡ ∑M m=0 ∫ (Am(I) − Am(J))2 d⃗Zd⃗r
  • 12. 6/ 16 Kernel principal component analysis Kernel matrix for 1000 small organic molecules, elements: HCNO
  • 13. 7/ 16 The “QM9” benchmark set 134K small organic molecules DFT atomization energy
  • 14. 8/ 16 Learning properties Figure: FA Faber, et al. (2017) JCTC, collaboration with Google Brain
  • 15. 8/ 16 Learning properties Figure: FA Faber, et al. (2017) JCTC, collaboration with Google Brain
  • 16. 9/ 16 Energy and response properties E.g. a force component is the first-order energy response wrt. x: Fx = − ∂ ∂x U Electric dipole moment: ⃗µ = − ∂ ∂ ⃗E U Energy in kernel-based regression models: U = K α Any response property: ω ≡ O [U] = O [K] α Analytical solution; minimize the following Lagrangian: J(α) = ∥ωtraining − O[K]α∥2 2 Normal-equation solution: α = [ ∑ γ Oγ[K]T Oγ[K] ]−1[ ∑ γ ( ωtraining γ )T Oγ[K] ]
  • 17. 9/ 16 Energy and response properties E.g. a force component is the first-order energy response wrt. x: Fx = − ∂ ∂x U Electric dipole moment: ⃗µ = − ∂ ∂ ⃗E U Energy in kernel-based regression models: U = K α Any response property: ω ≡ O [U] = O [K] α Analytical solution; minimize the following Lagrangian: J(α) = ∥ωtraining − O[K]α∥2 2 Normal-equation solution: α = [ ∑ γ Oγ[K]T Oγ[K] ]−1[ ∑ γ ( ωtraining γ )T Oγ[K] ]
  • 18. 9/ 16 Energy and response properties E.g. a force component is the first-order energy response wrt. x: Fx = − ∂ ∂x U Electric dipole moment: ⃗µ = − ∂ ∂ ⃗E U Energy in kernel-based regression models: U = K α Any response property: ω ≡ O [U] = O [K] α Analytical solution; minimize the following Lagrangian: J(α) = ∥ωtraining − O[K]α∥2 2 Normal-equation solution: α = [ ∑ γ Oγ[K]T Oγ[K] ]−1[ ∑ γ ( ωtraining γ )T Oγ[K] ]
  • 19. 9/ 16 Energy and response properties E.g. a force component is the first-order energy response wrt. x: Fx = − ∂ ∂x U Electric dipole moment: ⃗µ = − ∂ ∂ ⃗E U Energy in kernel-based regression models: U = K α Any response property: ω ≡ O [U] = O [K] α Analytical solution; minimize the following Lagrangian: J(α) = ∥ωtraining − O[K]α∥2 2 Normal-equation solution: α = [ ∑ γ Oγ[K]T Oγ[K] ]−1[ ∑ γ ( ωtraining γ )T Oγ[K] ]
  • 20. 9/ 16 Energy and response properties E.g. a force component is the first-order energy response wrt. x: Fx = − ∂ ∂x U Electric dipole moment: ⃗µ = − ∂ ∂ ⃗E U Energy in kernel-based regression models: U = K α Any response property: ω ≡ O [U] = O [K] α Analytical solution; minimize the following Lagrangian: J(α) = ∥ωtraining − O[K]α∥2 2 Normal-equation solution: α = [ ∑ γ Oγ[K]T Oγ[K] ]−1[ ∑ γ ( ωtraining γ )T Oγ[K] ]
  • 21. 9/ 16 Energy and response properties E.g. a force component is the first-order energy response wrt. x: Fx = − ∂ ∂x U Electric dipole moment: ⃗µ = − ∂ ∂ ⃗E U Energy in kernel-based regression models: U = K α Any response property: ω ≡ O [U] = O [K] α Analytical solution; minimize the following Lagrangian: J(α) = ∥ωtraining − O[K]α∥2 2 Normal-equation solution: α = [ ∑ γ Oγ[K]T Oγ[K] ]−1[ ∑ γ ( ωtraining γ )T Oγ[K] ]
  • 22. 10/ 16 Conventional kernel ridge regression: E = K αE ω = K αω Same kernel matrix Different regression coefficients New model for every property Response operator kernel-based regression: E = K α ω = O [K] α Same kernel matrix (but different operator) Same regression coefficients Same model for multiple response properties AS Christensen, FA Faber, OA von Lilienfeld (2018) “Operators in Machine Learning: Response Properties in Chemical Space” arXiv:1807.08811
  • 23. 10/ 16 Conventional kernel ridge regression: E = K αE ω = K αω Same kernel matrix Different regression coefficients New model for every property Response operator kernel-based regression: E = K α ω = O [K] α Same kernel matrix (but different operator) Same regression coefficients Same model for multiple response properties AS Christensen, FA Faber, OA von Lilienfeld (2018) “Operators in Machine Learning: Response Properties in Chemical Space” arXiv:1807.08811
  • 24. 11/ 16 Electric field first-order response (dipole moment) Representation: AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)} Place fictitious partial charges on atom centers. Weight terms by resulting dipole. Weight for two-body terms: ξ∗IJ 2 = ξIJ 2 − ϵ(⃗µIJ · ⃗E) Weight for three-body terms: ξ∗IJK 3 = ξIJK 3 − ϵ(⃗µIJK · ⃗E)
  • 25. 11/ 16 Electric field first-order response (dipole moment) Representation: AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)} Place fictitious partial charges on atom centers. Weight terms by resulting dipole. Weight for two-body terms: ξ∗IJ 2 = ξIJ 2 − ϵ(⃗µIJ · ⃗E) Weight for three-body terms: ξ∗IJK 3 = ξIJK 3 − ϵ(⃗µIJK · ⃗E)
  • 26. 11/ 16 Electric field first-order response (dipole moment) Representation: AM (I) = {A1(I), A2(I), A3(I), . . . , AM (I)} Place fictitious partial charges on atom centers. Weight terms by resulting dipole. Weight for two-body terms: ξ∗IJ 2 = ξIJ 2 − ϵ(⃗µIJ · ⃗E) Weight for three-body terms: ξ∗IJK 3 = ξIJK 3 − ϵ(⃗µIJK · ⃗E) 180 120 60 0 60 120 180 Angle [degrees] 0.5 0.0 0.5 1.0 1.5 2.0 E[mHa] ML Test (w/ dipole) ML Test (w/o dipole) MP2 ML Training H-F molecule in electric field of 0.001 a.u. AS Christensen et al. (2018) arXiv:1807.08811
  • 27. 12/ 16 Learning dipole moments The QM9 dataset (again) Red: Learn the dipole norm Conventional KRR Blue: Learn the energy derivative Using response operators 100 1000 2500 500010000 N 1.5 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 MAE||[D] 20x less data Norm Response Learning curve for QM9 AS Christensen et al. (2018) arXiv:1807.08811
  • 28. 13/ 16 Learning forces 0.05 0.1 0.2 0.4 0.6 MAEE [kcal/mol] GDML FCHL* SchNet 200 500 1000 0.2 0.3 0.4 0.6 0.8 1.0 2.0 MAEFx [kcal/mol/Å] 500 1000 500 1000 500 1000 500 1000 500 1000 500 1000 500 1000 N First derivative wrt. nuclear coordinates “MD17” benchmark set QM-MD trajectory, DFT forces and energies S Chmiela et al. (2017) Sci. Advances, e1603015 AS Christensen et al. (2018) arXiv:1807.08811
  • 29. 14/ 16 Vibrational normal modes Normal modes for C6N3H7 Training set: non-equilibirum geometries of fragments N = number of samples in the training set “Rank” = total number of energies and force components in training set 0.0 0.5 Mode:1 N = 1; Rank: 214 0.0 0.5 Mode:5 0.1 0.0 0.1 0.0 0.5 Mode:15 N = 4; Rank: 856 0.0 0.1 N = 32; Rank: 6848 0.0 0.1 E[kcal/mol] RMSD displacement along normal mode [Å]
  • 30. 15/ 16 Mixed derivatives and IR spectra IR intensity: Mixed derivative wrt. nuclear coordinates and external electric field. Vibrational analysis using interface to Gaussian09 Model trained on distorted geometries of dichloromethane. Training includes: Energy Dipole moments Forces Many other operators to try out! 0 1000 2000 3000 4000 5000 [cm 1 ] MP2 N = 100 FCHL* MAE: 2.4 cm 1 N = 50 FCHL* MAE: 5.7 cm 1 N = 25 FCHL* MAE: 25.6 cm 1 N = 10 FCHL* MAE: 126.1 cm 1 N = 5 FCHL* MAE: 340.4 cm 1 AS Christensen et al. (2018) arXiv:1807.08811
  • 31. 15/ 16 Mixed derivatives and IR spectra IR intensity: Mixed derivative wrt. nuclear coordinates and external electric field. Vibrational analysis using interface to Gaussian09 Model trained on distorted geometries of dichloromethane. Training includes: Energy Dipole moments Forces Many other operators to try out! 0 1000 2000 3000 4000 5000 [cm 1 ] MP2 N = 100 FCHL* MAE: 2.4 cm 1 N = 50 FCHL* MAE: 5.7 cm 1 N = 25 FCHL* MAE: 25.6 cm 1 N = 10 FCHL* MAE: 126.1 cm 1 N = 5 FCHL* MAE: 340.4 cm 1 AS Christensen et al. (2018) arXiv:1807.08811
  • 32. 16/ 16 QML Code Python2/3 compatible toolkit Open Source (MIT License) Manual with examples pip install qml Website: qmlcode.org GitHub: github.com/qmlcode/qml Follow me on Twitter: @AndersSChristen