SlideShare a Scribd company logo
AI in Science Research
How can modern AI help to push the boundary of science
Ding Li 2022.1
2
MATHEMATICS
3
AI Aids Intuition in Mathematical Discovery
The cycle of developing mathematical theories by
studying examples.
• After recognizing a possible pattern in the properties of
mathematical objects, such as convex polyhedra (3D
shapes with flat faces, straight edges and vertices that all
point outwards), mathematicians typically go through a
cycle to understand this pattern.
• They first compute the properties of some simple
examples and analyze the possible relationships
between these properties.
• The researchers then refine these relationships. For
example, they might come up with Euler’s polyhedron
formula, which posits that the number of vertices (V)
minus the number of edges (E) plus the number of faces
(F) of a convex polyhedron is always equal to two:
V − E + F = 2.
• They then test this suggested relationship on more
complicated examples, discard irrelevant properties and
attempt to understand why the relationship holds. If it
remains unclear, mathematicians then consider different
examples, and the cycle continues.
• Davies et al.1 show that machine-learning techniques
can help researchers with the refinement step, which
usually relies strongly on human intuition
Stump 2021
4
Advancing mathematics by guiding human intuition with AI Davies 2021
As an illustrative example: let z be convex polyhedra,
X(z) ∈ Z2 × R2 be the number of vertices and edges of z, as well as the
volume and surface area, and Y(z) ∈ ℤ be the number of faces of z.
Euler’s formula states that there is an exact relationship between X(z)
and Y(z) in this case: X(z) · (−1, 1, 0, 0) + 2 = Y(z).
The framework helps guide the intuition of mathematicians in two
ways: by verifying the hypothesized existence of structure/patterns in
mathematical objects through the use of supervised machine learning;
and by helping in the understanding of these patterns through the use
of attribution techniques.
5
Quantum
Chemistry
6
Pushing the Frontiers of Density Functionals
by Solving the Fractional Electron Problem
Kirkpatrick 2021
• Computing electronic energies underpins theoretical chemistry and materials science, and
density functional theory (DFT) promises an exact and efficient approach
• But the approach has limitations and is known to give the wrong results for certain types of
molecule.
• “It’s sort of the ideal problem for machine learning: you know the answer, but not the
formula you want to apply.”
• The functional was evaluated by integrating local energies computed by a multilayer
perceptron (MLP), which took as input both local and nonlocal features of the occupied
Kohn-Sham (KS) orbitals and can be described as a local range-separated hybrid.
• To train the functional, the sum of two objective functions was used: a regression a
gradient regularization term that ensured that the functional derivatives can be used in
self-consistent field (SCF) calculations after training
Castelvecchi 2021
7
BIOLOGY
8
Primary Structure
Amino acids (20)
Peptide bond
Secondary Structure Tertiary Structure
Quaternary Structure
9
(MSA) Multiple Sequence Alignments Nseq x Nres
• Evolutionary constrains
• MSA clustering
• Cluster deletion
• Evolutionary correlations
Pairwise Feature Nres x Nres
• Physical and geometric constrains
• Target feat (amino acids), residue index
• Structural templates
• Template distogram
Near experimental accuracy in
most cases for CASP14 assessment
(May-July 2020)
Jumper 2021 GitHub
AlphaFold Protein Structure Database (JAK2)
Blog
Colab
UniProt (JAK2)
10
A BERT-style transformer was applied to predict randomly masked
individual residues within the MSA, which encourages the network to
learn to interpret phylogenetic and covariation relationships without
hardcoding a particular correlation statistic into the features.
Exchange information iteratively
to enable direct reasoning about
the spatial and evolutionary
relationships in the proteins.
Combination of the bioinformatics and physical approaches
We hope that AlphaFold—and computational approaches that apply its techniques
for other biophysical problems—will become essential tools of modern biology.
11
“Do not quench your inspiration
and your imagination; do not
become the slave of your
model.”
– Vincent van Gogh

More Related Content

PPTX
Recommendation system
PPT
Textmining Retrieval And Clustering
PDF
Handling Missing Attributes using Matrix Factorization 
PDF
SVD and the Netflix Dataset
PDF
Pca analysis
PDF
Unsupervised learning clustering
PPT
3.1 clustering
PPTX
Cluster analysis
Recommendation system
Textmining Retrieval And Clustering
Handling Missing Attributes using Matrix Factorization 
SVD and the Netflix Dataset
Pca analysis
Unsupervised learning clustering
3.1 clustering
Cluster analysis

What's hot (19)

PPTX
Principal Component Analysis (PCA) and LDA PPT Slides
PDF
EFFECTIVENESS PREDICTION OF MEMORY BASED CLASSIFIERS FOR THE CLASSIFICATION O...
PDF
A Novel Algorithm for Design Tree Classification with PCA
PPTX
Cluster analysis
PDF
Matrix Factorization Technique for Recommender Systems
PPTX
Machine learning clustering
PPTX
PPTX
Morse-Smale Regression for Risk Modeling
PPTX
Types of clustering and different types of clustering algorithms
PDF
DOCX
Clustering techniques final
PPTX
Clustering in Data Mining
PPTX
Unsupervised learning clustering
PDF
IRJET- Performance Evaluation of Various Classification Algorithms
PDF
Similarity Features, and their Role in Concept Alignment Learning
PDF
Literature Survey: Clustering Technique
PPT
Clustering
PPTX
Presentation on unsupervised learning
PPT
Capter10 cluster basic
Principal Component Analysis (PCA) and LDA PPT Slides
EFFECTIVENESS PREDICTION OF MEMORY BASED CLASSIFIERS FOR THE CLASSIFICATION O...
A Novel Algorithm for Design Tree Classification with PCA
Cluster analysis
Matrix Factorization Technique for Recommender Systems
Machine learning clustering
Morse-Smale Regression for Risk Modeling
Types of clustering and different types of clustering algorithms
Clustering techniques final
Clustering in Data Mining
Unsupervised learning clustering
IRJET- Performance Evaluation of Various Classification Algorithms
Similarity Features, and their Role in Concept Alignment Learning
Literature Survey: Clustering Technique
Clustering
Presentation on unsupervised learning
Capter10 cluster basic
Ad

Similar to AI to advance science research (20)

PDF
So sánh cấu trúc protein_Protein structure comparison
PDF
A MATLAB Computational Investigation of the Jordan Canonical Form of a Class ...
PDF
Q26099103
PPTX
OBJECTRECOGNITION1.pptxjjjkkkkjjjjkkkkkkk
PPTX
OBJECTRECOGNITION1.pptxjjjkkkkjjjjkkkkkkk
PDF
graph_embeddings
DOCX
Ib mathematics hl
PPTX
theory of computation lecture 01
PDF
08 Exponential Random Graph Models (ERGM)
PDF
08 Exponential Random Graph Models (2016)
PDF
A Nonstandard Study of Taylor Ser.Dev.-Abstract+ Intro. M.Sc. Thesis
PDF
Modeling the dynamics of molecular concentration during the diffusion procedure
PPTX
250505_Thuy_Labseminar[Discovering Invariant Rationales for Graph Neural Netw...
PDF
Gabor Frames for Quasicrystals and K-theory
PDF
Aussem
ODP
A Logical Language with a Prototypical Semantics
PDF
Em molnar2015
DOCX
Artifact3 allen
DOCX
Artifact3 allen
So sánh cấu trúc protein_Protein structure comparison
A MATLAB Computational Investigation of the Jordan Canonical Form of a Class ...
Q26099103
OBJECTRECOGNITION1.pptxjjjkkkkjjjjkkkkkkk
OBJECTRECOGNITION1.pptxjjjkkkkjjjjkkkkkkk
graph_embeddings
Ib mathematics hl
theory of computation lecture 01
08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (2016)
A Nonstandard Study of Taylor Ser.Dev.-Abstract+ Intro. M.Sc. Thesis
Modeling the dynamics of molecular concentration during the diffusion procedure
250505_Thuy_Labseminar[Discovering Invariant Rationales for Graph Neural Netw...
Gabor Frames for Quasicrystals and K-theory
Aussem
A Logical Language with a Prototypical Semantics
Em molnar2015
Artifact3 allen
Artifact3 allen
Ad

More from Ding Li (12)

PPTX
Software architecture for data applications
PPTX
Seismic data analysis with u net
PPTX
Titanic survivor prediction by machine learning
PPTX
Find nuclei in images with U-net
PPTX
Digit recognizer by convolutional neural network
PPTX
Reinforcement learning
PPTX
Practical data science
PPTX
Generative adversarial networks
PPTX
Machine learning with graph
PPTX
Natural language processing and transformer models
PPTX
Great neck school budget 2016-2017 analysis
PPTX
Business Intelligence and Big Data in Cloud
Software architecture for data applications
Seismic data analysis with u net
Titanic survivor prediction by machine learning
Find nuclei in images with U-net
Digit recognizer by convolutional neural network
Reinforcement learning
Practical data science
Generative adversarial networks
Machine learning with graph
Natural language processing and transformer models
Great neck school budget 2016-2017 analysis
Business Intelligence and Big Data in Cloud

Recently uploaded (20)

PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Mega Projects Data Mega Projects Data
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPT
Predictive modeling basics in data cleaning process
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Computer network topology notes for revision
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Introduction-to-Cloud-ComputingFinal.pptx
Qualitative Qantitative and Mixed Methods.pptx
climate analysis of Dhaka ,Banglades.pptx
[EN] Industrial Machine Downtime Prediction
Mega Projects Data Mega Projects Data
Data_Analytics_and_PowerBI_Presentation.pptx
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Miokarditis (Inflamasi pada Otot Jantung)
Predictive modeling basics in data cleaning process
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Optimise Shopper Experiences with a Strong Data Estate.pdf
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
ISS -ESG Data flows What is ESG and HowHow
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Computer network topology notes for revision

AI to advance science research

  • 1. AI in Science Research How can modern AI help to push the boundary of science Ding Li 2022.1
  • 3. 3 AI Aids Intuition in Mathematical Discovery The cycle of developing mathematical theories by studying examples. • After recognizing a possible pattern in the properties of mathematical objects, such as convex polyhedra (3D shapes with flat faces, straight edges and vertices that all point outwards), mathematicians typically go through a cycle to understand this pattern. • They first compute the properties of some simple examples and analyze the possible relationships between these properties. • The researchers then refine these relationships. For example, they might come up with Euler’s polyhedron formula, which posits that the number of vertices (V) minus the number of edges (E) plus the number of faces (F) of a convex polyhedron is always equal to two: V − E + F = 2. • They then test this suggested relationship on more complicated examples, discard irrelevant properties and attempt to understand why the relationship holds. If it remains unclear, mathematicians then consider different examples, and the cycle continues. • Davies et al.1 show that machine-learning techniques can help researchers with the refinement step, which usually relies strongly on human intuition Stump 2021
  • 4. 4 Advancing mathematics by guiding human intuition with AI Davies 2021 As an illustrative example: let z be convex polyhedra, X(z) ∈ Z2 × R2 be the number of vertices and edges of z, as well as the volume and surface area, and Y(z) ∈ ℤ be the number of faces of z. Euler’s formula states that there is an exact relationship between X(z) and Y(z) in this case: X(z) · (−1, 1, 0, 0) + 2 = Y(z). The framework helps guide the intuition of mathematicians in two ways: by verifying the hypothesized existence of structure/patterns in mathematical objects through the use of supervised machine learning; and by helping in the understanding of these patterns through the use of attribution techniques.
  • 6. 6 Pushing the Frontiers of Density Functionals by Solving the Fractional Electron Problem Kirkpatrick 2021 • Computing electronic energies underpins theoretical chemistry and materials science, and density functional theory (DFT) promises an exact and efficient approach • But the approach has limitations and is known to give the wrong results for certain types of molecule. • “It’s sort of the ideal problem for machine learning: you know the answer, but not the formula you want to apply.” • The functional was evaluated by integrating local energies computed by a multilayer perceptron (MLP), which took as input both local and nonlocal features of the occupied Kohn-Sham (KS) orbitals and can be described as a local range-separated hybrid. • To train the functional, the sum of two objective functions was used: a regression a gradient regularization term that ensured that the functional derivatives can be used in self-consistent field (SCF) calculations after training Castelvecchi 2021
  • 8. 8 Primary Structure Amino acids (20) Peptide bond Secondary Structure Tertiary Structure Quaternary Structure
  • 9. 9 (MSA) Multiple Sequence Alignments Nseq x Nres • Evolutionary constrains • MSA clustering • Cluster deletion • Evolutionary correlations Pairwise Feature Nres x Nres • Physical and geometric constrains • Target feat (amino acids), residue index • Structural templates • Template distogram Near experimental accuracy in most cases for CASP14 assessment (May-July 2020) Jumper 2021 GitHub AlphaFold Protein Structure Database (JAK2) Blog Colab UniProt (JAK2)
  • 10. 10 A BERT-style transformer was applied to predict randomly masked individual residues within the MSA, which encourages the network to learn to interpret phylogenetic and covariation relationships without hardcoding a particular correlation statistic into the features. Exchange information iteratively to enable direct reasoning about the spatial and evolutionary relationships in the proteins. Combination of the bioinformatics and physical approaches We hope that AlphaFold—and computational approaches that apply its techniques for other biophysical problems—will become essential tools of modern biology.
  • 11. 11 “Do not quench your inspiration and your imagination; do not become the slave of your model.” – Vincent van Gogh