SlideShare a Scribd company logo
8/06/2019 1
Hanoi, June 2019
Truyen Tran
Deakin University
@truyenoz
truyentran.github.io
truyen.tran@deakin.edu.au
letdataspeak.blogspot.com
goo.gl/3jJ1O0
Memory Advances
in Neural Turing
Machines
8/06/2019 2
Deep Learning
Domain expert
Knowledge-based
8/06/2019 3
Can we learn from data a model that is as
powerful as a Turing machine?
In other words, can we learn a (neural)
program that learns to program from data?
8/06/2019 4
Program memory
Outlook
Sparse read/write
Variational memory
Neural Turing Machine
Agenda
Modelling
Three interwoven processes:
• Disease progression
• Interventions & care
processes
• Recording rules
Example: Electronic
medical records
8/06/2019 5
Source: medicalbillingcodings.org
visits/admissions
time gap
?
prediction point
Abstraction
Need memory to handle thousands of events,
compute complex healthcare “grammars”,
support chain of reasoning, rapid switching of
tasks.
Neural Turing machine (NTM)
A controller that takes
input/output and talks to an
external memory module.
Memory has read/write
operations.
The main issue is where to write,
and how to update the memory
state.
All operations are differentiable.
https://rylanschaeffer.github.io/content/research/neural_turing_machine/main.html
8/06/2019 7
Program memory
Outlook
Sparse read/write
Variational memory
Neural Turing Machine
Agenda
Motivation: Dialog system
8/06/2019 8
A dialog system needs to maintain the history of
chat (e.g., could be hours)
  Memory is needed
The generation of response needs to be flexible,
adapting to variation of moods, styles
 Current techniques are mostly based on LSTM, leading
to “stiff” default responses (e.g., “I see”).
There are many ways to express the same
thought
  Variational generative methods are needed. vectorstock
Variational memory encoder-
decoder (VMED)
8/06/2019 9
Conditional Variational Auto-Encoder
contextgenerated
latent variables
VMED
contextgenerated
latent variables memory
reads
Sample response
8/06/2019 10
8/06/2019 11
Program memory
Outlook
Sparse read/write
Variational memory
Neural Turing Machine
Agenda
Problems of current NTMs
Lack of theoretical analysis on optimal memory operations.
Previous works are based on intuitions:
Location-based reading/writing; temporal linkage reading; least-used
writing [Santoro et.al, Graves et.al]
Sparse access over big memory [Rae et.al]
Very slow due to heavy memory read/write computations
12
Cached Uniform Writing (CUW)
13
Ablation Study
Memory-augmented Neural Networks w/wo Uniform Writing
14Task: repeat the input sequence twice
Synthetic tasks: memorize all
15
Synthetic tasks: memorize selectively
16
Synthetic sinusoidal generation:
memorize featured points
17
Flatten MNIST classification
18
Document classification
19
8/06/2019 20
Program memory
Outlook
Sparse read/write
Variational memory
Neural Turing Machine
Agenda
Computing devices vs neural counterparts
FSM (1943) ↔ RNNs (1982)
PDA (1954) ↔ Stack RNN (1993)
TM (1936) ↔ NTM (2014)
UTM/VNA (1936/1945) ↔ NUTM--ours (2019)
The missing piece: A memory to store programs
 Neural stored-program memory
NUTM = NTM + NSM
Multi-level modelling
Hierarchical Regression: if the input is clustered, clustering before
regression helps
Prove for low dimensions maybe available, higher dimension?
NSM is beneficial to NTM
Algorithmic single tasks
Sequencing tasks
Continual Learning
Few-shot learning
Question answering (bAbI dataset)
8/06/2019 30
Program memory
Outlook
Sparse read/write
Variational memory
Neural Turing Machine
Agenda
Memory for graphs & relational
structures
Turing machine to design
machine learning algorithms
Memory-supported reasoning
Imaginative memory
Social memory: collective mem,
theory of mind, memory of
others
Full cognitive architectures
Theoretical analysis
8/06/2019 31
https://twitter.com/nvidia/status/1010545517405835264
Towards AGI:
Is Human Brain a
(super-)Turing machine?

More Related Content

PDF
Deep learning 1.0 and Beyond, Part 2
PDF
Deep learning 1.0 and Beyond, Part 1
PDF
Deep Learning 2.0
PDF
Empirical AI Research
PDF
Visual reasoning
PDF
Machine Reasoning at A2I2, Deakin University
PDF
AI/ML as an empirical science
PDF
Deep learning for detecting anomalies and software vulnerabilities
Deep learning 1.0 and Beyond, Part 2
Deep learning 1.0 and Beyond, Part 1
Deep Learning 2.0
Empirical AI Research
Visual reasoning
Machine Reasoning at A2I2, Deakin University
AI/ML as an empirical science
Deep learning for detecting anomalies and software vulnerabilities

What's hot (20)

PDF
Deep learning and applications in non-cognitive domains III
PDF
Deep learning and applications in non-cognitive domains I
PDF
Deep learning and applications in non-cognitive domains II
PPTX
Deep Learning Explained
DOCX
Case study on deep learning
PDF
Case study on machine learning
PDF
IT_Computational thinking
PPTX
Computational Thinking in the Workforce and Next Generation Science Standards...
PPTX
Cognitive Computing and the future of Artificial Intelligence
PDF
Cognitive Computing by Professor Gordon Pipa
PPTX
The Deep Learning Glossary
PDF
3234150
PDF
Engage with FutureGrid at XSEDE 12
PPTX
Keynote 1: Teaching and Learning Computational Thinking at Scale
PPTX
Intro to deep learning
PPTX
Cognitive computing 2016
PDF
Computational thinking-illustrated
PPTX
Semantics of the Black-Box: Using knowledge-infused learning approach to make...
PPTX
A Semantics-based Approach to Machine Perception
PDF
Challenges in deep learning methods for medical imaging - Pubrica
Deep learning and applications in non-cognitive domains III
Deep learning and applications in non-cognitive domains I
Deep learning and applications in non-cognitive domains II
Deep Learning Explained
Case study on deep learning
Case study on machine learning
IT_Computational thinking
Computational Thinking in the Workforce and Next Generation Science Standards...
Cognitive Computing and the future of Artificial Intelligence
Cognitive Computing by Professor Gordon Pipa
The Deep Learning Glossary
3234150
Engage with FutureGrid at XSEDE 12
Keynote 1: Teaching and Learning Computational Thinking at Scale
Intro to deep learning
Cognitive computing 2016
Computational thinking-illustrated
Semantics of the Black-Box: Using knowledge-infused learning approach to make...
A Semantics-based Approach to Machine Perception
Challenges in deep learning methods for medical imaging - Pubrica
Ad

Similar to Memory advances in Neural Turing Machines (20)

PDF
Role of computers in research
PDF
A SURVEY OF DIFFERENT APPROACHES FOR OVERCOMING THE PROCESSOR-MEMORY BOTTLENECK
PDF
A Survey of Different Approaches for Overcoming the Processor - Memory Bottle...
PDF
A Survey of Different Approaches for Overcoming the Processor - Memory Bottle...
PDF
151 A SURVEY OF DIFFERENT APPROACHES FOR OVERCOMING THE PROCESSOR-MEMORY BOTT...
PPTX
Introduction to computer application for patient care delivery.
PPTX
introductiontocomputerapplicationforpatientcaredelivery-240228032326-7ce03cfd...
PPT
virtual memory.ppt
PPTX
Parallel computing
PPTX
ROLE OF COMPUTERS IN RESEARCH. pptx notes for all researchers
PDF
EPQ Main
PPTX
Information Processing
PPT
NOV11 virtual memory.ppt
PPT
Chapter 09 - Virtual Memory.ppt
PDF
Advanced computer architechture -Memory Hierarchies and its Properties and Type
PPT
NOV11 virtual memory.ppt
PPTX
Analytics & Business Intelligence @ center-stage
PPTX
Computer Tools for Teaching and Learning.pptx
PPTX
Classification of memory hierarchy in system unit
PDF
Memory organization
Role of computers in research
A SURVEY OF DIFFERENT APPROACHES FOR OVERCOMING THE PROCESSOR-MEMORY BOTTLENECK
A Survey of Different Approaches for Overcoming the Processor - Memory Bottle...
A Survey of Different Approaches for Overcoming the Processor - Memory Bottle...
151 A SURVEY OF DIFFERENT APPROACHES FOR OVERCOMING THE PROCESSOR-MEMORY BOTT...
Introduction to computer application for patient care delivery.
introductiontocomputerapplicationforpatientcaredelivery-240228032326-7ce03cfd...
virtual memory.ppt
Parallel computing
ROLE OF COMPUTERS IN RESEARCH. pptx notes for all researchers
EPQ Main
Information Processing
NOV11 virtual memory.ppt
Chapter 09 - Virtual Memory.ppt
Advanced computer architechture -Memory Hierarchies and its Properties and Type
NOV11 virtual memory.ppt
Analytics & Business Intelligence @ center-stage
Computer Tools for Teaching and Learning.pptx
Classification of memory hierarchy in system unit
Memory organization
Ad

More from Deakin University (19)

PDF
Artificial intelligence in the post-deep learning era
PDF
Deep learning and reasoning: Recent advances
PDF
AI for automated materials discovery via learning to represent, predict, gene...
PDF
Deep analytics via learning to reason
PDF
Generative AI to Accelerate Discovery of Materials
PDF
Generative AI: Shifting the AI Landscape
PDF
From deep learning to deep reasoning
PDF
Machine Learning and Reasoning for Drug Discovery
PDF
Machine reasoning
PDF
AI in the Covid-19 pandemic
PDF
AI for tackling climate change
PDF
AI for drug discovery
PDF
Deep learning for episodic interventional data
PDF
Deep learning for biomedical discovery and data mining I
PDF
Deep learning for biomedical discovery and data mining II
PDF
AI that/for matters
PDF
Representation learning on graphs
PDF
Deep learning for genomics: Present and future
PDF
Deep learning for biomedicine
Artificial intelligence in the post-deep learning era
Deep learning and reasoning: Recent advances
AI for automated materials discovery via learning to represent, predict, gene...
Deep analytics via learning to reason
Generative AI to Accelerate Discovery of Materials
Generative AI: Shifting the AI Landscape
From deep learning to deep reasoning
Machine Learning and Reasoning for Drug Discovery
Machine reasoning
AI in the Covid-19 pandemic
AI for tackling climate change
AI for drug discovery
Deep learning for episodic interventional data
Deep learning for biomedical discovery and data mining I
Deep learning for biomedical discovery and data mining II
AI that/for matters
Representation learning on graphs
Deep learning for genomics: Present and future
Deep learning for biomedicine

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Machine learning based COVID-19 study performance prediction
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Review of recent advances in non-invasive hemoglobin estimation
Digital-Transformation-Roadmap-for-Companies.pptx
Machine learning based COVID-19 study performance prediction
sap open course for s4hana steps from ECC to s4
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
MYSQL Presentation for SQL database connectivity
Dropbox Q2 2025 Financial Results & Investor Presentation
Understanding_Digital_Forensics_Presentation.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Programs and apps: productivity, graphics, security and other tools
Building Integrated photovoltaic BIPV_UPV.pdf
Unlocking AI with Model Context Protocol (MCP)
Encapsulation_ Review paper, used for researhc scholars
Network Security Unit 5.pdf for BCA BBA.
MIND Revenue Release Quarter 2 2025 Press Release
Review of recent advances in non-invasive hemoglobin estimation

Memory advances in Neural Turing Machines