Self-Attentive	Associative	Memory
Presented by Hung	Le
1
Key	takeaways
2
Propose	Self-
Attentive	Associtive
Memory	to	
implement	
relational	memory
Propose	a	unified	
model	of	item	and	
relational	memory
Propose	Outer	
Product	Attention	
for	richer	memory-
memory	relational	
binding
How	to	learn	to	memorize	and	relate	
temporal	items	simultaneously?
Background
3
Common	memory	paradigm	
is	storing	and	retrieval	
X1-Y1
X2-Y2
X3-Y3
Store Retrieve
X1
Y1
4
Key-Value
Query	is	some	key
Answer	is	the	associated	value
Current	associative	memory	is	item	memory,	
weak	at	recognizing	relationships
Item
Memory
• Store	and	retrieve	individual	items
• Relate	pair	of	items	of	the	same	time	step
• Fail	to	relate	temporally	distant	items
5
Source:	Zeithamova et	al.,	"The	hippocampus	and	inferential	reasoning:	building	memories	to	navigate	future	decisions.
" Frontiers	in	human	neuroscience 6	(2012).
Current	attention	is	dot	product
NTM DNC Transformer RMCALSTM
6
Represent	a	
relationship	as	a	
scalar
Source:	Santoro	et	al.,	"Relational	recurrent	neural	networks.“
In Advances	in	neural	information	processing	systems.	2018.
Limitations	of	current	memory	systems
• The	relational	representation	is	often	computed	without	storing
• Few	works	that	manage	both	items	and	the	relationships
in	a	single	memory
• The	memory-memory	relationship	is	coarse	
7
Motivations
8
Neuropathologists	say there	exist	2	memory	
systems	in	human	brain
9
• Store	items
• Simple,	low-order
Relational
Memory
• Store	relationships	between	items
• Complicated,	high-order
Item
Memory
Howard	Eichenbaum,	Memory,	amnesia,	and	the	hippocampal	system (MIT	press,	1993).
Alex	Konkel and	Neal	J	Cohen,	"Relational	memory	and	the	hippocampus:	representations	and	methods",	Frontiers	in	neuroscience 3	(2009).
Graphical	navigation	requires	both	memories
10
5	km
2	km
1	km
5	km
2	km
1	km
Relational	memory
Item	memory
Dot	product	attention	works	for	simple	
relationship,	but	…
11
What	is		
most	
similar	to	
me?
0.7									0.9												- 0.1										0.4
What	is		most	
similar	to	me	
but	different	
from	tiger?
For	hard	relationship,	scalar	
representation	is	limited
Method
12
Propose	Self-
Attentive	
Associative	Memory	
to	implement	
relational	memory
Propose	a	unified	
model	of	item	and	
relational	memory	
using	neural	
networks
Propose	Outer	
Product	Attention	
for	richer	memory-
memory	relational	
binding
Outer	product	attention
13
Not	attend	to	
some	v
Associate	
q,	k,	v
Dot	product	attention	is	a	special	case	of	
outer	product	attention
14
Self-attentive	associative	memory	(SAM)
15
• Build	upon	OPA
• Differentiable
• Convert	2d	matrix	to	3d	tensor
• Convert	item	memory	to	relational	memory
Extract	
items
Associate	
items
Item	memory	and	relational	memory	in	single	
model	(STM)
16
Construct	item	memory
17
Construct	relational	memory
18
Relational	memory	transfers	its	content	to	
item	memory	and	the	output
19
Flatten
High-
dimensional	
linear	
transformation
Experimental	Results
20
Ablation	study	to	test	item	memory:	simple	
associative	retrieval
21
Transfer	
helps	long	
sequence
Cannot	learn	
without	
gates
Bigger	
memory	is		
better
Ablation	study	to	test	relational	memory:	Nth	
farthest
22
More	
extracted	
items	is	
better
Source:	Santoro	et	al.,	"Relational	recurrent	neural	networks.“
In Advances	in	neural	information	processing	systems.	2018.
Algorithm	tasks	to	test	both	item	and	
relational	memory
23
Copy	+	Sorting
(no	query)
What	is		most	
similar	or	
dissimilar	to	
me?
Relational	
associative	
recall
(with	query)
Item	+	relational	memory
Each	item	is	represented	by	a	32-bit	binary	
vector:		the	space	of	>4	billion	possible	items
24
Geometry	and	graph	with	associated	node	
features	(represented	by	at	least	20-bit	vectors)
25
Task	summary	and	order	of	relationship
26
Reinforcement	learning	can	reap	benefit	from	
both	memory
27
Skip4
Standard
No	Skip
Skip8
…
Question	answering	with	bAbI
28
Limitations	and	future	directions
• OPA	is	powerful	but	complicated
• The	current	SAM	uses	only	1	layer	of	OPA.	Is	deep	SAM	possible?
• Can	we	do	relational	reasoning	with	visual	associated	features?	E.g.,	
reasoning	over	graph	with	each	node	associated	with	an	image.	
29
5	km
2	km
1	km
Human:	Shortest	path	
from											to							is		
->							->
Meet the authors
Hung Le
Associate Research Fellow
Applied Artificial Intelligence
Institute
Truyen Tran
Associate Professor
Applied Artificial
Intelligence Institute
Svetha Venkatesh
Co-Director, Applied
Artificial Intelligence
Institute (A2I2)
Alfred Deakin Professor,
Australian Laureate
Fellow
30
Thank you
thai.le@deakin.edu.au
A²I²
Deakin University
Geelong Waurn Ponds Campus,
Geelong, VIC 3220
Hung Le
31

More Related Content

PDF
TOMO 2 QUIMICA.pdf (base para química en todo)
PPTX
Sai.pptx
PPT
Information processing approach
PDF
Deep Learning 2.0
PDF
Deep analytics via learning to reason
PPTX
information processing theory
PPTX
Topic Pages. From articles to answers.
PDF
CHILDHOOD MENTAL DISORDERS - Unit 4
TOMO 2 QUIMICA.pdf (base para química en todo)
Sai.pptx
Information processing approach
Deep Learning 2.0
Deep analytics via learning to reason
information processing theory
Topic Pages. From articles to answers.
CHILDHOOD MENTAL DISORDERS - Unit 4

Similar to Self-Attentive Associative Memory (20)

PDF
Evolution Of Cognitive Psychology
PPTX
Creativity. 3. Processes.pptx
PPTX
DR.pptx
PPTX
Epistemic networks for Epistemic Commitments
PPTX
Week 4 power point
PDF
ACT-R_Nina Wei
DOCX
311 Paper 3
PDF
Outline And Evaluate The Memory Process
PPTX
the role of workiing memory.....pptx
PPT
The biochemistry of memory
PDF
Memory Essay
PPTX
Cognitive Learning Theory
PPT
How do we know what we don’t know: Using the Neuroscience Information Framew...
PPT
Human development the mechanistic overview (part ii)
PDF
Visual reasoning
PPTX
Connectionist and Dynamical Systems approach to Cognition
PPTX
1 informationprocessing
PPTX
Schemata and reading comprehension
PPTX
Memory Organization in Computer Architecture and Organization
Evolution Of Cognitive Psychology
Creativity. 3. Processes.pptx
DR.pptx
Epistemic networks for Epistemic Commitments
Week 4 power point
ACT-R_Nina Wei
311 Paper 3
Outline And Evaluate The Memory Process
the role of workiing memory.....pptx
The biochemistry of memory
Memory Essay
Cognitive Learning Theory
How do we know what we don’t know: Using the Neuroscience Information Framew...
Human development the mechanistic overview (part ii)
Visual reasoning
Connectionist and Dynamical Systems approach to Cognition
1 informationprocessing
Schemata and reading comprehension
Memory Organization in Computer Architecture and Organization
Ad

More from Hung Le (7)

PPTX
AJCAI24 Tutorial: Towards Safe and Controlled LLMs
PPTX
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
PDF
Memory-based Reinforcement Learning
PDF
Memory for Lean Reinforcement Learning.pdf
PDF
Episodic Policy Gradient Training
PDF
Model Based Episodic Memory
PDF
Neural Stored-program Memory
AJCAI24 Tutorial: Towards Safe and Controlled LLMs
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Memory-based Reinforcement Learning
Memory for Lean Reinforcement Learning.pdf
Episodic Policy Gradient Training
Model Based Episodic Memory
Neural Stored-program Memory
Ad

Recently uploaded (20)

PDF
Yusen Logistics Group Sustainability Report 2024.pdf
PPTX
Lesson-7-Gas. -Exchange_074636.pptx
PPTX
Module_4_Updated_Presentation CORRUPTION AND GRAFT IN THE PHILIPPINES.pptx
PPTX
Sustainable Forest Management ..SFM.pptx
PPTX
FINAL TEST 3C_OCTAVIA RAMADHANI SANTOSO-1.pptx
PPTX
2025-08-17 Joseph 03 (shared slides).pptx
PPT
Lessons from Presentation Zen_ how to craft your story visually
PDF
public speaking for kids in India - LearnifyU
PDF
Microsoft-365-Administrator-s-Guide_.pdf
PPTX
Shizophrnia ppt for clinical psychology students of AS
PPTX
Knowledge Knockout ( General Knowledge Quiz )
PPTX
Phylogeny and disease transmission of Dipteran Fly (ppt).pptx
PPTX
NORMAN_RESEARCH_PRESENTATION.in education
PDF
6.-propertise of noble gases, uses and isolation in noble gases
PPTX
ANICK 6 BIRTHDAY....................................................
PPTX
power point presentation ofDracena species.pptx
PPTX
Phylogeny and disease transmission of Dipteran Fly (ppt).pptx
DOCX
Action plan to easily understanding okey
PPTX
PurpoaiveCommunication for students 02.pptx
DOCX
CLASS XII bbbbbnjhcvfyfhfyfyhPROJECT.docx
Yusen Logistics Group Sustainability Report 2024.pdf
Lesson-7-Gas. -Exchange_074636.pptx
Module_4_Updated_Presentation CORRUPTION AND GRAFT IN THE PHILIPPINES.pptx
Sustainable Forest Management ..SFM.pptx
FINAL TEST 3C_OCTAVIA RAMADHANI SANTOSO-1.pptx
2025-08-17 Joseph 03 (shared slides).pptx
Lessons from Presentation Zen_ how to craft your story visually
public speaking for kids in India - LearnifyU
Microsoft-365-Administrator-s-Guide_.pdf
Shizophrnia ppt for clinical psychology students of AS
Knowledge Knockout ( General Knowledge Quiz )
Phylogeny and disease transmission of Dipteran Fly (ppt).pptx
NORMAN_RESEARCH_PRESENTATION.in education
6.-propertise of noble gases, uses and isolation in noble gases
ANICK 6 BIRTHDAY....................................................
power point presentation ofDracena species.pptx
Phylogeny and disease transmission of Dipteran Fly (ppt).pptx
Action plan to easily understanding okey
PurpoaiveCommunication for students 02.pptx
CLASS XII bbbbbnjhcvfyfhfyfyhPROJECT.docx

Self-Attentive Associative Memory