SlideShare a Scribd company logo
Random indexing: On space and meaning Simon Belak
Order of the day Meaning Philosophy Neuroscience Computer science Space Words as points in space On dimensionality Random indexing
What’s the meaning of  meaning ?
Philosophers say: “ Meaning just is use.” –   Wittgenstein
Neuroscientists say: Episodic memory    semantic memory (concrete event    abstract concept) Hebbian process
Computer scientists say: LSA  semantic networks HAL TLC SAM ACT-R ontology
Projecting meaning into space
Adjacent words closely related
Movement Co-occurrences Hebbian process Self-organisation Clustering Evolution of language Coach   ( Kocs      carriage    train    car)
Problem:  homonym s Table 1.   a.  An article of furniture supported by one or more vertical legs and having a flat horizontal surface. b.  The objects laid out for a meal on this article of furniture. 2.  The food and drink served at meals; fare: kept an excellent table. 3.  The company of people assembled around a table, as for a meal. 4   A plateau or tableland. 5 .  a.  A flat facet cut across the top of a precious stone. b .  A stone or gem cut in this fashion. 6 .  Music a.   The front part of the body of a stringed instrument. b.   The sounding board of a harp. 7 .  Architecture   a.   A raised or sunken rectangular panel on a wall. b.   A raised horizontal surface or continuous band on an exterior wall; a stringcourse. 8 .  A part of the human palm framed by four lines, analyzed in palmistry. 9 .  An orderly arrangement of data, especially one in which the data are arranged in columns and rows in an essentially rectangular form. 1 0 .  An abbreviated list, as of contents; a synopsis. 1 1 .  An engraved slab or tablet bearing an inscription or a device. 1 2 .  Anatomy  The inner or outer flat layer of bones of the skull separated by the dipole.
Solution: high dimensionality One dimension per word   Table   extends into  food ,  furniture ,  music ,... dimensions
Problem: synonyms amazing ,  stupefying ,  staggering ,  awesome ,  awful ,   awe-inspiring ,   awing ,   astonishing ,  astounding
Solution: latent meaning Reduced dimensionality Closely related words fold into one “ Higher-order” meaning
Random indexing
The idea Word is the sum of it’s contexts Context is the sum of it’s words Grounding?
The algorithm Take a context of words Generate a context index vector Add index to all the word vectors Go to 1) Episodic memory (2) + Hebbian process (3)
Dimensionality reduction Sparse high-dimensional ternary index  (a small number of randomly distributed +1s and -1s) N early orthogonal Distances approximately preserved
The good Fast, scalable Trivially parallelised Per word Addition is associative, commutative Stable Words are independent Integer arithmetics Incremental
The bad Memory hungry Caching (Zipf’s law)
Uses Comparing words to words Query expnasion Comparing documents to documents  Clustering Search Recomendations Comparing documents to words Keyword extraction
Key points Meaning is use Words in space Multiple meanings, multiple dimensions Random indexing Cognitive rationale Simple Fast, scalable
Questions?
References http://www.sics.se/~mange/papers/KarlgrenSahlgren2001.pdf http://www.kfs.org/~jonathan/witt/tlph.html http://www.mtsu.edu/~sschmidt/Cognitive/semantic/semantic.html http://memory.syr.edu/marc/papers/HowaAddiJingKaha-LSAChap-doc.pdf http://memory.psych.upenn.edu/research/research_episodic_memory.php

More Related Content

PPTX
Thales of miletus slide
PPS
Thales
PDF
Semantic Indexing and Search for Content Management Systems with Apache Stanbol
PDF
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
PPTX
An Improved Approach to Word Sense Disambiguation
PPT
Word Net
PDF
Word Space Models and Random Indexing
ODP
Word Space Models & Random indexing
Thales of miletus slide
Thales
Semantic Indexing and Search for Content Management Systems with Apache Stanbol
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
An Improved Approach to Word Sense Disambiguation
Word Net
Word Space Models and Random Indexing
Word Space Models & Random indexing

Similar to Random Indexing (20)

PPTX
Vocabulary building
PPTX
Jarrar.lecture notes.lexicalsemanticsandmultilingualism
PDF
Fifty Long Words in English - Farha Baig
PDF
Hierarchical taxonomy extraction
PPTX
Jarrar: WordNet And Global WordNets
PPT
Vocabulary Concepts
PPTX
EuroVis DocuBurst Presentation 2009
DOCX
PDF
Academic_Word_List.pdf
PPT
L2 Thinking
PPT
PDF
M2 session 1 slides
PDF
Terminology work and term databases in Estonia
PDF
Semantic Annotation of the Cyttron Database
PPTX
Extracting Meaning from Wikipedia
PPT
Marcelo Funes-Gallanzi - Simplish - Computational intelligence unconference
PPTX
LexicalSemanticsWordSenses.pptxMMMMMMMMMMMMMMMMMMMMMMMMM
PPTX
PPTX
TF_IDF_PMI_Jurafsky.pptxnnnnnnnnnnnnnnnn
PPT
Nctm 03 24 07
Vocabulary building
Jarrar.lecture notes.lexicalsemanticsandmultilingualism
Fifty Long Words in English - Farha Baig
Hierarchical taxonomy extraction
Jarrar: WordNet And Global WordNets
Vocabulary Concepts
EuroVis DocuBurst Presentation 2009
Academic_Word_List.pdf
L2 Thinking
M2 session 1 slides
Terminology work and term databases in Estonia
Semantic Annotation of the Cyttron Database
Extracting Meaning from Wikipedia
Marcelo Funes-Gallanzi - Simplish - Computational intelligence unconference
LexicalSemanticsWordSenses.pptxMMMMMMMMMMMMMMMMMMMMMMMMM
TF_IDF_PMI_Jurafsky.pptxnnnnnnnnnnnnnnnn
Nctm 03 24 07
Ad

More from Simon Belak (20)

PDF
Tools for building the future
PDF
Doing data science with clojure
PDF
Exploratory analysis
PDF
Levelling up your data infrastructure
PDF
The subtle art of recommendation
PDF
Metabase Ljubljana Meetup #2
PDF
Metabase lj meetup
PDF
Sketch algorithms
PDF
Transducing for fun and profit
PDF
Your metrics are wrong
PDF
Writing smart contracts the sane way
PDF
Online statistical analysis using transducers and sketch algorithms
PDF
Save the princess
PDF
Data driven going to market strategy
PDF
Spec: a lisp-flavoured type system
PDF
A data layer in clojure
PDF
Odkrivanje segmentov iz podatkov
PDF
Using Onyx in anger
PDF
Spec + onyx
PDF
Dao of lisp
Tools for building the future
Doing data science with clojure
Exploratory analysis
Levelling up your data infrastructure
The subtle art of recommendation
Metabase Ljubljana Meetup #2
Metabase lj meetup
Sketch algorithms
Transducing for fun and profit
Your metrics are wrong
Writing smart contracts the sane way
Online statistical analysis using transducers and sketch algorithms
Save the princess
Data driven going to market strategy
Spec: a lisp-flavoured type system
A data layer in clojure
Odkrivanje segmentov iz podatkov
Using Onyx in anger
Spec + onyx
Dao of lisp
Ad

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Tartificialntelligence_presentation.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Machine Learning_overview_presentation.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Approach and Philosophy of On baking technology
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
MYSQL Presentation for SQL database connectivity
Building Integrated photovoltaic BIPV_UPV.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Spectroscopy.pptx food analysis technology
Tartificialntelligence_presentation.pptx
Encapsulation theory and applications.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Machine Learning_overview_presentation.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
NewMind AI Weekly Chronicles - August'25-Week II
Spectral efficient network and resource selection model in 5G networks
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Approach and Philosophy of On baking technology
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Group 1 Presentation -Planning and Decision Making .pptx
A comparative analysis of optical character recognition models for extracting...
MYSQL Presentation for SQL database connectivity

Random Indexing

  • 1. Random indexing: On space and meaning Simon Belak
  • 2. Order of the day Meaning Philosophy Neuroscience Computer science Space Words as points in space On dimensionality Random indexing
  • 3. What’s the meaning of meaning ?
  • 4. Philosophers say: “ Meaning just is use.” – Wittgenstein
  • 5. Neuroscientists say: Episodic memory  semantic memory (concrete event  abstract concept) Hebbian process
  • 6. Computer scientists say: LSA semantic networks HAL TLC SAM ACT-R ontology
  • 9. Movement Co-occurrences Hebbian process Self-organisation Clustering Evolution of language Coach ( Kocs  carriage  train  car)
  • 10. Problem: homonym s Table 1. a. An article of furniture supported by one or more vertical legs and having a flat horizontal surface. b. The objects laid out for a meal on this article of furniture. 2. The food and drink served at meals; fare: kept an excellent table. 3. The company of people assembled around a table, as for a meal. 4 A plateau or tableland. 5 . a. A flat facet cut across the top of a precious stone. b . A stone or gem cut in this fashion. 6 . Music a. The front part of the body of a stringed instrument. b. The sounding board of a harp. 7 . Architecture a. A raised or sunken rectangular panel on a wall. b. A raised horizontal surface or continuous band on an exterior wall; a stringcourse. 8 . A part of the human palm framed by four lines, analyzed in palmistry. 9 . An orderly arrangement of data, especially one in which the data are arranged in columns and rows in an essentially rectangular form. 1 0 . An abbreviated list, as of contents; a synopsis. 1 1 . An engraved slab or tablet bearing an inscription or a device. 1 2 . Anatomy The inner or outer flat layer of bones of the skull separated by the dipole.
  • 11. Solution: high dimensionality One dimension per word Table extends into food , furniture , music ,... dimensions
  • 12. Problem: synonyms amazing , stupefying , staggering , awesome , awful , awe-inspiring , awing , astonishing , astounding
  • 13. Solution: latent meaning Reduced dimensionality Closely related words fold into one “ Higher-order” meaning
  • 15. The idea Word is the sum of it’s contexts Context is the sum of it’s words Grounding?
  • 16. The algorithm Take a context of words Generate a context index vector Add index to all the word vectors Go to 1) Episodic memory (2) + Hebbian process (3)
  • 17. Dimensionality reduction Sparse high-dimensional ternary index (a small number of randomly distributed +1s and -1s) N early orthogonal Distances approximately preserved
  • 18. The good Fast, scalable Trivially parallelised Per word Addition is associative, commutative Stable Words are independent Integer arithmetics Incremental
  • 19. The bad Memory hungry Caching (Zipf’s law)
  • 20. Uses Comparing words to words Query expnasion Comparing documents to documents Clustering Search Recomendations Comparing documents to words Keyword extraction
  • 21. Key points Meaning is use Words in space Multiple meanings, multiple dimensions Random indexing Cognitive rationale Simple Fast, scalable
  • 23. References http://www.sics.se/~mange/papers/KarlgrenSahlgren2001.pdf http://www.kfs.org/~jonathan/witt/tlph.html http://www.mtsu.edu/~sschmidt/Cognitive/semantic/semantic.html http://memory.syr.edu/marc/papers/HowaAddiJingKaha-LSAChap-doc.pdf http://memory.psych.upenn.edu/research/research_episodic_memory.php