Towards Understanding
Linear Word Analogies
ACL LT, 2019/11/02
• Towards Understanding Linear Word Analogies
• Kawin Ethayarajh, David Duvenaud, Graeme Hirst
• University of Toronto
• ACL 2019
•
• SGNS GloVe
(1) (2)
csPMI
2
• SGNS GloVe
•
$\vec{\text{king}} - \vec{\text{man}} + \vec{\text{woman}} \approx \vec{\text{queen}}$
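The analogy arithmetic above can be sketched on toy data. This is a minimal illustration, assuming invented 2-D vectors (not trained SGNS or GloVe embeddings) and a hypothetical helper `solve_analogy` that picks the nearest remaining word by cosine similarity.

```python
import math

# Toy 2-D embeddings, invented for illustration (not trained vectors).
emb = {
    "king": (3.0, 3.0),
    "queen": (3.0, 1.0),
    "man": (1.0, 3.0),
    "woman": (1.0, 1.0),
    "apple": (-2.0, 0.5),
}

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.hypot(*u) * math.hypot(*v))

def solve_analogy(a, b, c, emb):
    """Return the word closest to vec(b) - vec(a) + vec(c), excluding a, b, c."""
    target = tuple(vb - va + vc for va, vb, vc in zip(emb[a], emb[b], emb[c]))
    candidates = [w for w in emb if w not in {a, b, c}]
    return max(candidates, key=lambda w: cosine(emb[w], target))

# king - man + woman: with these toy vectors the target is exactly vec(queen).
answer = solve_analogy("man", "king", "woman", emb)
print(answer)
```

With real embeddings the target rarely coincides with a word vector, which is why the relation is written with ≈ rather than =.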
3
•
• Latent Variable Model [Arora+,2016]
•
• [Mimno and Thompson, 2017]
• Paraphrase Model [Gittens+, 2017]
•
• Zipf's law
4
https://ja.wikipedia.org/wiki/
•
5
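• SGNS and PMI [Levy & Goldberg 2014]
• SGNS implicitly factorizes a shifted PMI matrix:
$\langle \vec{x}, \vec{y}_c \rangle = \mathrm{PMI}(x, y) - \log k$
• $k > 0$: number of SGNS negative samples
• GloVe factorizes the log co-occurrence counts, with bias terms:
$\langle \vec{x}, \vec{y}_c \rangle = \log X_{x,y} - b_x - b_y$
• $\vec{x}$: word vector, $\vec{y}_c$: context vector
6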
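As a rough numeric sketch of the two factorization targets above: the function names `sgns_target` and `glove_target` are hypothetical, and the probabilities, counts, and biases are invented. It only evaluates the right-hand sides of the two identities and does not train anything.

```python
import math

def pmi(p_xy, p_x, p_y):
    """Pointwise mutual information: log of p(x, y) / (p(x) p(y))."""
    return math.log(p_xy / (p_x * p_y))

def sgns_target(p_xy, p_x, p_y, k):
    """Value of <x, y_c> in a noiseless SGNS model: PMI(x, y) - log k."""
    return pmi(p_xy, p_x, p_y) - math.log(k)

def glove_target(count_xy, b_x, b_y):
    """Value of <x, y_c> in a noiseless GloVe model: log X_xy - b_x - b_y."""
    return math.log(count_xy) - b_x - b_y

# Invented numbers: x and y co-occur twice as often as chance, so PMI = log 2.
# With k = 2 negative samples the shift cancels the PMI.
sgns_value = sgns_target(p_xy=0.02, p_x=0.1, p_y=0.1, k=2)

# Invented count X_xy = e^3 and biases 1.0 and 0.5.
glove_value = glove_target(count_xy=math.e ** 3, b_x=1.0, b_y=0.5)

print(sgns_value, glove_value)
```

A larger k pushes every SGNS target down by the same constant, which is why k drops out when differences of inner products are compared.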
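Co-occurrence Shifted PMI Theorem
• A linear analogy holds over a set S of word pairs iff:
• for every (x, y) in S, csPMI(x, y) takes the same value
• for any two pairs (x, y), (a, b) in S, csPMI(a, x) = csPMI(b, y)
• a, b, x, y are contextually coplanar
• Example: king - man + woman = queen
• csPMI(king, queen) = csPMI(man, woman)
• csPMI(man, king) = csPMI(woman, queen)
• man, woman, king, queen are contextually coplanar
$\mathrm{csPMI}(x, y) = \mathrm{PMI}(x, y) + \log p(x, y)$
7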
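The csPMI definition and the two equalities from the king/queen example can be checked on a toy table. This is a sketch under stated assumptions: the co-occurrence counts are invented and symmetric by construction (so the equalities hold exactly), probabilities are plain relative frequencies, and `analogy_conditions_hold` is a hypothetical helper that does not test coplanarity.

```python
import math
from collections import Counter

# Invented counts for unordered word pairs (not real corpus statistics).
counts = {
    frozenset(("king", "queen")): 8,
    frozenset(("man", "woman")): 8,
    frozenset(("man", "king")): 4,
    frozenset(("woman", "queen")): 4,
}
total = sum(counts.values())

# Each pair contributes its count to both of its words.
marginal = Counter()
for pair, n in counts.items():
    for w in pair:
        marginal[w] += n

def p_joint(x, y):
    return counts[frozenset((x, y))] / total

def p_word(w):
    return marginal[w] / (2 * total)

def pmi(x, y):
    return math.log(p_joint(x, y) / (p_word(x) * p_word(y)))

def cspmi(x, y):
    """csPMI(x, y) = PMI(x, y) + log p(x, y)."""
    return pmi(x, y) + math.log(p_joint(x, y))

def analogy_conditions_hold(x, y, a, b, tol=1e-9):
    """Check csPMI(x, y) = csPMI(a, b) and csPMI(a, x) = csPMI(b, y)."""
    same_within = math.isclose(cspmi(x, y), cspmi(a, b), abs_tol=tol)
    same_across = math.isclose(cspmi(a, x), cspmi(b, y), abs_tol=tol)
    return same_within and same_across

holds = analogy_conditions_hold("king", "queen", "man", "woman")
print(holds, cspmi("king", "queen"), cspmi("man", "king"))
```

On real corpora these equalities hold only approximately, so a tolerance or a variance measure over the pairs would replace the exact comparison.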
• [Mikolov+2013]
•
•
8
https://github.com/tmikolov/word2vec/blob/master/questions-words.txt
capital-world
csPMI
9
$\vec{w}_c = \lambda \vec{w}$
$p(w) \propto p(w, w)$
$\alpha \in \mathbb{R}^-$
$\gamma', \lambda \in \mathbb{R},\ \alpha \in \mathbb{R}^-$
csPMI
•
•
• null csPMI
• csPMI
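• cf. IDF
• Example: (x, y) = (the, apple), z = "the apple"
• p(the) > p(apple)
• csPMI("the apple", "the") < csPMI("the apple", "apple")
$\vec{z} = \vec{x} + \vec{y}$
$\emptyset$: null word, with vector $\vec{0}$
$\mathrm{csPMI}(x, z) = \mathrm{csPMI}(\emptyset, y)$
$\Rightarrow \mathrm{csPMI}(x, z) = \log p(y) + \delta$
$\vec{z}$, $\vec{x}$, $\vec{y}$, $p(y)$
$p(x) > p(y) \iff \mathrm{csPMI}(z, y) > \mathrm{csPMI}(z, x)$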
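The last equivalence follows in two lines from the composition result. The sketch below assumes that csPMI is symmetric in its arguments, that the same identity applies with the roles of x and y swapped, and that the same constant δ appears in both cases.

```latex
\begin{align*}
\mathrm{csPMI}(z, x) &= \log p(y) + \delta, \qquad
\mathrm{csPMI}(z, y) = \log p(x) + \delta \\
\mathrm{csPMI}(z, y) - \mathrm{csPMI}(z, x)
  &= \log p(x) - \log p(y) = \log \frac{p(x)}{p(y)} \\
\log \frac{p(x)}{p(y)} > 0 &\iff p(x) > p(y)
\end{align*}
```

For (x, y) = (the, apple), p(the) > p(apple) gives csPMI("the apple", "apple") > csPMI("the apple", "the"): the composed vector sits closer to the rarer word.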
10
δ
csPMI
•
• -csPMI
• [Pennington+'14][Arora+'16]
• (16) (2) 2
11
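($\mathrm{csPMI}(x, y) \in (-\infty, 0]$)
$\frac{p(w \mid \text{king})}{p(w \mid \text{queen})} \approx \frac{p(w \mid \text{man})}{p(w \mid \text{woman})}$ ($w \in \text{Vocab}$)
$\vec{\text{king}} - \vec{\text{man}} + \vec{\text{woman}} \approx \vec{\text{queen}}$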
•
• csPMI
• csPMI
• csPMI
12
csPMI
13
csPMI
• -csPMI >0.5
•
•
csPMI
14
csPMI
• csPMI
→ csPMI
•
• Fig2
• (x,y)
• OK
• csPMI
• csPMI
cf. Table1 currency
$\varepsilon_{x,y} = M_{x,y} - \langle \vec{x}, \vec{y}_c \rangle$
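The reconstruction error can be computed directly once the target matrix is fixed. The sketch below assumes the SGNS case, where M_{x,y} is taken to be PMI(x, y) - log k; the function name `sgns_reconstruction_error` and all numbers are invented for illustration.

```python
import math

def sgns_reconstruction_error(pmi_xy, k, x_vec, y_ctx_vec):
    """epsilon_{x,y} = M_{x,y} - <x, y_c>, assuming M_{x,y} = PMI(x, y) - log k."""
    target = pmi_xy - math.log(k)
    inner = sum(a * b for a, b in zip(x_vec, y_ctx_vec))
    return target - inner

# Invented example: PMI = log 4 and k = 2, so the target is log 2.
# The vectors are chosen so that their inner product is also log 2.
x_vec = (1.0, 0.0)
y_ctx_vec = (math.log(2), 5.0)
noiseless = sgns_reconstruction_error(math.log(4), 2, x_vec, y_ctx_vec)

# Perturbing the word vector gives a non-zero error.
noisy = sgns_reconstruction_error(math.log(4), 2, (1.1, 0.0), y_ctx_vec)

print(noiseless, noisy)
```

An error of zero corresponds to the exact identity, a small error to the approximate one, and a large error to the case where the factorization no longer describes the pair.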
15
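$\langle \vec{x}, \vec{y}_c \rangle = \mathrm{PMI}(x, y) - \log k$
$\langle \vec{x}, \vec{y}_c \rangle \approx \mathrm{PMI}(x, y) - \log k$
$\langle \vec{x}, \vec{y}_c \rangle \neq \mathrm{PMI}(x, y) - \log k$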
• csPMI
• csPMI=
• csPMI
16

[Paper introduction] Towards Understanding Linear Word Analogies