SlideShare a Scribd company logo
Concept Cloud-based
Sentiment Visualization for
Financial Reviews
Tomoki Ito*, Kota Tsubouchi**, Hiroki Sakaji*,
Tatsuo Yamashita**, Kiyoshi Izumi*
** Graduate School of Engineering, The University of Tokyo
* Yahoo Japan Corporation
Back ground
• Online reviews are useful for decision making in the
investment.
• e.g., micro-blogs, SNS, and news articles…
Difficulty in Reading Large Volume of Reviews
• To read all the posts should not be practical
• the volume of the posts is sometimes very large.
Difficulty in Reading Large Volume of Reviews
• To read all the posts should not be practical
• the volume of the posts is sometimes very large.
• Framework for visualizing the summary of the
financial reviews is necessary
What is important for decision
making in the investment ?
• In the decision making process, the following
two types of sentiments are important
• Word-level sentiment score
• Concept-level sentiment score
What is important for decision
making in the investment ?
• In the decision making process, the following
two types of sentiments are important
• Word-level sentiment score
• Concept-level sentiment score
Word-level sentiment
• Word-level sentiment means the sentiment scores in
word units
In total, we are in a bull market.
0.1 0.1 0.2 0.0 0.3 1.2 -0.1
Word-level sentiment
• Word-level sentiment means the sentiment scores in
word units
• We should consider the context in this score assignment
• e.g., sentiment shift by “not”
In total, we are in a bull market.
0.1 0.1 0.2 0.0 0.3 1.2 -0.1
In total, we are not in a bull market.
0.1 0.1 0.2 0.0 0.3 -0.1 -0.1
What is important for decision
making in the investment ?
• In the decision making process, the following
two types of sentiments are important
• Word-level sentiment score
• Concept-level sentiment score
Concept-level Sentiment
• Concept-level sentiment means the sentiment
scores in concept units
• Concept means a set of similar words
Up, Down, ↗︎↗︎
Delicious,
Nasty,
Palatable, …
Clean, Dirty
Trend: 0.5 Taste: -0.1
Cleanness: -0.2
Reviews
What is important for decision
making in the investment ?
• In the decision making process, the following
two types of sentiments are important
• Word-level sentiment score
• Concept-level sentiment score
Purpose
• This study aims to develop a method for visualizing
• Word-level sentiment score, and
• Concept-level sentiment score
at the same time in a user-friendly way
Our Approach
• We propose a novel text-visualization framework
called CCSV
Concept Cloud-based Sentiment
Visualization
CCSV Example
• Using CCSV, we can summarize reviews as follows
(The price was inversed www)
(The price was inversed.)
(Go down below 3000yen.
I cannot buy now.)
・
・
・
(Over 1000 reviews in five days)
CCSV Example
• Using CCSV, we can summarize reviews as follows
text-visualization results for a set of reviews for trading company X in
September 25th, 2017 and September 30th, 2017 extracted from the
Yahoo Financial Micro-blogs.
• Color
• Red: Positive
Blue:
Negative
• Size: Volume of
Sentiment
Contribution
Our Contribution is summarized as follows
• We propose a novel text-visualization framework
called CCSV
• We experimentally evaluated the validity of the
CSCV using real dataset
Contribution
Our Contribution is summarized as follows
• We propose a novel text-visualization framework
called CCSV
• We experimentally evaluated the validity of the
CCSV
Concept Cloud-based Sentiment
Visualization
• CCSV is constructed from the following three parts
1. Word-level sentiment Extraction
2. Concept-level sentiment Extraction
3. Word and Concept-level sentiment Visualization
Concept Cloud-based Sentiment
Visualization
• CCSV is constructed from the following three parts
1. Word-level sentiment Extraction
2. Concept-level sentiment Extraction
3. Word and Concept-level sentiment Visualization
Word-level sentiment Extraction
• This step addresses the following contextual
word-level sentiment score assignment task
Input: In total, we are not in a bull market.
In total, we are not in a bull market.
( Sentiment influence: polarity of “bull” is shifted by not)
In total, we are not in a bull market.
(Red and blue words has positive and negative sentiments, respectively )
Original Word-level Sentiment (sentiment score before considering contexts):
Contextual Word-level Sentiment (sentiment score after considering contexts)
Task Setting
• This step aims to assign word-level sentiment scores
using only a text corpus dataset including reviews
and their positive or negative sentiment tags
In total, we are in a bull market.
Review:
Tag: Positive
• We decided this task setting considering the
practicality
Text corpus dataset
Previous approach in Word-level
sentiment Extraction
• Previous works[Vo 2016, Li 2017] address this task by
automatically developing a word sentiment score
dictionary
• However, they cannot consider contexts
Input: In total, we are not in a bull market.
In total, we are not in a bull market.
Cannot consider contexts
Our approach
• We solve this task by estimating
• P (•) : Original word-level sentiment
• R (•) : Contextual word-level sentiment
using the LRP method [L. Arras. et. al., 2017] with the RNN model
Document dataset: {Di}N
i=1 where Di = {wt
i }N
i=1
Sentiment tag:
Our approach
• We solve this challenge by estimating
• P (•) : Original word-level sentiment
• R (•) : Contextual word-level sentiment
using the LRP method [L. Arras. et. al., 2017] with the RNN model
Document dataset: {Di}N
i=1 where Di = {wt
i }N
i=1
Sentiment tag:
LRP-based Approach Process
• We estimate R (•) as follows
LRP-based Approach Process
• We estimate R (•) as follows
1. Develop a RNN model with LSTM cells using Text corpus
dataset including document and their positive or negative
sentiment tags
LRP-based Approach Process
• We estimate R (•) as follows
1. Develop a RNN model with LSTM cells using Text corpus
dataset including document and their positive or negative
sentiment tags
2. Estimate
• R (•) : Contextual word-level sentiment
using the LRP method [L. Arras. et. al., 2017] with the RNN model
In total, we are in a bull market.
R (•) : LRP + RNN (LSTM)
0.1 0.1 0.2 0.0 0.3 1.2 -0.1
Layer-wise Representation Propagation(LRP)
• LRP is the method for interpreting Neural Networks
• LRP calculates the relevance score of the input value to the output
value
• LRP can be used in the RNN with LSTM cells
• the relevance score of each term from the LRP with the RNN is
expected to consider contexts
market
is
bull
Positive
Negative
OutputInput
(L. Arras. et. al., 2017)
28
Concept Cloud-based Sentiment
Visualization
• CCSV is constructed from the following three parts
1. Word-level sentiment Extraction
• using the LRP method
2. Concept-level sentiment Extraction
• with the word-level sentiments and K-means
clustering method
3. Word and Concept-level sentiment Visualization
• using Word Cloud method
Concept-level sentiment Extraction
• This step calculates the concept-level sentiment
score using the k-means clustering method
Up: 0.5
Down: -0.2
↗︎↗︎ : 0.6
delicious: 0.6
nasty: -0.7
Palatable: +1.5
Clean: +0.3
Dirty: -0.2
Reviews
LRP
Up: 0.5
Down: -0.2
↗︎↗︎ : 0.6
delicious: 0.6
nasty: -0.7
Palatable: +1.5
Clean: +0.3
Dirty: -0.2
0.9
+1.4
+0.1
K-means
method
with
word2vec
Concept Cloud-based Sentiment
Visualization
• CCSV is constructed from the following three parts
1. Word-level sentiment Extraction
• using the LRP method
2. Concept-level sentiment Extraction
• with the word-level sentiments and K-means
clustering method
3. Word and Concept-level sentiment Visualization
• using Word Cloud method
Word and Concept-level sentiment Visualization
• This step visualize the word-level and concept-level
sentiment scores using Tag Cloud Approach
Up: 0.5
Down: -0.2
↗︎↗︎ : 0.6
delicious: 0.6
nasty: -0.7
Palatable: +1.5
Clean: +0.3
Dirty: -0.2
0.9
+1.4
+0.1
Up Down ↗︎ ↗︎
Deliciousnasty
Palatable
Clean
Dirty
• Color
• Red: Positive Blue: Negative
• Size: Volume of Sentiment
Contribution
Our Contribution is summarized as follows
• We propose a novel text-visualization framework
called CCSV
• We experimentally evaluated the validity of the
CCSV
Experimental Evaluation
• We evaluated our method from two aspects using
real textual datasets
• Original Sentiment assignment property
• Contextual sentiment assignment property
Dataset
• We evaluated the validity of our approach using the following
dataset
• Text Corpus
• Economic dataset: Current economy watchers survey
• Train: 20,000 positive posts and 20,000 negative posts
• Valid: 2,000 positive posts and 2,000 negative posts
• Test: 4,000 positive posts and 4,000 negative posts
• Yahoo dataset: Yahoo Finance micro-blogs between
September
• Train: 30,612 positive posts and 9,388 negative posts
• Valid: 3,387 positive posts and 1,613 negative posts
• Test: 7,538 positive posts and 2,462 negative posts
Experimental Evaluation
• We evaluated our method from three aspects using
real textual datasets
• Original Sentiment assignment property
• Contextual sentiment assignment property
Original Sentiment assignment property
• How accurately P (•) presents the
positive or negative polarity of each
term in the word polarity list
• Economic word polarity list
• 348 positive and 391 negative words
• We used this list when we estimated P (•)
using the Economic dataset
• Yahoo word polarity list
• 422 positive and 372 negative words
• We used this list when we estimated P (•)
using the Yahoo dataset
Good: Positive
Bad: Negative
Great: Positive
Bullish: Positive
・
・
・
Word polarity list
Comparison Method
• We compared our method with the following
comparison methods
• Word-level sentiment score assignment methods
• PMI
• FLW [D. T. Vo et. al., 2016]
• SONN [Q. Li et. al., 2017]
Result
0.5
0.55
0.6
0.65
0.7
0.75
0.8
0.85
PMI LFW SONN LRP (Our Approach)
Economic Dataset Yahoo Dataset
Our method outperformed the other word-level sentiment
assignment approaches
MacroF1score
Experimental Evaluation
• We evaluated our method from three aspects using
real textual datasets
• Original Sentiment assignment property
• Contextual sentiment assignment property
Contextual Sentiment assignment property
• How accurately the sum of the contextual word-level sentiment
scores in a term of each review in test dataset presents the
positive or negative polarity of the review
In total, we are in a bull market.
R (•) :LRP + RNN (LSTM)
0.1 0.1 0.2 0.0 0.3 1.2 -0.1
0.1 + 0.1 + 0.2 + 0,0 + 0.3 + 1.2 + -0.1 = 1.8
Positive Accurate ?
Comparison Method
• We compared our method with the following
comparison methods
• Word-level sentiment score assignment methods
• PMI
• FLW [D. T. Vo et. al., 2016]
• SONN [Q. Li et. al., 2017]
• LR: Logistic Regression
• RNN with LSTM cells
Result
0.5
0.55
0.6
0.65
0.7
0.75
0.8
0.85
0.9
0.95
1
LR PMI LFW SONN LRP (Our Approach) LSTM
Economic Dataset Yahoo Dataset
Our method outperformed the other word-level sentiment
assignment approaches
(did not work better than the LSTM)
MacroF1score
CCSV Example
• Using CCSV, we can summarize reviews as follows
Text-visualization results for a set of reviews for trading company X in September 25th,
2017 and September 30th, 2017 extracted from the Yahoo Financial Micro-blogs.
Useful for
decision making
process in
investment
• Color
Red: Positive
Blue: Negative
• Size: Volume of
Sentiment
Conclusion
• Summary
• We propose a novel text-visualization
framework called CCSV
• We experimentally evaluated the validity of
the CCSV
• Future work
• We will modify the CCSV more user-friendly
• We will apply this approach to other languages
Previous work
• Rule base ?
• Dictionary base ?
Previous work
• Rule base ?
• Dictionary base ?
Riles in
Financial
domain are too
specific and
specialized
To understand the financial text, we need the specialized dictionary.
Challenge
• To consider both original word-level sentiment and
contexts is challenging problem !
Motivation
• To understand the financial text, we need
• the specialized dictionary for word-level sentiment
• Understand sentiment shift
Back ground
• Non-experts often fail to understand financial text
Layer-wise Representation Propagation(LRP)
• Calculate the relevance score of the input value to the output value by
• starting from the output layer of the neural network and
• backpropagating this quantity up to the input layer.
market
is
bull
Positive
Negative
OutputInput
(L. Arras. et. al., 2017)
52
Layer-wise Representation Propagation(LRP)
zj
Linear Connections
zg
zs
zj
Multiplicative Connections
◎
zi wij
Sigoid 関数などにより(-1,1) に変
換 53
(L. Arras. et. al., 2017)
Layer-wise Representation Propagation(LRP)
zj
Linear Connections
zg
zs
zj
Multiplicative Connections
◎
zi wij
54
(L. Arras. et. al., 2017)
LRP-based Approach Process
• We estimate R (•) as follows
1. Develop a RNN model with LSTM cells using Text corpus
dataset including document and their positive or negative
sentiment tags
In total, we are in a bull market.
Review:
Tag: Positive
Text corpus dataset
Previous approach in Word-level
sentiment Extraction
• Previous works[Vo 2016, Li 2017] address this task by
automatically developing a word sentiment score
dictionary
• However, they cannot consider contexts
Input: In total, we are not in a bull market.
In total, we are not in a bull market.
Cannot consider contexts

More Related Content

PPTX
Abstractive Review Summarization
PDF
PPT
An overview of concepts of Sentiment Analysis
PPT
Sentiment+Analysis.ppt
PDF
Aspect-Level Sentiment Analysis On Hotel Reviews
PDF
Sentiment analysis-by-nltk
PPTX
Sentiment analysis and opinion mining
Abstractive Review Summarization
An overview of concepts of Sentiment Analysis
Sentiment+Analysis.ppt
Aspect-Level Sentiment Analysis On Hotel Reviews
Sentiment analysis-by-nltk
Sentiment analysis and opinion mining

Similar to Concept Cloud-based Sentiment Visualization for Financial Reviews (20)

PDF
Icdm2013 slides
PDF
A Context-Based Algorithm For Sentiment Analysis
PDF
Multimedia data minig and analytics sentiment analysis using social multimedia
PDF
Sentimental analysis of audio based customer reviews without textual conversion
PPTX
Sentiment analysis
PDF
IRJET- Aspect based Sentiment Analysis on Financial Data using Transferred Le...
PDF
Sentiwordnet [IIT-Bombay]
PDF
IRJET- Sentimental Analysis on Audio and Video
PDF
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
PDF
SemEval - Aspect Based Sentiment Analysis
PPT
opinionmining-131221011849-phpapp02-converted.ppt
PDF
A survey on approaches for performing sentiment analysis ijrset october15
PPTX
Continuous Sentiment Intensity Prediction based on Deep Learning
PPT
Opinion Mining
PPTX
Lexicon-Based Sentiment Analysis at GHC 2014
PPTX
Lac presentation
PDF
Infer relationship in social network from images
PDF
#like or #fail - How Can Computers Tell the Difference?
PPT
sa-mincut-aditya.ppt
Icdm2013 slides
A Context-Based Algorithm For Sentiment Analysis
Multimedia data minig and analytics sentiment analysis using social multimedia
Sentimental analysis of audio based customer reviews without textual conversion
Sentiment analysis
IRJET- Aspect based Sentiment Analysis on Financial Data using Transferred Le...
Sentiwordnet [IIT-Bombay]
IRJET- Sentimental Analysis on Audio and Video
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
SemEval - Aspect Based Sentiment Analysis
opinionmining-131221011849-phpapp02-converted.ppt
A survey on approaches for performing sentiment analysis ijrset october15
Continuous Sentiment Intensity Prediction based on Deep Learning
Opinion Mining
Lexicon-Based Sentiment Analysis at GHC 2014
Lac presentation
Infer relationship in social network from images
#like or #fail - How Can Computers Tell the Difference?
sa-mincut-aditya.ppt
Ad

Recently uploaded (20)

PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Leprosy and NLEP programme community medicine
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
New ISO 27001_2022 standard and the changes
PDF
Business Analytics and business intelligence.pdf
PPTX
Introduction to Inferential Statistics.pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Introduction to Data Science and Data Analysis
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
CYBER SECURITY the Next Warefare Tactics
PDF
Transcultural that can help you someday.
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
DOCX
Factor Analysis Word Document Presentation
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
IMPACT OF LANDSLIDE.....................
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Leprosy and NLEP programme community medicine
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
New ISO 27001_2022 standard and the changes
Business Analytics and business intelligence.pdf
Introduction to Inferential Statistics.pptx
ISS -ESG Data flows What is ESG and HowHow
Introduction to Data Science and Data Analysis
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Acceptance and paychological effects of mandatory extra coach I classes.pptx
CYBER SECURITY the Next Warefare Tactics
Transcultural that can help you someday.
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Factor Analysis Word Document Presentation
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
Topic 5 Presentation 5 Lesson 5 Corporate Fin
IMPACT OF LANDSLIDE.....................
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
Ad

Concept Cloud-based Sentiment Visualization for Financial Reviews

  • 1. Concept Cloud-based Sentiment Visualization for Financial Reviews Tomoki Ito*, Kota Tsubouchi**, Hiroki Sakaji*, Tatsuo Yamashita**, Kiyoshi Izumi* ** Graduate School of Engineering, The University of Tokyo * Yahoo Japan Corporation
  • 2. Back ground • Online reviews are useful for decision making in the investment. • e.g., micro-blogs, SNS, and news articles…
  • 3. Difficulty in Reading Large Volume of Reviews • To read all the posts should not be practical • the volume of the posts is sometimes very large.
  • 4. Difficulty in Reading Large Volume of Reviews • To read all the posts should not be practical • the volume of the posts is sometimes very large. • Framework for visualizing the summary of the financial reviews is necessary
  • 5. What is important for decision making in the investment ? • In the decision making process, the following two types of sentiments are important • Word-level sentiment score • Concept-level sentiment score
  • 6. What is important for decision making in the investment ? • In the decision making process, the following two types of sentiments are important • Word-level sentiment score • Concept-level sentiment score
  • 7. Word-level sentiment • Word-level sentiment means the sentiment scores in word units In total, we are in a bull market. 0.1 0.1 0.2 0.0 0.3 1.2 -0.1
  • 8. Word-level sentiment • Word-level sentiment means the sentiment scores in word units • We should consider the context in this score assignment • e.g., sentiment shift by “not” In total, we are in a bull market. 0.1 0.1 0.2 0.0 0.3 1.2 -0.1 In total, we are not in a bull market. 0.1 0.1 0.2 0.0 0.3 -0.1 -0.1
  • 9. What is important for decision making in the investment ? • In the decision making process, the following two types of sentiments are important • Word-level sentiment score • Concept-level sentiment score
  • 10. Concept-level Sentiment • Concept-level sentiment means the sentiment scores in concept units • Concept means a set of similar words Up, Down, ↗︎↗︎ Delicious, Nasty, Palatable, … Clean, Dirty Trend: 0.5 Taste: -0.1 Cleanness: -0.2 Reviews
  • 11. What is important for decision making in the investment ? • In the decision making process, the following two types of sentiments are important • Word-level sentiment score • Concept-level sentiment score
  • 12. Purpose • This study aims to develop a method for visualizing • Word-level sentiment score, and • Concept-level sentiment score at the same time in a user-friendly way
  • 13. Our Approach • We propose a novel text-visualization framework called CCSV Concept Cloud-based Sentiment Visualization
  • 14. CCSV Example • Using CCSV, we can summarize reviews as follows (The price was inversed www) (The price was inversed.) (Go down below 3000yen. I cannot buy now.) ・ ・ ・ (Over 1000 reviews in five days)
  • 15. CCSV Example • Using CCSV, we can summarize reviews as follows text-visualization results for a set of reviews for trading company X in September 25th, 2017 and September 30th, 2017 extracted from the Yahoo Financial Micro-blogs. • Color • Red: Positive Blue: Negative • Size: Volume of Sentiment
  • 16. Contribution Our Contribution is summarized as follows • We propose a novel text-visualization framework called CCSV • We experimentally evaluated the validity of the CSCV using real dataset
  • 17. Contribution Our Contribution is summarized as follows • We propose a novel text-visualization framework called CCSV • We experimentally evaluated the validity of the CCSV
  • 18. Concept Cloud-based Sentiment Visualization • CCSV is constructed from the following three parts 1. Word-level sentiment Extraction 2. Concept-level sentiment Extraction 3. Word and Concept-level sentiment Visualization
  • 19. Concept Cloud-based Sentiment Visualization • CCSV is constructed from the following three parts 1. Word-level sentiment Extraction 2. Concept-level sentiment Extraction 3. Word and Concept-level sentiment Visualization
  • 20. Word-level sentiment Extraction • This step addresses the following contextual word-level sentiment score assignment task Input: In total, we are not in a bull market. In total, we are not in a bull market. ( Sentiment influence: polarity of “bull” is shifted by not) In total, we are not in a bull market. (Red and blue words has positive and negative sentiments, respectively ) Original Word-level Sentiment (sentiment score before considering contexts): Contextual Word-level Sentiment (sentiment score after considering contexts)
  • 21. Task Setting • This step aims to assign word-level sentiment scores using only a text corpus dataset including reviews and their positive or negative sentiment tags In total, we are in a bull market. Review: Tag: Positive • We decided this task setting considering the practicality Text corpus dataset
  • 22. Previous approach in Word-level sentiment Extraction • Previous works[Vo 2016, Li 2017] address this task by automatically developing a word sentiment score dictionary • However, they cannot consider contexts Input: In total, we are not in a bull market. In total, we are not in a bull market. Cannot consider contexts
  • 23. Our approach • We solve this task by estimating • P (•) : Original word-level sentiment • R (•) : Contextual word-level sentiment using the LRP method [L. Arras. et. al., 2017] with the RNN model Document dataset: {Di}N i=1 where Di = {wt i }N i=1 Sentiment tag:
  • 24. Our approach • We solve this challenge by estimating • P (•) : Original word-level sentiment • R (•) : Contextual word-level sentiment using the LRP method [L. Arras. et. al., 2017] with the RNN model Document dataset: {Di}N i=1 where Di = {wt i }N i=1 Sentiment tag:
  • 25. LRP-based Approach Process • We estimate R (•) as follows
  • 26. LRP-based Approach Process • We estimate R (•) as follows 1. Develop a RNN model with LSTM cells using Text corpus dataset including document and their positive or negative sentiment tags
  • 27. LRP-based Approach Process • We estimate R (•) as follows 1. Develop a RNN model with LSTM cells using Text corpus dataset including document and their positive or negative sentiment tags 2. Estimate • R (•) : Contextual word-level sentiment using the LRP method [L. Arras. et. al., 2017] with the RNN model In total, we are in a bull market. R (•) : LRP + RNN (LSTM) 0.1 0.1 0.2 0.0 0.3 1.2 -0.1
  • 28. Layer-wise Representation Propagation(LRP) • LRP is the method for interpreting Neural Networks • LRP calculates the relevance score of the input value to the output value • LRP can be used in the RNN with LSTM cells • the relevance score of each term from the LRP with the RNN is expected to consider contexts market is bull Positive Negative OutputInput (L. Arras. et. al., 2017) 28
  • 29. Concept Cloud-based Sentiment Visualization • CCSV is constructed from the following three parts 1. Word-level sentiment Extraction • using the LRP method 2. Concept-level sentiment Extraction • with the word-level sentiments and K-means clustering method 3. Word and Concept-level sentiment Visualization • using Word Cloud method
  • 30. Concept-level sentiment Extraction • This step calculates the concept-level sentiment score using the k-means clustering method Up: 0.5 Down: -0.2 ↗︎↗︎ : 0.6 delicious: 0.6 nasty: -0.7 Palatable: +1.5 Clean: +0.3 Dirty: -0.2 Reviews LRP Up: 0.5 Down: -0.2 ↗︎↗︎ : 0.6 delicious: 0.6 nasty: -0.7 Palatable: +1.5 Clean: +0.3 Dirty: -0.2 0.9 +1.4 +0.1 K-means method with word2vec
  • 31. Concept Cloud-based Sentiment Visualization • CCSV is constructed from the following three parts 1. Word-level sentiment Extraction • using the LRP method 2. Concept-level sentiment Extraction • with the word-level sentiments and K-means clustering method 3. Word and Concept-level sentiment Visualization • using Word Cloud method
  • 32. Word and Concept-level sentiment Visualization • This step visualize the word-level and concept-level sentiment scores using Tag Cloud Approach Up: 0.5 Down: -0.2 ↗︎↗︎ : 0.6 delicious: 0.6 nasty: -0.7 Palatable: +1.5 Clean: +0.3 Dirty: -0.2 0.9 +1.4 +0.1 Up Down ↗︎ ↗︎ Deliciousnasty Palatable Clean Dirty • Color • Red: Positive Blue: Negative • Size: Volume of Sentiment
  • 33. Contribution Our Contribution is summarized as follows • We propose a novel text-visualization framework called CCSV • We experimentally evaluated the validity of the CCSV
  • 34. Experimental Evaluation • We evaluated our method from two aspects using real textual datasets • Original Sentiment assignment property • Contextual sentiment assignment property
  • 35. Dataset • We evaluated the validity of our approach using the following dataset • Text Corpus • Economic dataset: Current economy watchers survey • Train: 20,000 positive posts and 20,000 negative posts • Valid: 2,000 positive posts and 2,000 negative posts • Test: 4,000 positive posts and 4,000 negative posts • Yahoo dataset: Yahoo Finance micro-blogs between September • Train: 30,612 positive posts and 9,388 negative posts • Valid: 3,387 positive posts and 1,613 negative posts • Test: 7,538 positive posts and 2,462 negative posts
  • 36. Experimental Evaluation • We evaluated our method from three aspects using real textual datasets • Original Sentiment assignment property • Contextual sentiment assignment property
  • 37. Original Sentiment assignment property • How accurately P (•) presents the positive or negative polarity of each term in the word polarity list • Economic word polarity list • 348 positive and 391 negative words • We used this list when we estimated P (•) using the Economic dataset • Yahoo word polarity list • 422 positive and 372 negative words • We used this list when we estimated P (•) using the Yahoo dataset Good: Positive Bad: Negative Great: Positive Bullish: Positive ・ ・ ・ Word polarity list
  • 38. Comparison Method • We compared our method with the following comparison methods • Word-level sentiment score assignment methods • PMI • FLW [D. T. Vo et. al., 2016] • SONN [Q. Li et. al., 2017]
  • 39. Result 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 PMI LFW SONN LRP (Our Approach) Economic Dataset Yahoo Dataset Our method outperformed the other word-level sentiment assignment approaches MacroF1score
  • 40. Experimental Evaluation • We evaluated our method from three aspects using real textual datasets • Original Sentiment assignment property • Contextual sentiment assignment property
  • 41. Contextual Sentiment assignment property • How accurately the sum of the contextual word-level sentiment scores in a term of each review in test dataset presents the positive or negative polarity of the review In total, we are in a bull market. R (•) :LRP + RNN (LSTM) 0.1 0.1 0.2 0.0 0.3 1.2 -0.1 0.1 + 0.1 + 0.2 + 0,0 + 0.3 + 1.2 + -0.1 = 1.8 Positive Accurate ?
  • 42. Comparison Method • We compared our method with the following comparison methods • Word-level sentiment score assignment methods • PMI • FLW [D. T. Vo et. al., 2016] • SONN [Q. Li et. al., 2017] • LR: Logistic Regression • RNN with LSTM cells
  • 43. Result 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 LR PMI LFW SONN LRP (Our Approach) LSTM Economic Dataset Yahoo Dataset Our method outperformed the other word-level sentiment assignment approaches (did not work better than the LSTM) MacroF1score
  • 44. CCSV Example • Using CCSV, we can summarize reviews as follows Text-visualization results for a set of reviews for trading company X in September 25th, 2017 and September 30th, 2017 extracted from the Yahoo Financial Micro-blogs. Useful for decision making process in investment • Color Red: Positive Blue: Negative • Size: Volume of Sentiment
  • 45. Conclusion • Summary • We propose a novel text-visualization framework called CCSV • We experimentally evaluated the validity of the CCSV • Future work • We will modify the CCSV more user-friendly • We will apply this approach to other languages
  • 46. Previous work • Rule base ? • Dictionary base ?
  • 47. Previous work • Rule base ? • Dictionary base ? Riles in Financial domain are too specific and specialized
  • 48. To understand the financial text, we need the specialized dictionary.
  • 49. Challenge • To consider both original word-level sentiment and contexts is challenging problem !
  • 50. Motivation • To understand the financial text, we need • the specialized dictionary for word-level sentiment • Understand sentiment shift
  • 51. Back ground • Non-experts often fail to understand financial text
  • 52. Layer-wise Representation Propagation(LRP) • Calculate the relevance score of the input value to the output value by • starting from the output layer of the neural network and • backpropagating this quantity up to the input layer. market is bull Positive Negative OutputInput (L. Arras. et. al., 2017) 52
  • 53. Layer-wise Representation Propagation(LRP) zj Linear Connections zg zs zj Multiplicative Connections ◎ zi wij Sigoid 関数などにより(-1,1) に変 換 53 (L. Arras. et. al., 2017)
  • 54. Layer-wise Representation Propagation(LRP) zj Linear Connections zg zs zj Multiplicative Connections ◎ zi wij 54 (L. Arras. et. al., 2017)
  • 55. LRP-based Approach Process • We estimate R (•) as follows 1. Develop a RNN model with LSTM cells using Text corpus dataset including document and their positive or negative sentiment tags In total, we are in a bull market. Review: Tag: Positive Text corpus dataset
  • 56. Previous approach in Word-level sentiment Extraction • Previous works[Vo 2016, Li 2017] address this task by automatically developing a word sentiment score dictionary • However, they cannot consider contexts Input: In total, we are not in a bull market. In total, we are not in a bull market. Cannot consider contexts

Editor's Notes