SlideShare a Scribd company logo
2
Most read
4
Most read
6
Most read
Stock Market Prediction
Using Data Mining
By
Shivakumar Soppannavar
CMPE 239
Under the Guidance of
Prof. Eirinaki Magdalini
11/10/2015
Different machine learning algorithms are used to predict the stock market trading.
Use text from different sources and use Text and Data Mining (TDM) to extract pattern or
information or any hidden data of interest to predict the Ups and downs of the targeted
stocks.
Then
Data Mining Isn't a Good Bet For Stock-Market Predictions [2]
Aug. 8, 2009 - JASON ZWEIG , Wall Street Journal
Now
How Traders Are Using Text and Data Mining to Beat the Market [3]
Feb 12 2015 - Market Roy Kaufman , The Street
Applying Machine Learning to Stock Market Trading - Bryce Taylor [1]
Machine learning algorithm to read headlines from financial news magazines and
make predictions on the directional change of stock prices after a moderate-length
time interval
[Stanford Student project 2013, CS 229]
Introduction
Data Sources and Research question
Twitter data to predict stock market changes
Change in management, M&A
Intermittent headlines to react to the first headlines up or down ???
Data sources:
 Headlines from financial analysts
http://seekingalpha.com/
 Historic stock prices
http://www.nasdaq.com/
 7 targeted companies
IBM, NFLX, GOOG, ANF, MCD, SHLD, AAPL
Research Questions:
“Given a headline released today about some company X, will the stock price of X
rise by more than P percent over the next time period T?”
T= 3 months
Bayesian Classifiers
Bayesian Classifier
 Simple multinomial Bayesian classifier that analyze the headlines based on the
presence of each token in the headline
 51202 tokens -> Laplace smoothening -> 693 tokens -> Top 10 tokens
 Classification Error for Reduced features < 0.5
Precision/Recall
 Increase in P increases the Positive error and decrease in Negative error
Support Vector Machines
 SVM (Polynomial, linear, etc) was used on reduced data set, didn’t beat the
result obtained from Bayesian classifier
Naïve Baye’s Testing Error
Table 1: Bayesian classifier result
run for top 10 most indicative
symbols
Few more ways of analysis!
Natural Language Processing
 Stanford has a publicly available Natural Language Processing Toolkit that
provides sentiment analysis to sentences with high accuracy (>80%)
 Use of NLP didn’t achieve high success
 Natural language processors would need to be specifically tailored to processing
headline-like data to be able to make a meaningful contribution towards
answering my research questions.
Principal Component Analysis
 Principal component analysis are run on the data and then tested linear SVMs on
several of the top principal components.
Manual Key word Selection
 Keywords are selected manually
Few more ways of analysis, Results
Principal Component Analysis Manual Key word Selection
Conclusion
Sophisticated model able to beat overall market trends by reading financial news
headlines cannot be easily found without fairly sophisticated human-like processing
of the headlines. –By Author
Examples:
Tweet on Credit card breach at Home Depot (HD) -> Stocks 2% down. (9/2/2014) [3]
Nate Silver's uncannily accurate predictions of the U.S. national elections. (2012) [3]
Yes, by using Text and Data Mining and superior algorithms in near future, we may be
able to predict the stock market with greater accuracy.
Thank you
References
1. B. Taylor. (2013). “Applying Machine Learning to Stock Market Trading”. Retrieved from
Stanford CS229 project lists 2013.
http://cs229.stanford.edu/proj2013/Taylor-
Applying%20Machine%20Learning%20to%20Stock%20Market%20Trading.pdf
2. JASON ZWEIG , (Aug. 8, 2009). Retrieved from Wall Street Journal website
http://www.wsj.com/articles/SB124967937642715417
3. M. R. Kaufman,(Feb 12 2015). Retrieved from The Street website
http://www.thestreet.com/story/13044694/2/how-traders-are-using-text-and-data-
mining-to-beat-the-market.html
4. http://cs229.stanford.edu/projects2013.html

More Related Content

PDF
Stock Market Price Prediction Using Technical Analysis
PPT
STOCK MARKET PRREDICTION WITH FEATURE EXTRACTION USING NEURAL NETWORK TEHNIQUE
PDF
STOCK MARKET PREDICTION USING MACHINE LEARNING METHODS
PDF
Stock Price Trend Forecasting using Supervised Learning
PPTX
Stock Price Prediction PPT
PPTX
Stock Market Prediction
PPTX
Stock Market Prediction
PPT
STOCK MARKET PREDICTION
Stock Market Price Prediction Using Technical Analysis
STOCK MARKET PRREDICTION WITH FEATURE EXTRACTION USING NEURAL NETWORK TEHNIQUE
STOCK MARKET PREDICTION USING MACHINE LEARNING METHODS
Stock Price Trend Forecasting using Supervised Learning
Stock Price Prediction PPT
Stock Market Prediction
Stock Market Prediction
STOCK MARKET PREDICTION

What's hot (20)

PPTX
Stock Market Prediction using Machine Learning
PDF
Stock Market Analysis
PDF
A Comparison of Stock Trend Prediction Using Accuracy Driven Neural Network V...
PDF
Stock Market Prediction.pptx
PDF
IRJET- Future Stock Price Prediction using LSTM Machine Learning Algorithm
PPTX
Final PPT.pptx
PPTX
Stock Price Prediction
PPT
STOCK MARKET PREDICTION
PPTX
stock market prediction
PPT
AI Lecture 7 (uncertainty)
DOC
Aditya report finaL
PDF
Deep Learning for Stock Prediction
PDF
IRJET- Stock Market Prediction using Machine Learning
PPTX
Machine learning: Stock Price Prediction
DOCX
Stock Market Analysis and Prediction
PPTX
Stock-market-prediction.pptx
PDF
Stock market analysis
PDF
Data Streaming For Big Data
PPTX
Performance analysis and prediction of stock market for investment decision u...
PDF
Google Stock Price Forecasting
Stock Market Prediction using Machine Learning
Stock Market Analysis
A Comparison of Stock Trend Prediction Using Accuracy Driven Neural Network V...
Stock Market Prediction.pptx
IRJET- Future Stock Price Prediction using LSTM Machine Learning Algorithm
Final PPT.pptx
Stock Price Prediction
STOCK MARKET PREDICTION
stock market prediction
AI Lecture 7 (uncertainty)
Aditya report finaL
Deep Learning for Stock Prediction
IRJET- Stock Market Prediction using Machine Learning
Machine learning: Stock Price Prediction
Stock Market Analysis and Prediction
Stock-market-prediction.pptx
Stock market analysis
Data Streaming For Big Data
Performance analysis and prediction of stock market for investment decision u...
Google Stock Price Forecasting
Ad

Viewers also liked (17)

PPTX
Stock market prediction technique:
PDF
Software for Stock Market Prediction
PPTX
Data mining and knowledge discovery
PDF
Data Mining methodology
PPTX
An intelligent scalable stock market prediction system
PPTX
GDP PREDICTION AND ANALYSIS USING DATA MINING TECHNIQUES
PDF
2558 project
PDF
Prediction of stock market index using genetic algorithm
PDF
PPTX
Presentation1
PPT
presentation of stock valuation
PPT
1.PPT (1.PREDICTION OF DISEASES New)
PPT
Data mining in agriculture
PPTX
Earthquake prediction
PPT
Capital Markets Development in Bangladesh: The Status of Dhaka Stock Exchange
PPTX
Data mining in Telecommunications
PPTX
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
Stock market prediction technique:
Software for Stock Market Prediction
Data mining and knowledge discovery
Data Mining methodology
An intelligent scalable stock market prediction system
GDP PREDICTION AND ANALYSIS USING DATA MINING TECHNIQUES
2558 project
Prediction of stock market index using genetic algorithm
Presentation1
presentation of stock valuation
1.PPT (1.PREDICTION OF DISEASES New)
Data mining in agriculture
Earthquake prediction
Capital Markets Development in Bangladesh: The Status of Dhaka Stock Exchange
Data mining in Telecommunications
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
Ad

Similar to Stock market prediction using data mining (20)

PDF
Stock Market Prediction Using Artificial Neural Network
PDF
Data-Driven Approach to Stock Market Prediction and Sentiment Analysis
PPTX
updated stock market ppt.pptx stock market presentation
PDF
stock price prediction using sentiment analysis
PDF
IRJET- Prediction in Stock Marketing
PDF
IRJET- Stock Market Prediction using Financial News Articles
PDF
CASE STUDY ON STOCK MARKET PREDICTION IN ML
PDF
Stock Price Prediction Using Sentiment Analysis and Historic Data of Stock
PDF
Stock Market Prediction Analysis
PDF
Stock Market Prediction using Machine Learning
PPTX
Stock Price Prediction using ML Techniques
PDF
STOCK MARKET PREDICTION AND ANALYSIS USING MACHINE LEARNING ALGORITHMS
PDF
IRJET - Stock Market Analysis and Prediction using Deep Learning
PPTX
BATCH 1 FIRST REVIEW-1.pptx
PDF
En36855867
PDF
IRJET - Stock Market Analysis and Prediction
PPTX
Stock prediction1600759770283_ak.ppt.pptx
PDF
IRJET- Prediction of Stock Market using Machine Learning Algorithms
PDF
Investment Portfolio Risk Manager using Machine Learning and Deep-Learning.
PDF
IRJET - Stock Market Prediction using Machine Learning Algorithm
Stock Market Prediction Using Artificial Neural Network
Data-Driven Approach to Stock Market Prediction and Sentiment Analysis
updated stock market ppt.pptx stock market presentation
stock price prediction using sentiment analysis
IRJET- Prediction in Stock Marketing
IRJET- Stock Market Prediction using Financial News Articles
CASE STUDY ON STOCK MARKET PREDICTION IN ML
Stock Price Prediction Using Sentiment Analysis and Historic Data of Stock
Stock Market Prediction Analysis
Stock Market Prediction using Machine Learning
Stock Price Prediction using ML Techniques
STOCK MARKET PREDICTION AND ANALYSIS USING MACHINE LEARNING ALGORITHMS
IRJET - Stock Market Analysis and Prediction using Deep Learning
BATCH 1 FIRST REVIEW-1.pptx
En36855867
IRJET - Stock Market Analysis and Prediction
Stock prediction1600759770283_ak.ppt.pptx
IRJET- Prediction of Stock Market using Machine Learning Algorithms
Investment Portfolio Risk Manager using Machine Learning and Deep-Learning.
IRJET - Stock Market Prediction using Machine Learning Algorithm

Recently uploaded (20)

PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
UNIT 4 Total Quality Management .pptx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPT
Mechanical Engineering MATERIALS Selection
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPTX
Lecture Notes Electrical Wiring System Components
PDF
composite construction of structures.pdf
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
additive manufacturing of ss316l using mig welding
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Sustainable Sites - Green Building Construction
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
UNIT 4 Total Quality Management .pptx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
R24 SURVEYING LAB MANUAL for civil enggi
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
bas. eng. economics group 4 presentation 1.pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Mechanical Engineering MATERIALS Selection
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Lecture Notes Electrical Wiring System Components
composite construction of structures.pdf
Operating System & Kernel Study Guide-1 - converted.pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
additive manufacturing of ss316l using mig welding
Internet of Things (IOT) - A guide to understanding
Sustainable Sites - Green Building Construction
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx

Stock market prediction using data mining

  • 1. Stock Market Prediction Using Data Mining By Shivakumar Soppannavar CMPE 239 Under the Guidance of Prof. Eirinaki Magdalini 11/10/2015
  • 2. Different machine learning algorithms are used to predict the stock market trading. Use text from different sources and use Text and Data Mining (TDM) to extract pattern or information or any hidden data of interest to predict the Ups and downs of the targeted stocks. Then Data Mining Isn't a Good Bet For Stock-Market Predictions [2] Aug. 8, 2009 - JASON ZWEIG , Wall Street Journal Now How Traders Are Using Text and Data Mining to Beat the Market [3] Feb 12 2015 - Market Roy Kaufman , The Street Applying Machine Learning to Stock Market Trading - Bryce Taylor [1] Machine learning algorithm to read headlines from financial news magazines and make predictions on the directional change of stock prices after a moderate-length time interval [Stanford Student project 2013, CS 229] Introduction
  • 3. Data Sources and Research question Twitter data to predict stock market changes Change in management, M&A Intermittent headlines to react to the first headlines up or down ??? Data sources:  Headlines from financial analysts http://seekingalpha.com/  Historic stock prices http://www.nasdaq.com/  7 targeted companies IBM, NFLX, GOOG, ANF, MCD, SHLD, AAPL Research Questions: “Given a headline released today about some company X, will the stock price of X rise by more than P percent over the next time period T?” T= 3 months
  • 4. Bayesian Classifiers Bayesian Classifier  Simple multinomial Bayesian classifier that analyze the headlines based on the presence of each token in the headline  51202 tokens -> Laplace smoothening -> 693 tokens -> Top 10 tokens  Classification Error for Reduced features < 0.5 Precision/Recall  Increase in P increases the Positive error and decrease in Negative error Support Vector Machines  SVM (Polynomial, linear, etc) was used on reduced data set, didn’t beat the result obtained from Bayesian classifier
  • 5. Naïve Baye’s Testing Error Table 1: Bayesian classifier result run for top 10 most indicative symbols
  • 6. Few more ways of analysis! Natural Language Processing  Stanford has a publicly available Natural Language Processing Toolkit that provides sentiment analysis to sentences with high accuracy (>80%)  Use of NLP didn’t achieve high success  Natural language processors would need to be specifically tailored to processing headline-like data to be able to make a meaningful contribution towards answering my research questions. Principal Component Analysis  Principal component analysis are run on the data and then tested linear SVMs on several of the top principal components. Manual Key word Selection  Keywords are selected manually
  • 7. Few more ways of analysis, Results Principal Component Analysis Manual Key word Selection
  • 8. Conclusion Sophisticated model able to beat overall market trends by reading financial news headlines cannot be easily found without fairly sophisticated human-like processing of the headlines. –By Author Examples: Tweet on Credit card breach at Home Depot (HD) -> Stocks 2% down. (9/2/2014) [3] Nate Silver's uncannily accurate predictions of the U.S. national elections. (2012) [3] Yes, by using Text and Data Mining and superior algorithms in near future, we may be able to predict the stock market with greater accuracy.
  • 10. References 1. B. Taylor. (2013). “Applying Machine Learning to Stock Market Trading”. Retrieved from Stanford CS229 project lists 2013. http://cs229.stanford.edu/proj2013/Taylor- Applying%20Machine%20Learning%20to%20Stock%20Market%20Trading.pdf 2. JASON ZWEIG , (Aug. 8, 2009). Retrieved from Wall Street Journal website http://www.wsj.com/articles/SB124967937642715417 3. M. R. Kaufman,(Feb 12 2015). Retrieved from The Street website http://www.thestreet.com/story/13044694/2/how-traders-are-using-text-and-data- mining-to-beat-the-market.html 4. http://cs229.stanford.edu/projects2013.html

Editor's Notes

  • #3: Text mining is the data analysis of natural language works (articles, books, etc.), using text as a form of data. It is often joined with data mining, the numeric analysis of data works (like filings and reports), and referred to as "text and data mining" or, simply, "TDM.“ [3]
  • #5: https://en.wikipedia.org/wiki/Laplacian_smoothing Support vector machines (SVMs) are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis.