SlideShare a Scribd company logo
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
USING HASHTAG GRAPH-BASED TOPIC MODEL TO CONNECT SEMANTICALLY-
RELATED WORDS WITHOUT CO-OCCURRENCE IN MICROBLOGS
ABSTRACT:
In this paper, we introduce a new topic model to understand the chaotic
microblogging environment by using hashtag graphs. Inferring topics on
Twitter becomes a vital but challenging task in many important applications.
The shortness and informality of tweets leads to extreme sparse vector
representations with a large vocabulary. This makes the conventional topic
models (e.g., Latent Dirichlet Allocation and Latent Semantic Analysis ) fail to
learn high quality topic structures. Tweets are always showing up with rich
user-generated hashtags. The hashtags make tweets semi-structured inside
and semantically related to each other. Since hashtags are utilized as keywords
in tweets to mark messages or to form conversations, they provide an
additional path to connect semantically related words. In this paper, treating
tweets as semi-structured texts, we propose a novel topic model, denoted as
Hashtag Graph-based Topic Model (HGTM) to discover topics of tweets. By
utilizing hashtag relation information in hashtag graphs, HGTM is able to
discover word semantic relations even if words are not co-occurred within a
specific tweet. With this method, HGTM successfully alleviates the sparsity
problem. Our investigation illustrates that the user-contributed hashtags could
serve as weakly-supervised information for topic modeling, and the relation
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
between hashtags could reveal latent semantic relation between words. We
evaluate the effectiveness of HGTM on tweet (hashtag) clustering and hashtag
classification problems. Experiments on two real-world tweet data sets show
that HGTM has strong capability to handle sparseness and noise problem in
tweets. Furthermore, HGTM can discover more distinct and coherent topics
than the state-of-the-art baselines.
CONCLUSION:
Uncovering topics within tweets has become a vital task for widespread
content analysis and social media mining. Different from modeling normal text,
tweet mining has suffered a great deal of sparseness and informality problems.
In this work, we consider that users have provided hashtags as a powerful and
valuable data source in the vast amount of tweets on the web. This paper
presents HGTM that first introduces the hashtag relation graphs as weakly-
supervised information for tweet semantic modeling. We demonstrate that
hashtag graphs contain reliable information to bridge semantically-related
words in sparse short texts. HGTM can enhance semantic relations between
tweets and reduce noise at the same time. Compared to single document
oriented topic models (e.g., LSA, LDA, ATM, TWTM, TWDA), HGTM has a better
ability to capture semantic relations between words with or without co-
occurrence by utilizing the wisdom of crowds from user-generated hashtags.
The model provides a more robust solution for tweet modeling than
aggregation strategies with traditional topic models. We also prove that LDA
framework inherently can not benefit from hashtag graphs. We achieve
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
significant improvement on the performance of content mining tasks, such as
tweet clustering, hashtag clustering and hashtag classification. HGTM discovers
more readable and distinguishable topics than the stat-of-the-art models as
well. This paper shows one effective alternative of utilizing usercontributed
hashtags for tweet topic modeling to handle both sparseness and noise in
tweets. However, there are still many questions which need to be explored.
For example, we would like to explore reasonable and effective ways of
combining multimodal hash tag relations for tweet modelling and to model
time
REFERENCES
[1] D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent Dirichlet Allocation,” the
Journal of machine Learning research, vol. 3, pp. 993–1022, 2003
[2] S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A.
Harshman, “Indexing by latent semantic analysis,” JASIS, vol. 41, no. 6, pp.
391–407, 1990.
[3] S. Vieweg, A. L. Hughes, K. Starbird, and L. Palen, “Microblogging during two
natural hazards events: What Twitter may contribute to situational
awareness,” in Proceedings of the SIGCHI Conference on Human Factors in
Computing Systems, ser. CHI ’10. New York, NY, USA: ACM, 2010, pp. 1079–
1088.
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
[4] Y. Chen, H. Amiri, Z. Li, and T.-S. Chua, “Emerging topic detection for
organizations from microblogs,” in Proceedings of the 36th International ACM
SIGIR Conference on Research and Development in Information Retrieval, ser.
SIGIR ’13. New York, NY, USA: ACM, 2013, pp. 43– 52.
[5] J. Chen, R. Nairn, L. Nelson, M. Bernstein, and E. Chi, “Short and tweet:
Experiments on recommending content from information streams,” in
Proceedings of the SIGCHI Conference on Human Factors in Computing
Systems, ser. CHI ’10. New York, NY, USA: ACM, 2010, pp. 1185– 1194.
[6] K. Tao, F. Abel, Q. Gao, and G.-J. Houben, “TUMS: Twitter-based user
modeling service,” in The Semantic Web: ESWC 2011 Workshops, ser. Lecture
Notes in Computer Science, R. Garca-Castro, D. Fensel, and G. Antoniou, Eds.
Springer Berlin Heidelberg, 2012, vol. 7117, pp. 269–283.
[7] A. Dong, R. Zhang, P. Kolari, J. Bai, F. Diaz, Y. Chang, Z. Zheng, and H. Zha,
“Time is of the essence: Improving recency ranking using Twitter data,” in
Proceedings of the 19th International Conference on World Wide Web, ser.
WWW ’10. New York, NY, USA: ACM, 2010, pp. 331–340.
[8] T. Hofmann, “Probabilistic latent semantic indexing,” in Proceedings of the
22Nd Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval, ser. SIGIR ’99. New York, NY, USA: ACM,
1999, pp. 50–57.
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
[9] L. Hong and B. D. Davison, “Empical study of topic modeling in Twitter,” in
Proceedings of the First Workshop on Social Media Analytics, ser. SOMA ’10.
New York, NY, USA: ACM, 2010, pp. 80–88.
[10] R. Mehrotra, S. Sanner, W. Buntine, and L. Xie, “Improving LDA topic
models for microblogs via tweet pooling and automatic labeling,” in
Proceedings of the 36th International ACM SIGIR Conference on Research and
Development in Information Retrieval, ser. SIGIR ’13. New York, NY, USA: ACM,
2013, pp. 889–892.

More Related Content

PDF
Tweet Segmentation and Its Application to Named Entity Recognition
PDF
1026 telling story from text 2
PDF
Using hash tag graph based topic model to connect semantically-related words ...
PDF
Exploring the Current Trends and Future Prospects in Terrorist Network Mining
PDF
Spreading processes on temporal networks
PDF
Why rumors spread fast in social networks
PDF
IRJET- Fake News Detection and Rumour Source Identification
PDF
Latent Dirichlet Allocation as a Twitter Hashtag Recommendation System
Tweet Segmentation and Its Application to Named Entity Recognition
1026 telling story from text 2
Using hash tag graph based topic model to connect semantically-related words ...
Exploring the Current Trends and Future Prospects in Terrorist Network Mining
Spreading processes on temporal networks
Why rumors spread fast in social networks
IRJET- Fake News Detection and Rumour Source Identification
Latent Dirichlet Allocation as a Twitter Hashtag Recommendation System

What's hot (19)

PDF
Multiple Regression to Analyse Social Graph of Brand Awareness
PDF
A QUERY LEARNING ROUTING APPROACH BASED ON SEMANTIC CLUSTERS
PDF
Acm tist-v3 n4-tist-2010-11-0317
PPTX
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
PDF
Effective Data Retrieval System with Bloom in a Unstructured p2p Network
DOCX
Tweet segmentation and its application to named entity recognition
PPTX
PPTX
A comparative study of social network analysis tools
PDF
Predicting_new_friendships_in_social_networks
PDF
Disease spreading & control in temporal networks
PPTX
Conversation graphs in Online Social Media
PDF
Tweet segmentation and its application to named entity recognition
PDF
Tweet segmentation and its application to named entity recognition.
PDF
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
PDF
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING
PPTX
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
PPT
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
DOC
Poster Abstracts
PDF
Unsupervised Learning of a Social Network from a Multiple-Source News Corpus
Multiple Regression to Analyse Social Graph of Brand Awareness
A QUERY LEARNING ROUTING APPROACH BASED ON SEMANTIC CLUSTERS
Acm tist-v3 n4-tist-2010-11-0317
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
Effective Data Retrieval System with Bloom in a Unstructured p2p Network
Tweet segmentation and its application to named entity recognition
A comparative study of social network analysis tools
Predicting_new_friendships_in_social_networks
Disease spreading & control in temporal networks
Conversation graphs in Online Social Media
Tweet segmentation and its application to named entity recognition
Tweet segmentation and its application to named entity recognition.
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Poster Abstracts
Unsupervised Learning of a Social Network from a Multiple-Source News Corpus
Ad

Similar to USING HASHTAG GRAPH-BASED TOPIC MODEL TO CONNECT SEMANTICALLY-RELATED WORDS WITHOUT CO-OCCURRENCE IN MICROBLOGS (20)

PDF
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
PDF
IRJET- Tweet Segmentation and its Application to Named Entity Recognition
PDF
Hashtagger+: Real-time Social Tagging of Streaming News - Dr. Georgiana Ifrim
PDF
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
PDF
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
PDF
A SEMANTIC METADATA ENRICHMENT SOFTWARE ECOSYSTEM BASED ON TOPIC METADATA ENR...
PPTX
A Topic Analysis Approach To Revealing Discussions On The Australian Twitters...
PPTX
Learning from Twitter Hashtags: Leveraging Proximate Tags to Enhance Graph-ba...
PDF
IRJET-A Review on Topic Detection and Term-Term Relation Analysis in Big Data
PDF
SubTopic Detection of Tweets Related to an Entity
PPTX
Automatic and unsupervised topic discovery in social networks
PDF
Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
PPTX
Slides ecir2016
PDF
Tweet Summarization and Segmentation: A Survey
PPTX
Real time twitter trend mining system – rt2 m
PDF
Learning to recommend with user generated content
PDF
An adaptive clustering and classification algorithm for Twitter data streamin...
PDF
Twitter as a personalizable information service ii
PPTX
Self Trending a Tweet - Cluster and Topic Analysis on Tweets
PDF
IRJET - Socirank Identifying and Ranking Prevalent News Topics using Social M...
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
IRJET- Tweet Segmentation and its Application to Named Entity Recognition
Hashtagger+: Real-time Social Tagging of Streaming News - Dr. Georgiana Ifrim
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
A SEMANTIC METADATA ENRICHMENT SOFTWARE ECOSYSTEM BASED ON TOPIC METADATA ENR...
A Topic Analysis Approach To Revealing Discussions On The Australian Twitters...
Learning from Twitter Hashtags: Leveraging Proximate Tags to Enhance Graph-ba...
IRJET-A Review on Topic Detection and Term-Term Relation Analysis in Big Data
SubTopic Detection of Tweets Related to an Entity
Automatic and unsupervised topic discovery in social networks
Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Slides ecir2016
Tweet Summarization and Segmentation: A Survey
Real time twitter trend mining system – rt2 m
Learning to recommend with user generated content
An adaptive clustering and classification algorithm for Twitter data streamin...
Twitter as a personalizable information service ii
Self Trending a Tweet - Cluster and Topic Analysis on Tweets
IRJET - Socirank Identifying and Ranking Prevalent News Topics using Social M...
Ad

More from Nexgen Technology (20)

DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CH...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENN...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENNA...
DOCX
Ieee 2020 21 vlsi projects in pondicherry,ieee vlsi projects in chennai
DOCX
Ieee 2020 21 power electronics in pondicherry,Ieee 2020 21 power electronics
DOCX
Ieee 2020 -21 ns2 in pondicherry, Ieee 2020 -21 ns2 projects,best project cen...
DOCX
Ieee 2020 21 ns2 in pondicherry,best project center in pondicherry,final year...
DOCX
Ieee 2020 21 java dotnet in pondicherry,final year projects in pondicherry,pr...
DOCX
Ieee 2020 21 iot in pondicherry,final year projects in pondicherry,project ce...
DOCX
Ieee 2020 21 blockchain in pondicherry,final year projects in pondicherry,bes...
DOCX
Ieee 2020 -21 bigdata in pondicherry,project center in pondicherry,best proje...
DOCX
Ieee 2020 21 embedded in pondicherry,final year projects in pondicherry,best...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CH...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENN...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENNA...
Ieee 2020 21 vlsi projects in pondicherry,ieee vlsi projects in chennai
Ieee 2020 21 power electronics in pondicherry,Ieee 2020 21 power electronics
Ieee 2020 -21 ns2 in pondicherry, Ieee 2020 -21 ns2 projects,best project cen...
Ieee 2020 21 ns2 in pondicherry,best project center in pondicherry,final year...
Ieee 2020 21 java dotnet in pondicherry,final year projects in pondicherry,pr...
Ieee 2020 21 iot in pondicherry,final year projects in pondicherry,project ce...
Ieee 2020 21 blockchain in pondicherry,final year projects in pondicherry,bes...
Ieee 2020 -21 bigdata in pondicherry,project center in pondicherry,best proje...
Ieee 2020 21 embedded in pondicherry,final year projects in pondicherry,best...

Recently uploaded (20)

PDF
Pre independence Education in Inndia.pdf
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
Institutional Correction lecture only . . .
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Sports Quiz easy sports quiz sports quiz
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Classroom Observation Tools for Teachers
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
Lesson notes of climatology university.
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
master seminar digital applications in india
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Pre independence Education in Inndia.pdf
O5-L3 Freight Transport Ops (International) V1.pdf
2.FourierTransform-ShortQuestionswithAnswers.pdf
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Institutional Correction lecture only . . .
Abdominal Access Techniques with Prof. Dr. R K Mishra
TR - Agricultural Crops Production NC III.pdf
Sports Quiz easy sports quiz sports quiz
Microbial diseases, their pathogenesis and prophylaxis
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Classroom Observation Tools for Teachers
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
O7-L3 Supply Chain Operations - ICLT Program
Lesson notes of climatology university.
Pharma ospi slides which help in ospi learning
Pharmacology of Heart Failure /Pharmacotherapy of CHF
master seminar digital applications in india
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...

USING HASHTAG GRAPH-BASED TOPIC MODEL TO CONNECT SEMANTICALLY-RELATED WORDS WITHOUT CO-OCCURRENCE IN MICROBLOGS

  • 1. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com USING HASHTAG GRAPH-BASED TOPIC MODEL TO CONNECT SEMANTICALLY- RELATED WORDS WITHOUT CO-OCCURRENCE IN MICROBLOGS ABSTRACT: In this paper, we introduce a new topic model to understand the chaotic microblogging environment by using hashtag graphs. Inferring topics on Twitter becomes a vital but challenging task in many important applications. The shortness and informality of tweets leads to extreme sparse vector representations with a large vocabulary. This makes the conventional topic models (e.g., Latent Dirichlet Allocation and Latent Semantic Analysis ) fail to learn high quality topic structures. Tweets are always showing up with rich user-generated hashtags. The hashtags make tweets semi-structured inside and semantically related to each other. Since hashtags are utilized as keywords in tweets to mark messages or to form conversations, they provide an additional path to connect semantically related words. In this paper, treating tweets as semi-structured texts, we propose a novel topic model, denoted as Hashtag Graph-based Topic Model (HGTM) to discover topics of tweets. By utilizing hashtag relation information in hashtag graphs, HGTM is able to discover word semantic relations even if words are not co-occurred within a specific tweet. With this method, HGTM successfully alleviates the sparsity problem. Our investigation illustrates that the user-contributed hashtags could serve as weakly-supervised information for topic modeling, and the relation
  • 2. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com between hashtags could reveal latent semantic relation between words. We evaluate the effectiveness of HGTM on tweet (hashtag) clustering and hashtag classification problems. Experiments on two real-world tweet data sets show that HGTM has strong capability to handle sparseness and noise problem in tweets. Furthermore, HGTM can discover more distinct and coherent topics than the state-of-the-art baselines. CONCLUSION: Uncovering topics within tweets has become a vital task for widespread content analysis and social media mining. Different from modeling normal text, tweet mining has suffered a great deal of sparseness and informality problems. In this work, we consider that users have provided hashtags as a powerful and valuable data source in the vast amount of tweets on the web. This paper presents HGTM that first introduces the hashtag relation graphs as weakly- supervised information for tweet semantic modeling. We demonstrate that hashtag graphs contain reliable information to bridge semantically-related words in sparse short texts. HGTM can enhance semantic relations between tweets and reduce noise at the same time. Compared to single document oriented topic models (e.g., LSA, LDA, ATM, TWTM, TWDA), HGTM has a better ability to capture semantic relations between words with or without co- occurrence by utilizing the wisdom of crowds from user-generated hashtags. The model provides a more robust solution for tweet modeling than aggregation strategies with traditional topic models. We also prove that LDA framework inherently can not benefit from hashtag graphs. We achieve
  • 3. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com significant improvement on the performance of content mining tasks, such as tweet clustering, hashtag clustering and hashtag classification. HGTM discovers more readable and distinguishable topics than the stat-of-the-art models as well. This paper shows one effective alternative of utilizing usercontributed hashtags for tweet topic modeling to handle both sparseness and noise in tweets. However, there are still many questions which need to be explored. For example, we would like to explore reasonable and effective ways of combining multimodal hash tag relations for tweet modelling and to model time REFERENCES [1] D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent Dirichlet Allocation,” the Journal of machine Learning research, vol. 3, pp. 993–1022, 2003 [2] S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman, “Indexing by latent semantic analysis,” JASIS, vol. 41, no. 6, pp. 391–407, 1990. [3] S. Vieweg, A. L. Hughes, K. Starbird, and L. Palen, “Microblogging during two natural hazards events: What Twitter may contribute to situational awareness,” in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, ser. CHI ’10. New York, NY, USA: ACM, 2010, pp. 1079– 1088.
  • 4. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com [4] Y. Chen, H. Amiri, Z. Li, and T.-S. Chua, “Emerging topic detection for organizations from microblogs,” in Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, ser. SIGIR ’13. New York, NY, USA: ACM, 2013, pp. 43– 52. [5] J. Chen, R. Nairn, L. Nelson, M. Bernstein, and E. Chi, “Short and tweet: Experiments on recommending content from information streams,” in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, ser. CHI ’10. New York, NY, USA: ACM, 2010, pp. 1185– 1194. [6] K. Tao, F. Abel, Q. Gao, and G.-J. Houben, “TUMS: Twitter-based user modeling service,” in The Semantic Web: ESWC 2011 Workshops, ser. Lecture Notes in Computer Science, R. Garca-Castro, D. Fensel, and G. Antoniou, Eds. Springer Berlin Heidelberg, 2012, vol. 7117, pp. 269–283. [7] A. Dong, R. Zhang, P. Kolari, J. Bai, F. Diaz, Y. Chang, Z. Zheng, and H. Zha, “Time is of the essence: Improving recency ranking using Twitter data,” in Proceedings of the 19th International Conference on World Wide Web, ser. WWW ’10. New York, NY, USA: ACM, 2010, pp. 331–340. [8] T. Hofmann, “Probabilistic latent semantic indexing,” in Proceedings of the 22Nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ser. SIGIR ’99. New York, NY, USA: ACM, 1999, pp. 50–57.
  • 5. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com [9] L. Hong and B. D. Davison, “Empical study of topic modeling in Twitter,” in Proceedings of the First Workshop on Social Media Analytics, ser. SOMA ’10. New York, NY, USA: ACM, 2010, pp. 80–88. [10] R. Mehrotra, S. Sanner, W. Buntine, and L. Xie, “Improving LDA topic models for microblogs via tweet pooling and automatic labeling,” in Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, ser. SIGIR ’13. New York, NY, USA: ACM, 2013, pp. 889–892.