SlideShare a Scribd company logo
3
Most read
NetSpam: a Network-based Spam Detection Framework for
Reviews in Online Social Media
Abstract—Nowadays, a big part of people rely on available content in social media in their
decisions (e.g. reviews and feedback on a topic or product). The possibility that anybody can
leave a review provides a golden opportunity for spammers to write spam reviews about products
and services for different interests. Identifying these spammers and the spam content is a hot
topic of research and although a considerable number of studies have been done recently toward
this end, but so far the methodologies put forth still barely detect spam reviews, and none of
them show the importance of each extracted feature type. In this study, we propose a novel
framework, named NetSpam, which utilizes spam features for modeling review datasets as
heterogeneous information networks to map spam detection procedure into a classification
problem in such networks. Using the importance of spam features help us to obtain better results
in terms of different metrics experimented on real-world review datasets from Yelp and Amazon
websites. The results show that NetSpam outperforms the existing methods and among four
categories of features; including review-behavioral, user-behavioral, review linguistic, user-
linguistic, the first type of features performs better than the other categories.
Existing work
In Existing work, the work only depend on the detect the spam reviews and spammers. None of
them show the importance of each extracted feature type. On the other hand, a considerable
amount of literature has been published on the techniques used to identify spam and spammers as
well as different type of analysis on this topic. These techniques can be classified into different
categories; some using linguistic patterns in text which are mostly based on bigram, and
unigram, others are based on behavioral patterns that rely on features extracted from patterns in
users’ behavior which are mostly metadata based.
Disadvantages:
 These work not enough to classify the spam network.
 Lack of work to detect spam features.
Proposed System
We propose NetSpam framework that is a novel network based approach which models review
networks as heterogeneous information networks. The general concept of our proposed
framework is to model a given review dataset as a Heterogeneous Information Network (HIN)
and to map the problem of spam detection into a HIN classification problem. In particular, we
model review dataset as a HIN in which reviews are connected through different node types
(such as features and users). A weighting concept is then employed to calculate each feature’s
importance (or weight). These weights are utilized to calculate the final labels for reviews using
both unsupervised and supervised approaches.
Advantages
 Importance of spam features help us to obtain better results in terms of different metrics
experimented on real-world review datasets
 Initiating the work to detect spam features.
SYSTEM REQUIREMENTS
HARDWARE REQUIREMENTS:
Hardware : Pentium
Speed : 1.1 GHz
RAM : 1GB
Hard Disk : 20 GB
SOFTWARE REQUIREMENTS:
Operating System : Windows Family
Technology : Java and J2EE
Web Technologies : Html, JavaScript, CSS
Web Server : Apache Tomcat 7.0/8.0
Database : My SQL 5.5 or Higher
UML's : StarUml
Java Version : JDK 1.7 or 1.8
Implemented by
Development team : Cloud Technologies
Website : http://www.cloudstechnologies.in/
Contact : 8121953811, 040-65511811

More Related Content

PPT
CYBER-CRIME PRESENTATION.ppt
PPTX
Security in e commerce
PPT
3d password ppt
PPTX
Hacking
DOC
Biometrics Technology Seminar Report.
PPT
Digital Signature
PPTX
Email Spoofing.pptx
PPTX
Encryption ppt
CYBER-CRIME PRESENTATION.ppt
Security in e commerce
3d password ppt
Hacking
Biometrics Technology Seminar Report.
Digital Signature
Email Spoofing.pptx
Encryption ppt

What's hot (20)

DOCX
farming assistant web service
PPT
Network security cryptography ppt
PPTX
Atm using fingerprint
PPSX
Unit 2
PDF
User Interface Design Module 5 screen based controls
PPTX
IP Spoofing
PPTX
digital tokens based on E-payments
PPTX
PHISHING PROJECT REPORT
PPTX
Consumer Oriented Application, Mercantile process and Mercantile models
PPTX
Credit Card Fraud Detection
PPTX
PPTX
Tools and methods used in cybercrime
PDF
PPTX
Detection of phishing websites
PPTX
Client server security threats
PPTX
Different types of Symmetric key Cryptography
PPT
Digital Signature
PPT
captcha.ppt
DOCX
Project report on (atm MAnagment system)
farming assistant web service
Network security cryptography ppt
Atm using fingerprint
Unit 2
User Interface Design Module 5 screen based controls
IP Spoofing
digital tokens based on E-payments
PHISHING PROJECT REPORT
Consumer Oriented Application, Mercantile process and Mercantile models
Credit Card Fraud Detection
Tools and methods used in cybercrime
Detection of phishing websites
Client server security threats
Different types of Symmetric key Cryptography
Digital Signature
captcha.ppt
Project report on (atm MAnagment system)
Ad

Similar to Net spam a network based spam detection framework for reviews in online social media (20)

PDF
Netspam: An Efficient Approach to Prevent Spam Messages using Support Vector ...
DOCX
VTU final year project report Main
PDF
IRJET-A Novel Technic to Notice Spam Reviews on e-Shopping
PDF
Analysis on Recommended System for Web Information Retrieval Using HMM
PDF
Improved spambase dataset prediction using svm rbf kernel with adaptive boost
PPTX
presentation
PDF
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
PDF
Survey in Online Social Media Skelton by Network based Spam
PPTX
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
PDF
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
PDF
International Journal of Engineering Research and Development
DOCX
Report of Previous Project by Yifan Guo
PPTX
Recommender System _Module 1_Introduction to Recommender System.pptx
PDF
IRJET- Improving Performance of Fake Reviews Detection in Online Review’s usi...
PDF
Fuzzy Logic Based Recommender System
PDF
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
PDF
Integrated approach to detect spam in social media networks using hybrid feat...
PDF
Classification Methods for Spam Detection in Online Social Network
PPTX
finbg dlf cm DH kf ki dfbjjhfsckhvkhal review ppt.pptx
PDF
An Approach for Malicious Spam Detection in Email with Comparison of Differen...
Netspam: An Efficient Approach to Prevent Spam Messages using Support Vector ...
VTU final year project report Main
IRJET-A Novel Technic to Notice Spam Reviews on e-Shopping
Analysis on Recommended System for Web Information Retrieval Using HMM
Improved spambase dataset prediction using svm rbf kernel with adaptive boost
presentation
Unification Algorithm in Hefty Iterative Multi-tier Classifiers for Gigantic ...
Survey in Online Social Media Skelton by Network based Spam
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
International Journal of Engineering Research and Development
Report of Previous Project by Yifan Guo
Recommender System _Module 1_Introduction to Recommender System.pptx
IRJET- Improving Performance of Fake Reviews Detection in Online Review’s usi...
Fuzzy Logic Based Recommender System
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Integrated approach to detect spam in social media networks using hybrid feat...
Classification Methods for Spam Detection in Online Social Network
finbg dlf cm DH kf ki dfbjjhfsckhvkhal review ppt.pptx
An Approach for Malicious Spam Detection in Email with Comparison of Differen...
Ad

More from CloudTechnologies (20)

DOCX
PublicEduChain A Framework for Sharing Student-Owned Educational Data on Publ...
DOCX
Blockchain Based Logging to Defeat Malicious Insiders The Case of Remote Heal...
DOCX
Enhancing Personalized Learning Experiences by Leveraging Deep Learning for C...
DOCX
Machine Learning Classification to predict water purity based on Viruses and ...
DOCX
iot based safety and health monitoring for construction workers
DOCX
Intelligent neonatal monitoring system based on android application using mul...
DOCX
An iot based smart garden with weather station system
DOCX
A deep learning facial expression recognition based scoring system for restau...
DOCX
Diabetes prediction using different machine learning approaches
DOCX
machine learning based predictive analytics of student academic performance i...
DOCX
Image based estimation of real food size for accurate food calorie estimation
DOCX
Network intrusion detection using supervised machine learning technique with ...
DOCX
Io t projects
DOCX
Cloud computing projects
DOCX
Data mining projects
DOCX
Python IEEE 2019 Projects List
DOCX
Machine learning projects
DOCX
Raspberry Pi based voice-operated personal assistant (Neobot)
DOCX
Automation in Agriculture and IoT
DOCX
Gas Leakage Detection Based on IOT
PublicEduChain A Framework for Sharing Student-Owned Educational Data on Publ...
Blockchain Based Logging to Defeat Malicious Insiders The Case of Remote Heal...
Enhancing Personalized Learning Experiences by Leveraging Deep Learning for C...
Machine Learning Classification to predict water purity based on Viruses and ...
iot based safety and health monitoring for construction workers
Intelligent neonatal monitoring system based on android application using mul...
An iot based smart garden with weather station system
A deep learning facial expression recognition based scoring system for restau...
Diabetes prediction using different machine learning approaches
machine learning based predictive analytics of student academic performance i...
Image based estimation of real food size for accurate food calorie estimation
Network intrusion detection using supervised machine learning technique with ...
Io t projects
Cloud computing projects
Data mining projects
Python IEEE 2019 Projects List
Machine learning projects
Raspberry Pi based voice-operated personal assistant (Neobot)
Automation in Agriculture and IoT
Gas Leakage Detection Based on IOT

Recently uploaded (20)

PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Pre independence Education in Inndia.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
GDM (1) (1).pptx small presentation for students
PDF
Classroom Observation Tools for Teachers
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
Lesson notes of climatology university.
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
Insiders guide to clinical Medicine.pdf
PDF
Sports Quiz easy sports quiz sports quiz
PPTX
Pharma ospi slides which help in ospi learning
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
master seminar digital applications in india
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
VCE English Exam - Section C Student Revision Booklet
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
Pre independence Education in Inndia.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
2.FourierTransform-ShortQuestionswithAnswers.pdf
GDM (1) (1).pptx small presentation for students
Classroom Observation Tools for Teachers
human mycosis Human fungal infections are called human mycosis..pptx
Lesson notes of climatology university.
O5-L3 Freight Transport Ops (International) V1.pdf
Insiders guide to clinical Medicine.pdf
Sports Quiz easy sports quiz sports quiz
Pharma ospi slides which help in ospi learning
Abdominal Access Techniques with Prof. Dr. R K Mishra
master seminar digital applications in india
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf

Net spam a network based spam detection framework for reviews in online social media

  • 1. NetSpam: a Network-based Spam Detection Framework for Reviews in Online Social Media Abstract—Nowadays, a big part of people rely on available content in social media in their decisions (e.g. reviews and feedback on a topic or product). The possibility that anybody can leave a review provides a golden opportunity for spammers to write spam reviews about products and services for different interests. Identifying these spammers and the spam content is a hot topic of research and although a considerable number of studies have been done recently toward this end, but so far the methodologies put forth still barely detect spam reviews, and none of them show the importance of each extracted feature type. In this study, we propose a novel framework, named NetSpam, which utilizes spam features for modeling review datasets as heterogeneous information networks to map spam detection procedure into a classification problem in such networks. Using the importance of spam features help us to obtain better results in terms of different metrics experimented on real-world review datasets from Yelp and Amazon websites. The results show that NetSpam outperforms the existing methods and among four categories of features; including review-behavioral, user-behavioral, review linguistic, user- linguistic, the first type of features performs better than the other categories. Existing work In Existing work, the work only depend on the detect the spam reviews and spammers. None of them show the importance of each extracted feature type. On the other hand, a considerable amount of literature has been published on the techniques used to identify spam and spammers as well as different type of analysis on this topic. These techniques can be classified into different categories; some using linguistic patterns in text which are mostly based on bigram, and unigram, others are based on behavioral patterns that rely on features extracted from patterns in users’ behavior which are mostly metadata based. Disadvantages:  These work not enough to classify the spam network.  Lack of work to detect spam features.
  • 2. Proposed System We propose NetSpam framework that is a novel network based approach which models review networks as heterogeneous information networks. The general concept of our proposed framework is to model a given review dataset as a Heterogeneous Information Network (HIN) and to map the problem of spam detection into a HIN classification problem. In particular, we model review dataset as a HIN in which reviews are connected through different node types (such as features and users). A weighting concept is then employed to calculate each feature’s importance (or weight). These weights are utilized to calculate the final labels for reviews using both unsupervised and supervised approaches. Advantages  Importance of spam features help us to obtain better results in terms of different metrics experimented on real-world review datasets  Initiating the work to detect spam features. SYSTEM REQUIREMENTS HARDWARE REQUIREMENTS: Hardware : Pentium Speed : 1.1 GHz RAM : 1GB Hard Disk : 20 GB SOFTWARE REQUIREMENTS: Operating System : Windows Family Technology : Java and J2EE
  • 3. Web Technologies : Html, JavaScript, CSS Web Server : Apache Tomcat 7.0/8.0 Database : My SQL 5.5 or Higher UML's : StarUml Java Version : JDK 1.7 or 1.8 Implemented by Development team : Cloud Technologies Website : http://www.cloudstechnologies.in/ Contact : 8121953811, 040-65511811