Spam User Detection
Andy, Len, Petertc
Agenda
 • Machine Learning Tools
   • AWS Machine Learning
   • Azure Machine Learning
 • Spam Detection Algorithm
   • Detect spam users by time frequency.
   • Detect spam users by article correlation.
Machine Learning Tools
AWS Machine Learning
Azure Machine Learning
 https://manage.windowsazure.com
Spam Detection Algorithm
Detect spam users by time frequency.
[Figures: 500,000 actions; 1,000,000 actions; 3,000,000 actions]
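The slides do not spell out how time frequency is measured, so the following is only a minimal sketch of one plausible reading: count each user's actions inside a sliding time window and flag users who exceed a threshold. The function name `flag_by_time_frequency`, the `(user_id, timestamp)` input shape, and both default thresholds are assumptions made for illustration, not values from the talk.

```python
from collections import defaultdict


def flag_by_time_frequency(actions, window_seconds=60, max_actions=30):
    """Return the set of user ids with more than `max_actions` actions
    inside any window of `window_seconds`.

    `actions` is an iterable of (user_id, unix_timestamp) pairs.
    Both thresholds are hypothetical and would need tuning on real data.
    """
    per_user = defaultdict(list)
    for user_id, timestamp in actions:
        per_user[user_id].append(timestamp)

    flagged = set()
    for user_id, stamps in per_user.items():
        stamps.sort()
        start = 0  # index of the oldest action still inside the window
        for end, timestamp in enumerate(stamps):
            # Shrink the window until it spans less than window_seconds.
            while timestamp - stamps[start] >= window_seconds:
                start += 1
            if end - start + 1 > max_actions:
                flagged.add(user_id)
                break
    return flagged


# Hypothetical data: "bot" acts once per second, "human" once per ten minutes.
sample = [("bot", t) for t in range(50)] + [("human", t * 600) for t in range(5)]
flagged_users = flag_by_time_frequency(sample)
```

A sliding window is used here instead of fixed buckets so that a burst straddling a bucket boundary is still counted as one burst.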
Detect spam users by article correlation.
[Figures: labels "Spam User", "High correlation", "Title_count", "Article_count"]
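The slides name Title_count and Article_count but do not define them. As a rough sketch, assume Article_count is the number of articles a user posted and Title_count is the number of distinct titles among them; a user who posts many articles under very few titles is then treated as highly correlated. The function name `flag_by_title_repetition`, the input shape, and the thresholds are hypothetical.

```python
from collections import defaultdict


def flag_by_title_repetition(articles, min_articles=5, max_unique_ratio=0.5):
    """Return {user_id: (title_count, article_count)} for users whose
    articles mostly repeat the same titles.

    `articles` is an iterable of (user_id, title) pairs.
    title_count / article_count close to 0 means heavy repetition.
    Both thresholds are assumptions, not values from the slides.
    """
    article_count = defaultdict(int)
    titles = defaultdict(set)
    for user_id, title in articles:
        article_count[user_id] += 1
        # Normalize lightly so trivial case/whitespace changes still match.
        titles[user_id].add(title.strip().lower())

    flagged = {}
    for user_id, n_articles in article_count.items():
        n_titles = len(titles[user_id])
        if n_articles >= min_articles and n_titles / n_articles <= max_unique_ratio:
            flagged[user_id] = (n_titles, n_articles)
    return flagged


# Hypothetical data: "spammer" repeats two titles, "writer" never repeats.
sample_articles = (
    [("spammer", "Cheap watches"), ("spammer", "cheap watches ")] * 4
    + [("spammer", "Win a prize")] * 2
    + [("writer", f"Trip report {i}") for i in range(6)]
)
suspects = flag_by_title_repetition(sample_articles)
```

Exact title matching is the simplest stand-in for correlation; a similarity measure over article text could replace it without changing the overall shape of the check.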
Thanks!

Spam user detection report

Editor's Notes

  • #5, #6: Content / tag taken out. Data loss (article -> user): 20000 -> 2000. This is the result.