SlideShare a Scribd company logo
DISTRIBUTED MACHINE LEARNING EXAMPLES
STANLEY WANG
SOLUTION ARCHITECT, TECH LEAD
@SWANG68
http://www.linkedin.com/in/stanley-wang-a2b143b
Topic Modeling
• Topical categorization of blogs, documents or other objects that can be tagged with
text, improves the experience for end users;
• Discover Sets of
Topics from Large
Unstructured
Collections of
documents;
• Annotate
documents with
topic;
• Utilize Annotation
to Index, Search
and Classify on
documents;
The Intuitions behind LDA
• Latent Dirichlet Allocation (LDA) is an unsupervised, probabilistic, text
clustering algorithm. LDA defines a generative model that can be used
to model how documents are generated given a set of topics and the
words in the topics;
Graphical Model for LDA
• Topic-based text
classification;
• Topic modeling can be seen as
a pre-processing step before
applying supervised learning
methods, such as
Collaborative Filtering;
• Finding patterns in genetic
data, images, and social
networks;
Real Inference with LDA
• A 100-topic LDA model was fitted to 17,000 articles from the Science journal;
• At right are the top 15 most frequent words from the most frequent topics;
• At left are the inferred topic proportions for the example article from previous slide;
Topic Modeling and Analysis
Distributed machine learning examples
What is Community Intuition?
In social world, community is a collection of users that are more closely
related to each other than the rest of the network. The relation between users
can be amount of interaction, similar interest, geographical factors etc.
Why Detect Social Communities?
• Behavior Analysis
• Location-based Interaction Analysis
• Recommender Systems Development
• Link Prediction
• Customer Interaction and Analysis
• Media & Content Analysis
• Security
• Social Studies
Community And Applications
Structure Metrics
Centrality Metrics
Metrics of Graph Analysis
Graph Modularity
Graph Modularity Computation
Graph Modularity Examples
Diverse of Centrality
Social Tag Clustering
Social Tag Clustering - Examples

More Related Content

PPTX
Information retrieval system!
PDF
CS6007 information retrieval - 5 units notes
PPTX
Review of search and retrieval strategies
PPTX
Information retrieval introduction
PPTX
Functions of information retrival system(1)
PPT
Information retrieval system
PPT
Information retrieval
PPTX
WEB BASED INFORMATION RETRIEVAL SYSTEM
Information retrieval system!
CS6007 information retrieval - 5 units notes
Review of search and retrieval strategies
Information retrieval introduction
Functions of information retrival system(1)
Information retrieval system
Information retrieval
WEB BASED INFORMATION RETRIEVAL SYSTEM

What's hot (20)

PPTX
INFORMATION RETRIEVAL Anandraj.L
PDF
Research on ontology based information retrieval techniques
PPT
Eco4132 Spring 2010
PPT
Vellino presentationtocisti
PDF
Detailed Information Literacy Rubric
PDF
Basic Information Literacy Rubric
PPTX
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
PPTX
Abstract and i ndexing
PPT
Index nominum to ontology
PPTX
Subject Indexing & Techniques
PDF
Information Retrieval Methods in Libraries and Information Centers
PDF
Linked Open Data in the World of Patents
PDF
Exploring and accessing knowledge in Research
PPTX
Info 2402 irt-chapter_2
PPTX
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
PPTX
PPTX
Digital humanities
PDF
Evaluating Library Capacity to Manage Research Data
ODP
Week10
PDF
Information Retrieval Fundamentals - An introduction
INFORMATION RETRIEVAL Anandraj.L
Research on ontology based information retrieval techniques
Eco4132 Spring 2010
Vellino presentationtocisti
Detailed Information Literacy Rubric
Basic Information Literacy Rubric
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Abstract and i ndexing
Index nominum to ontology
Subject Indexing & Techniques
Information Retrieval Methods in Libraries and Information Centers
Linked Open Data in the World of Patents
Exploring and accessing knowledge in Research
Info 2402 irt-chapter_2
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Digital humanities
Evaluating Library Capacity to Manage Research Data
Week10
Information Retrieval Fundamentals - An introduction
Ad

Viewers also liked (11)

PPTX
DCDataFest - Text mining and machine learning
PDF
Terascale Learning
ODP
Challenges in Large Scale Machine Learning
PDF
Neural Networks and Deep Learning for Physicists
PDF
Using Machine Learning to aid Journalism at the New York Times
PDF
Distributed machine learning
PDF
H2O World - Consensus Optimization and Machine Learning - Stephen Boyd
PDF
Community detection in graphs
PDF
NIPS2013読み会: More Effective Distributed ML via a Stale Synchronous Parallel P...
PPTX
Lessons from 2MM machine learning models
PDF
Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O
DCDataFest - Text mining and machine learning
Terascale Learning
Challenges in Large Scale Machine Learning
Neural Networks and Deep Learning for Physicists
Using Machine Learning to aid Journalism at the New York Times
Distributed machine learning
H2O World - Consensus Optimization and Machine Learning - Stephen Boyd
Community detection in graphs
NIPS2013読み会: More Effective Distributed ML via a Stale Synchronous Parallel P...
Lessons from 2MM machine learning models
Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O
Ad

Similar to Distributed machine learning examples (20)

PDF
Topic-oriented writing at McAfee
PPT
SRM, AI research project, science schot.ppt
PDF
Federated to library discovery platfoms
PDF
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
PDF
Understanding Information Architecture
PDF
Hansen Metadata for Institutional Repositories
PDF
Text Analytics in Enterprise Search
PDF
Text Analytics in Enterprise Search - Daniel Ling
PPS
Process Re-engineering for Topic Based Authoring
PPTX
Coding Your Results
PPTX
The presentation explains the responsible use of AI in health research
PDF
A FAIR Approach to Publishing and Sharing Machine Learning Models
PPTX
Understanding Content Analysis in Qualitative Research.pptx
PDF
NCompass Live: Reading for Justice: A Database for YA & Youth Literature
PPTX
BIS1100 Nov 2017
PDF
Building an Innovative Learning Ecosystem at Scale with Graph Technologies
PPTX
PRANSHU_FINAL_PgjjcfgcghjjjjfgnnjPT.pptx
PPTX
2013 DataCite Summer Meeting - Elsevier's program to support research data (H...
PPTX
Data analysis – using computers
Topic-oriented writing at McAfee
SRM, AI research project, science schot.ppt
Federated to library discovery platfoms
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Understanding Information Architecture
Hansen Metadata for Institutional Repositories
Text Analytics in Enterprise Search
Text Analytics in Enterprise Search - Daniel Ling
Process Re-engineering for Topic Based Authoring
Coding Your Results
The presentation explains the responsible use of AI in health research
A FAIR Approach to Publishing and Sharing Machine Learning Models
Understanding Content Analysis in Qualitative Research.pptx
NCompass Live: Reading for Justice: A Database for YA & Youth Literature
BIS1100 Nov 2017
Building an Innovative Learning Ecosystem at Scale with Graph Technologies
PRANSHU_FINAL_PgjjcfgcghjjjjfgnnjPT.pptx
2013 DataCite Summer Meeting - Elsevier's program to support research data (H...
Data analysis – using computers

More from Stanley Wang (14)

PDF
Sparql a simple knowledge query
PDF
Ontologies and semantic web
PDF
Ontology model and owl
PDF
Resource description framework
PDF
Semantic web technology
PDF
Next generation big data bi
PDF
Overview of recommender system
PDF
Data analytics as a service
PDF
Fundamental of deep learning
PDF
Graph analytic and machine learning
PDF
Big data analytic market opportunity
PDF
A sdn based application aware and network provisioning
PDF
Hadoop ecosystem
PDF
Hadoop ecosystem
Sparql a simple knowledge query
Ontologies and semantic web
Ontology model and owl
Resource description framework
Semantic web technology
Next generation big data bi
Overview of recommender system
Data analytics as a service
Fundamental of deep learning
Graph analytic and machine learning
Big data analytic market opportunity
A sdn based application aware and network provisioning
Hadoop ecosystem
Hadoop ecosystem

Recently uploaded (20)

PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Cloud computing and distributed systems.
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
KodekX | Application Modernization Development
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
cuic standard and advanced reporting.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
MIND Revenue Release Quarter 2 2025 Press Release
Spectral efficient network and resource selection model in 5G networks
Cloud computing and distributed systems.
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Diabetes mellitus diagnosis method based random forest with bat algorithm
Review of recent advances in non-invasive hemoglobin estimation
Mobile App Security Testing_ A Comprehensive Guide.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
The AUB Centre for AI in Media Proposal.docx
KodekX | Application Modernization Development
Programs and apps: productivity, graphics, security and other tools
cuic standard and advanced reporting.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Digital-Transformation-Roadmap-for-Companies.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Dropbox Q2 2025 Financial Results & Investor Presentation

Distributed machine learning examples

  • 1. DISTRIBUTED MACHINE LEARNING EXAMPLES STANLEY WANG SOLUTION ARCHITECT, TECH LEAD @SWANG68 http://www.linkedin.com/in/stanley-wang-a2b143b
  • 2. Topic Modeling • Topical categorization of blogs, documents or other objects that can be tagged with text, improves the experience for end users; • Discover Sets of Topics from Large Unstructured Collections of documents; • Annotate documents with topic; • Utilize Annotation to Index, Search and Classify on documents;
  • 3. The Intuitions behind LDA • Latent Dirichlet Allocation (LDA) is an unsupervised, probabilistic, text clustering algorithm. LDA defines a generative model that can be used to model how documents are generated given a set of topics and the words in the topics;
  • 4. Graphical Model for LDA • Topic-based text classification; • Topic modeling can be seen as a pre-processing step before applying supervised learning methods, such as Collaborative Filtering; • Finding patterns in genetic data, images, and social networks;
  • 5. Real Inference with LDA • A 100-topic LDA model was fitted to 17,000 articles from the Science journal; • At right are the top 15 most frequent words from the most frequent topics; • At left are the inferred topic proportions for the example article from previous slide;
  • 8. What is Community Intuition? In social world, community is a collection of users that are more closely related to each other than the rest of the network. The relation between users can be amount of interaction, similar interest, geographical factors etc.
  • 9. Why Detect Social Communities? • Behavior Analysis • Location-based Interaction Analysis • Recommender Systems Development • Link Prediction • Customer Interaction and Analysis • Media & Content Analysis • Security • Social Studies
  • 13. Metrics of Graph Analysis
  • 19. Social Tag Clustering - Examples