SlideShare a Scribd company logo
Mining Heterogeneous Information Networks
Ashwin Pingali
CTO – Apps Consultants & Dzee Solutions
ashwin@appsconsultants.com
Agenda
Why Graphs
Graphs / Networks - History
Homogeneous Networks
Heterogeneous Networks
HINs – examples
Mining HINs - Examples
Other Case Studies – Forecasting / Recommendations
“Graph analysis is the true killer app for Big Data.”
“Forrester estimates that over 25% of enterprises will be
using graph databases by 2017.”
“Graph analysis is possibly the single most effective
competitive differentiator for organizations pursuing data-
driven operations and decisions after the design of data
capture.”
“By the end of 2018, 70% of leading organizations will have
one or more pilot or proof-of-concept efforts underway
utilizing graph databases.”
Why Graphs
Graphs / Networks - History
Is it possible to walk with a
route that crosses each bridge
exactly once, and return to
the starting point.
In 1736, Leonhard
Euler proved that it
was not possible.
A
B
C
D A
B
C
D1
2 3
4
5
6
7
1
4
2
3
5
6
7
Homogeneous Networks
C12
C11 C13
C14 C14
C1
C9 C7
C4
C5
C3
C10
C8
C6
Characteristics
• Single type of node
• Single link type
Examples
• Network of computers
• Network of people
Heterogeneous Networks
Characteristics
• Different types of nodes
• Different types of relationships
Examples
• Influencing the decision maker
Decision
maker
Sales rep
Employee
Consultant
Employee
Consultant
Sales rep
Sales rep
Employee
Best tool to represent heterogeneous networks
Heterogeneous Information Networks (HINs)
Example 1: Customer Ordering Information Network
Heterogeneous Information Networks (HINs)
Example 1: Customer Ordering Information Network
Heterogeneous Information Networks (HINs)
Example 1: Customer Ordering Information Network
Heterogeneous Information Networks (HINs)
Example 1: Customer Ordering Information Network
Mining HINs – Customer Order Mining
• Searching for Similar customers based on past order behavior (frequency, items
ordered)
• Classifying customers (clustering) and then ranking customers within each
classification
• Predicting Relationships – For example: recommending products that they might
be interested in.
ROI Case Studies
• Identify least profitable and most profitable transactions
• Predict problematic transactions before they are fulfilled
• Increase the revenue from customer base by targeted upsell.
Heterogeneous Information Networks (HINs)
Example 2: HealthCare Service Consumption Network
Heterogeneous Information Networks (HINs)
Example 2: HealthCare Service Consumption Network
Mining HINs – HealthCare Service Consumption
• Identifying similar customers based on their actual health care service usage
• Predicting health care service consumption over time
• Modeling Risk for different health care groups
Other Case Studies
• Hierarchical forecasting taking underlying components into account.
• Behavioral patterns of customers as networks evolving over time
• Using networks to calculate cycle times

More Related Content

PDF
Technology tipping points Big Data and Blockchain use case presentation
PPTX
How Blockchain Can Be Used In Big Data Analytics
PDF
Top 20 Big Data Tools 2019
DOCX
K nearest neighbor classification over semantically secure encrypted
PDF
K nearest neighbor classification over semantically secure encrypted relation...
PPTX
Blockchain: Background and Data61 Research Overview
PPTX
DN18 | Ocean Protocol – Empowering a Decentralized Marketplace for The New AI...
DOCX
K-NEAREST NEIGHBOR CLASSIFICATION OVER SEMANTICALLY SECURE ENCRYPTED RELATION...
Technology tipping points Big Data and Blockchain use case presentation
How Blockchain Can Be Used In Big Data Analytics
Top 20 Big Data Tools 2019
K nearest neighbor classification over semantically secure encrypted
K nearest neighbor classification over semantically secure encrypted relation...
Blockchain: Background and Data61 Research Overview
DN18 | Ocean Protocol – Empowering a Decentralized Marketplace for The New AI...
K-NEAREST NEIGHBOR CLASSIFICATION OVER SEMANTICALLY SECURE ENCRYPTED RELATION...

What's hot (20)

PDF
K nearest neighbor classification over semantically secure encrypted relation...
PDF
Applying Blockchain Technology for Digital Transformation
PPTX
Software Architecture and Model-Driven Engineering for Blockchain
PDF
Confidential Computing - Analysing Data Without Seeing Data
PDF
Behavioral Analytics and Blockchain Applications – a Reliability View. Keynot...
PDF
Up-skilling India in Blockchain
PPT
Blockchain Technology and Its Application in Artificial Intelligence and Mach...
PDF
DYNAMIC AND PUBLIC AUDITING WITH FAIR ARBITRATION FOR CLOUD DATA
PDF
Towards a Post-Modern Hash Chain Future
PDF
An introduction for blockcahin
DOCX
Privacy preserving multi-keyword ranked search over encrypted cloud data
PPTX
Cybersecurity aspects of blockchain and cryptocurrency
PDF
Dynamic and public auditing with fair arbitration for cloud data
PDF
Data in action in a smart way
PDF
Legal Engineering
PDF
Decentralised AI through Distributed Ledger Technologies
PPT
IT Controls Presentation
PPTX
marketAnalyticsFinal
PDF
IRJET - Healthcare Data Storage using Blockchain
PDF
Aplicación de la tecnología blockchain en sistemas energéticos sostenibles
K nearest neighbor classification over semantically secure encrypted relation...
Applying Blockchain Technology for Digital Transformation
Software Architecture and Model-Driven Engineering for Blockchain
Confidential Computing - Analysing Data Without Seeing Data
Behavioral Analytics and Blockchain Applications – a Reliability View. Keynot...
Up-skilling India in Blockchain
Blockchain Technology and Its Application in Artificial Intelligence and Mach...
DYNAMIC AND PUBLIC AUDITING WITH FAIR ARBITRATION FOR CLOUD DATA
Towards a Post-Modern Hash Chain Future
An introduction for blockcahin
Privacy preserving multi-keyword ranked search over encrypted cloud data
Cybersecurity aspects of blockchain and cryptocurrency
Dynamic and public auditing with fair arbitration for cloud data
Data in action in a smart way
Legal Engineering
Decentralised AI through Distributed Ledger Technologies
IT Controls Presentation
marketAnalyticsFinal
IRJET - Healthcare Data Storage using Blockchain
Aplicación de la tecnología blockchain en sistemas energéticos sostenibles
Ad

Similar to Mining heterogeneous information networks (20)

PPTX
Predictive Analytics: Context and Use Cases
PDF
Big data-analytics-changing-way-organizations-conducting-business
PPTX
Big Data Forum - Phoenix
PPTX
Data warehouse and data mining
PPT
1328cvkdlgkdgjfdkjgjdfgdfkgdflgkgdfglkjgld8679 - Copy.ppt
PPTX
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
PDF
uae views on big data
PPT
PPT
Data Mining Xuequn Shang NorthWestern Polytechnical University
PDF
Introduction to Data Analytics and data analytics life cycle
PPTX
Types of Blockchain, AI and its future
PDF
Patterns of Data-Infused Business Model Innovation
PPTX
How Startups can leverage big data?
PDF
Big data in marketing at harvard business club nick1 june 15 2013
PPTX
Big Data World
PPTX
Liberating data power of APIs
PDF
KIT-601 Lecture Notes-UNIT-1.pdf
PPTX
Trends in data analytics
PDF
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
Predictive Analytics: Context and Use Cases
Big data-analytics-changing-way-organizations-conducting-business
Big Data Forum - Phoenix
Data warehouse and data mining
1328cvkdlgkdgjfdkjgjdfgdfkgdflgkgdfglkjgld8679 - Copy.ppt
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
uae views on big data
Data Mining Xuequn Shang NorthWestern Polytechnical University
Introduction to Data Analytics and data analytics life cycle
Types of Blockchain, AI and its future
Patterns of Data-Infused Business Model Innovation
How Startups can leverage big data?
Big data in marketing at harvard business club nick1 june 15 2013
Big Data World
Liberating data power of APIs
KIT-601 Lecture Notes-UNIT-1.pdf
Trends in data analytics
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
Ad

More from DZee Solutions (9)

PPTX
Block chain as a graph
PDF
2011 open world oracle update_sales_and_service_contracts
PDF
2010 open world r12 service contracts
PDF
2009 open world chooing the right subscription contract
PDF
2009 08 25_business value of upgrading to oracle e-business suite contracts r...
PDF
2008 11 18_contracts _oaug-sig
PDF
2010 oow contracts_sig
PDF
2009 Oracle Open World SIG Contracts Update
PPTX
Collaborate 12 : Contracts SIG Meeting
Block chain as a graph
2011 open world oracle update_sales_and_service_contracts
2010 open world r12 service contracts
2009 open world chooing the right subscription contract
2009 08 25_business value of upgrading to oracle e-business suite contracts r...
2008 11 18_contracts _oaug-sig
2010 oow contracts_sig
2009 Oracle Open World SIG Contracts Update
Collaborate 12 : Contracts SIG Meeting

Recently uploaded (20)

PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
cuic standard and advanced reporting.pdf
PPTX
Big Data Technologies - Introduction.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Machine learning based COVID-19 study performance prediction
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Programs and apps: productivity, graphics, security and other tools
Network Security Unit 5.pdf for BCA BBA.
Unlocking AI with Model Context Protocol (MCP)
Mobile App Security Testing_ A Comprehensive Guide.pdf
cuic standard and advanced reporting.pdf
Big Data Technologies - Introduction.pptx
The AUB Centre for AI in Media Proposal.docx
NewMind AI Weekly Chronicles - August'25 Week I
The Rise and Fall of 3GPP – Time for a Sabbatical?
Diabetes mellitus diagnosis method based random forest with bat algorithm
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Review of recent advances in non-invasive hemoglobin estimation
Machine learning based COVID-19 study performance prediction
Agricultural_Statistics_at_a_Glance_2022_0.pdf

Mining heterogeneous information networks

  • 1. Mining Heterogeneous Information Networks Ashwin Pingali CTO – Apps Consultants & Dzee Solutions ashwin@appsconsultants.com
  • 2. Agenda Why Graphs Graphs / Networks - History Homogeneous Networks Heterogeneous Networks HINs – examples Mining HINs - Examples Other Case Studies – Forecasting / Recommendations
  • 3. “Graph analysis is the true killer app for Big Data.” “Forrester estimates that over 25% of enterprises will be using graph databases by 2017.” “Graph analysis is possibly the single most effective competitive differentiator for organizations pursuing data- driven operations and decisions after the design of data capture.” “By the end of 2018, 70% of leading organizations will have one or more pilot or proof-of-concept efforts underway utilizing graph databases.” Why Graphs
  • 4. Graphs / Networks - History Is it possible to walk with a route that crosses each bridge exactly once, and return to the starting point. In 1736, Leonhard Euler proved that it was not possible. A B C D A B C D1 2 3 4 5 6 7 1 4 2 3 5 6 7
  • 5. Homogeneous Networks C12 C11 C13 C14 C14 C1 C9 C7 C4 C5 C3 C10 C8 C6 Characteristics • Single type of node • Single link type Examples • Network of computers • Network of people
  • 6. Heterogeneous Networks Characteristics • Different types of nodes • Different types of relationships Examples • Influencing the decision maker Decision maker Sales rep Employee Consultant Employee Consultant Sales rep Sales rep Employee Best tool to represent heterogeneous networks
  • 7. Heterogeneous Information Networks (HINs) Example 1: Customer Ordering Information Network
  • 8. Heterogeneous Information Networks (HINs) Example 1: Customer Ordering Information Network
  • 9. Heterogeneous Information Networks (HINs) Example 1: Customer Ordering Information Network
  • 10. Heterogeneous Information Networks (HINs) Example 1: Customer Ordering Information Network
  • 11. Mining HINs – Customer Order Mining • Searching for Similar customers based on past order behavior (frequency, items ordered) • Classifying customers (clustering) and then ranking customers within each classification • Predicting Relationships – For example: recommending products that they might be interested in. ROI Case Studies • Identify least profitable and most profitable transactions • Predict problematic transactions before they are fulfilled • Increase the revenue from customer base by targeted upsell.
  • 12. Heterogeneous Information Networks (HINs) Example 2: HealthCare Service Consumption Network
  • 13. Heterogeneous Information Networks (HINs) Example 2: HealthCare Service Consumption Network
  • 14. Mining HINs – HealthCare Service Consumption • Identifying similar customers based on their actual health care service usage • Predicting health care service consumption over time • Modeling Risk for different health care groups
  • 15. Other Case Studies • Hierarchical forecasting taking underlying components into account. • Behavioral patterns of customers as networks evolving over time • Using networks to calculate cycle times