SlideShare a Scribd company logo
Personal Social Network —  A New Approach to Personal Network Search based on Information Extraction   Jie Tang, Mingcai Hong, Jing Zhang, Bangyong Liang, and Juanzi Li   Knowledge Engineering Group, Department of Computer Science and Technology, Tsinghua University Sep. 5 th , 2006
Personal Social Network Personal social network is an important research area. A person usually has different types of information Personal profile (including portrait, homepage, position, affiliation, publications, and documents)  Contact information (including address, email, telephone, and fax number) Friends Unfortunately, the information is often hidden in heterogeneous and distributed web pages
Our Approach Personal Social Network = Building  +  Search   +  Mining Doc collection Annotation  Integration  Person search Publication search Association search Expert finding Research interesting finding
Processing Flow Submitted to Returned pages Fed to Extracting  and saving to Ontology base Query Classification Model
Building the Personal Network >400,000 Persons >700,000 Publications
Annotation using SVMs Personal profile: e.g. image, affiliation, etc. Contact information:  fax, email, phone, etc. Start position model End position model Identified info. Features sets
Person Search Search for a person using the name or other information, e.g. affiliation
Publication Search Searching for a publication using IR model
Publication Online-View
Association Search Finding associations between persons - high efficiency - Top-K associations  Usage: - to find a partner - to find a person with same interests
Expert Finding Finding experts on a topic
Research Interest Finding Finding research interests for a person
Homepage: http:// keg.cs.tsinghua.edu.cn/persons/tj Thank You

More Related Content

PDF
Foundation Directory: Finding Grants to Support Your Research
PPT
Visualizing data1
PDF
Scholarly Communication: Tools and Strategies for Learning and Sharing in the...
PDF
Navigating the mechanics of research
PDF
Flyer for Dr. Chirag Shah Speaker
PDF
Overbeeke
PPT
Humanities info mgmt diss
PDF
Pressforward Presentation at the Western Humanities Alliance Annual Meeting, ...
Foundation Directory: Finding Grants to Support Your Research
Visualizing data1
Scholarly Communication: Tools and Strategies for Learning and Sharing in the...
Navigating the mechanics of research
Flyer for Dr. Chirag Shah Speaker
Overbeeke
Humanities info mgmt diss
Pressforward Presentation at the Western Humanities Alliance Annual Meeting, ...

Viewers also liked (7)

PPTX
Social media network for personal branding
PPTX
Social Media: Personal Branding
PPTX
Expanding Engagement & Inspiring Action with Your Next Product Launch Webinar
PPT
Personal Branding & You-How to use social Media to create tour own person...
PPTX
Self-guided Social Media Training Manual
PDF
Youth and Social Media: Today and Beyond
PDF
Personal Branding with Social Media by @JoeyShepp
Social media network for personal branding
Social Media: Personal Branding
Expanding Engagement & Inspiring Action with Your Next Product Launch Webinar
Personal Branding & You-How to use social Media to create tour own person...
Self-guided Social Media Training Manual
Youth and Social Media: Today and Beyond
Personal Branding with Social Media by @JoeyShepp
Ad

Similar to New Approach To Personal Network Search Based On Information Extraction (Tin180 Com) (20)

PDF
Benchmarking the Privacy-­Preserving People Search
PDF
MDS 2011 Paper: An Unsupervised Approach to Discovering and Disambiguating So...
PPTX
Personal network analysis september 18
PPTX
Profiling User Interests on the Social Semantic Web
PPT
UserZoom: Search For People Online Study
PPT
Who am I? Blogtalk 2010 presentation
PDF
Tutorial on User Profiling with Graph Neural Networks and Related Beyond-Acc...
PPTX
MDS 2011 Presentation: An Unsupervised Approach to Discovering and Disambigua...
PDF
M045067275
PPT
analysis of a real online social network using semantic web frameworks
PDF
Leveraging Graph Neural Networks for User Profiling: Recent Advances and Open...
PPTX
Extracting Semantic User Networks from Informal Communication Exchanges
PDF
@WebSciDL PhD Student Project Reviews August 5&6, 2015
PDF
merged_document
KEY
Snac dh2011-june-2011
PDF
Enhancing the Privacy Protection of the User Personalized Web Search Using RDF
PPT
Information Retrieval and Social Media
PDF
Making sense of strangers' expertise from signals in digital artifacts
PPTX
Extracting Semantic
PPT
Profiling a Person With Search Log Data
Benchmarking the Privacy-­Preserving People Search
MDS 2011 Paper: An Unsupervised Approach to Discovering and Disambiguating So...
Personal network analysis september 18
Profiling User Interests on the Social Semantic Web
UserZoom: Search For People Online Study
Who am I? Blogtalk 2010 presentation
Tutorial on User Profiling with Graph Neural Networks and Related Beyond-Acc...
MDS 2011 Presentation: An Unsupervised Approach to Discovering and Disambigua...
M045067275
analysis of a real online social network using semantic web frameworks
Leveraging Graph Neural Networks for User Profiling: Recent Advances and Open...
Extracting Semantic User Networks from Informal Communication Exchanges
@WebSciDL PhD Student Project Reviews August 5&6, 2015
merged_document
Snac dh2011-june-2011
Enhancing the Privacy Protection of the User Personalized Web Search Using RDF
Information Retrieval and Social Media
Making sense of strangers' expertise from signals in digital artifacts
Extracting Semantic
Profiling a Person With Search Log Data
Ad

More from Tin180 VietNam (20)

PPS
Tình thương làm thăng hoa cuộc sống
PPT
Web Spam Techniques
PPT
The Six Secrets Of News Search Engine Optimization
PPT
Seo Beginners Slide Show
PPT
Search Engine Optimisation (Seo) And Search Engine Marketing
PPT
Introduction To Seo
PPT
Php White Hat Seo
PPT
Google Tech For Better Content
PPT
Best Kept Secrets To Search Engine Optimization Success The Art And The Scie...
PPT
S E O & Adsense Optimization
PPT
Web 2 0 Panel Make Social Media Work For You (Tin180 Com)
PPT
Viral Marketing Advertising Strategies For Social Networks Presentation (Ti...
PPT
Trend Analysis In Social Tagging An Lis Perspective Ecdl2007 (Tin180 Com)
PPT
The Changers Eu Social Media And The Impact On Business Communications And ...
PPT
Socialnetworkanalysis (Tin180 Com)
PPT
Social Media Success (Tin180 Com)
PPT
Social Software In The Travel & Tourism Industry, & In Teaching A Sustainable...
PPT
Social Network Based Information Systems (Tin180 Com)
PPT
Social Information Processing (Tin180 Com)
PPT
Sharma Social Networks (Tin180 Com)
Tình thương làm thăng hoa cuộc sống
Web Spam Techniques
The Six Secrets Of News Search Engine Optimization
Seo Beginners Slide Show
Search Engine Optimisation (Seo) And Search Engine Marketing
Introduction To Seo
Php White Hat Seo
Google Tech For Better Content
Best Kept Secrets To Search Engine Optimization Success The Art And The Scie...
S E O & Adsense Optimization
Web 2 0 Panel Make Social Media Work For You (Tin180 Com)
Viral Marketing Advertising Strategies For Social Networks Presentation (Ti...
Trend Analysis In Social Tagging An Lis Perspective Ecdl2007 (Tin180 Com)
The Changers Eu Social Media And The Impact On Business Communications And ...
Socialnetworkanalysis (Tin180 Com)
Social Media Success (Tin180 Com)
Social Software In The Travel & Tourism Industry, & In Teaching A Sustainable...
Social Network Based Information Systems (Tin180 Com)
Social Information Processing (Tin180 Com)
Sharma Social Networks (Tin180 Com)

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Machine learning based COVID-19 study performance prediction
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
MYSQL Presentation for SQL database connectivity
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Network Security Unit 5.pdf for BCA BBA.
A comparative analysis of optical character recognition models for extracting...
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Machine learning based COVID-19 study performance prediction
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Encapsulation theory and applications.pdf
sap open course for s4hana steps from ECC to s4
Unlocking AI with Model Context Protocol (MCP)
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Per capita expenditure prediction using model stacking based on satellite ima...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
The Rise and Fall of 3GPP – Time for a Sabbatical?
Dropbox Q2 2025 Financial Results & Investor Presentation
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
MYSQL Presentation for SQL database connectivity
NewMind AI Weekly Chronicles - August'25-Week II
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf

New Approach To Personal Network Search Based On Information Extraction (Tin180 Com)

  • 1. Personal Social Network — A New Approach to Personal Network Search based on Information Extraction Jie Tang, Mingcai Hong, Jing Zhang, Bangyong Liang, and Juanzi Li Knowledge Engineering Group, Department of Computer Science and Technology, Tsinghua University Sep. 5 th , 2006
  • 2. Personal Social Network Personal social network is an important research area. A person usually has different types of information Personal profile (including portrait, homepage, position, affiliation, publications, and documents) Contact information (including address, email, telephone, and fax number) Friends Unfortunately, the information is often hidden in heterogeneous and distributed web pages
  • 3. Our Approach Personal Social Network = Building + Search + Mining Doc collection Annotation Integration Person search Publication search Association search Expert finding Research interesting finding
  • 4. Processing Flow Submitted to Returned pages Fed to Extracting and saving to Ontology base Query Classification Model
  • 5. Building the Personal Network >400,000 Persons >700,000 Publications
  • 6. Annotation using SVMs Personal profile: e.g. image, affiliation, etc. Contact information: fax, email, phone, etc. Start position model End position model Identified info. Features sets
  • 7. Person Search Search for a person using the name or other information, e.g. affiliation
  • 8. Publication Search Searching for a publication using IR model
  • 10. Association Search Finding associations between persons - high efficiency - Top-K associations Usage: - to find a partner - to find a person with same interests
  • 11. Expert Finding Finding experts on a topic
  • 12. Research Interest Finding Finding research interests for a person

Editor's Notes

  • #2: Welcome Professor Dieter to Tsinghua. Wish to get your advice and instruction. I am Li Juanzi from Knowledge Engineering Group in the department of computer science and technology. Thank professor Yang to give me this opportunity to introduce our work about semantic web And web services.
  • #5: The is the processing flow of the contact search. After the user inputted the person name, the system first query the database. If the database has the contact information of that person, the system will return the contact information directly. If not, the system submits the person name to Google. For the returned documents by Google, we take into consideration the top ranked 50 documents and fed them to a classifier. Our statistic shows that more that 90% of the personal information is located in the top ranked 20 documents and more that 95% of the personal information is located in the top ranked 50 documents. The classifier identifies whether a document contains the personal information or not. Finally, we make use a SVM based method for the extraction and save the extracted data into the database.
  • #7: In non-text filtering, we use the similar methods for header, signature, and program code detection. In the methods, we view a text line in an email as an instance in SVM. For each instance, we define a set of features. The method consists of two stages: training and detection. We use header as example to explain how we conduct the non-text block detection. In training, we use the training data as input and define two sets of features respectively for header start line and header end line detection. We then use the two feature sets to construct two SVM models. In detection, we identify whether or not a line is the start line of a header, and whether or not a line is the end line of a header using the two SVM models. We then view the lines between the identified start line and the end line as a header. So, to define effective features is one of our focuses.
  • #14: That is all for my introduction to our lab. Thank all