SlideShare a Scribd company logo
Next-Generation
LinkedIn Talent Search
Ryan Wu
Staff Machine Learning Scientist & Tech Lead
LinkedIn Corporation
Outline
Mission & Product
Expertise Search
Candidate Discovery via Similar Profiles Modeling
Search By Example
3
• 200+ countries and
territories
• 2+ new members per
second
4
● Dual Roles of Search
○ Enable talent discover opportunity
○ Help companies to search for the right talent
Standardized
Member Profile
At its core, LinkedIn is a digital representation of
the business world—a collection of the people,
companies, educations, skills, jobs, and the
connections between them. Our products use
this data to connect members with relevant
information, contacts, content, and opportunities.
The success of these products relies on data
standardization, our ability to understand user
data and to effectively make use of this
information.
6
Flagship Search
Recruiter Search
Sales Navigator Search
Agenda
Mission & Product
Expertise Search
Candidate Discovery via Similar Profiles Modeling
Search By Example
Expertise (Skill) Search at LinkedIn
▪ Skills
– 104’s of standardized skills
– Members get endorsed for
skills listed on their profile.
– Represent professional
expertise. 8
Searching by Skills
▪ Unique challenges to LinkedIn expertise Search
– Scale: 450M members x 35K standardized skills
– Sparsity of skills in profiles
– Personalization
9
…
Skill Reputation Scores [BigData’15]
10
▪ Decision Maker: searcher
▪ Record: Professional
career
▪ Skill reputation: member
expertise on a skill
Estimating Skill Reputation
11
Endorse
profile
browsemap
? .85 .45
? ? .35
? .42 ?
? ? .05
Members
Skills
P(expert| member, skill)
Supervised Learning
algorithm
Estimating Skill Reputation
12
Endorse
profile
browse
map
? .85 .45
? ? .35
? .42 ?
? ? .05
Members
Skills
0.5 1
0.7 0
0 0.6
0.1 0
0.2 0.3 0.5
0.5 0.7 0.2
Members
Skills
Each row is a representation of a
member in latent space
Each column
represents a skill in
latent space
Matrix Factorization
Estimating Skill Reputation
13
Endorse
profile
browsemap
? .85 .45
? ? .35
? .42 ?
.02 ? ?
Members
Skills
0.5 1
0.7 0
0 0.6
0.1 0
0.2 0.3 0.5
0.5 0.7 0.2
Members
Skills
.6 .85 .45
.14 .21 .35
.3 .42 .12
.02 .03 .05
Members
Skills
Fill in unknown cells in
the original matrix
Agenda
Mission & Product
Expertise Search
Candidate Discovery via Similar Profiles Modeling
Search By Example
Similar People How you rank for
profile views
People You May Hire Lead Recommendations
Similar Profiles Recommender
Title : Software Engineer, Research
Engineer, Research Assistant
Specialty : Machine Learning, Data
Analysis, Hadoop, Networks
Company: Cisco, Linkedin, Penn
State
Summary: Software Engineer,
Research Engineer, Machine
Learning, Data Analysis, Networks,
Research Assistant
How to model a profile with career trajectory?
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: Networks
- Company: Penn State
- Title: Research Assistant
- Duration: (2006.9-2010.6)
Keywords Profile Model Sequence Profile Model
How to match two profiles ?
Title : Software Engineer, RA
Specialty: ML, Networks…
Company: Cisco, Linkedin, Penn
State
Summary: Software Engineer
ML, Networks, …
Title : Software Engineer, Ph.D.
Specialty: ML, DM, Mobile…
Company: Yahoo, Linkedin,
Intel, Dartmouth
Summary: Software Engineer
ML, DM, Mobile …
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: ML
- Company: Linkedin
- Title: Software Engineer
- Duration: (2012.7-2013.3)
- Summary: Mobile
- Company: Intel
- Title: Software Engineer
- Duration: (2010.5-2012.3)
- Summary: Hadoop, DM
- Company: Yahoo
- Title: Research Scientist
- Duration: (2008.8-2010.4)
Similar Profiles
Similar Career Paths
?
Set of Positions
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: Networks
- Company: Penn State
- Title: Research Assistant
- Duration: (2006.9-2010.6)
- Summary: ML
- Company: Linkedin
- Title: Software Engineer
- Duration: (2012.7-2013.3)
- Summary: Mobile
- Company: Intel
- Title: Software Engineer
- Duration: (2010.5-2012.3)
- Summary: Hadoop, DM
- Company: Yahoo
- Title: Research Scientist
- Duration: (2008.8-2010.4)
• Profile 2
- Summary: Sensor
- Company: Dartmouth
- Title: Ph.D. Student
- Duration: (2002.9-2008.7)
• Profile 1
Sequence of Positions
• Profile 1
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: Networks
- Company: Penn State
- Title: Research Assistant
- Duration: (2006.9-2010.6)
• Profile 2
- Summary: ML
- Company: Linkedin
- Title: Software Engineer
- Duration: (2012.7-2013.3)
- Summary: Mobile
- Company: Intel
- Title: Software Engineer
- Duration: (2010.5-2012.3)
- Summary: Hadoop, DM
- Company: Yahoo
- Title: Research Scientist
- Duration: (2008.8-2010.4)
- Summary: Sensor
- Company: Dartmouth
- Title: Ph.D. Student
- Duration: (2002.9-2008.7)
Agenda
Mission & Product
Expertise Search
Candidate Discovery via Similar Profiles Modeling
Search By Example
Search by Ideal
Candidate
Challenges in Search by Example
1. Given a set of users determine
a. Most relevant skills
b. Most relevant titles + seniority
c. Related Companies
1. Maintaining high level of personalization for the user
Architecture : Search by Ideal Candidate
The first part in the function (f1) estimates how a result r is relevant to
query q and searcher s, as in the standard personalized search. The
second part (f2) aims to guarantee a direct similarity between a result and
input ideal candidates (IC).
λ is a parameter controlling decay rate
Some of the contributing features
1. Skills reputation
2. Career Similarity
3. Related Companies
References
1. Search by Ideal Candidates: Next Generation of Talent Search at LinkedIn, Ha-Thuc, Viet; Xu, Ye; Pradeep
Kanduri, Satya; Wu, Xianren; Dialani, Vijay; Yan, Yan; Gupta, Abhishek; Sinha, Shakti, WWW 2015
2. Personalized expertise search at linkedin, V. Ha-Thuc, G. Venkataraman, M. Rodriguez, S. Sinha, S. Sundaram,
and L. Guo, In Proceedings of the 4th IEEE International Conference Big Data (BigData), IEEE, 2015.
3. Modeling Professional Similarity by mining Professional Career Trajectories, Y. Xu; Z. Li; A. Gupta; A. Bugdayci;
A. Bhasin, In Proceedings of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining,
IEEE, 2014.
Thanks to my collaborators:
1. Viet Ha-Thuc
2. Ye Xu
3. Satya Kanduri
4. Vijay Dialani
5. Yan Yan

More Related Content

PPTX
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
PPTX
LinkedIn Skills: RecSys Conference 2014
PDF
Gutmacher In-House Sourcing Model Offshore and Onshore Nov. 2016
DOC
Aaron Felts 2011
DOCX
safety summit
PDF
PPTX
Legislación Tributaria
PDF
APOSTILA MPB
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
LinkedIn Skills: RecSys Conference 2014
Gutmacher In-House Sourcing Model Offshore and Onshore Nov. 2016
Aaron Felts 2011
safety summit
Legislación Tributaria
APOSTILA MPB

Viewers also liked (8)

PDF
How to Get Rid of Warts With Herbs
PPTX
7 BASIC QUESTIONS
PPTX
Trabalho fec
PDF
Emotional Intelligence changing human resource functionality
PPTX
Alimentación Saludable
PDF
Kaiqi 2015 catalogue
PPT
Teoria Musical I
PDF
Der demographische Wandel in Deutschland
How to Get Rid of Warts With Herbs
7 BASIC QUESTIONS
Trabalho fec
Emotional Intelligence changing human resource functionality
Alimentación Saludable
Kaiqi 2015 catalogue
Teoria Musical I
Der demographische Wandel in Deutschland
Ad

Similar to Next generation linked in talent search (20)

PPTX
Machine Learning for Search at LinkedIn
PPTX
Search Ranking Across Heterogeneous Information Sources
PPTX
SCP_lnkd_Presentation_kdd2014.pptx
PPTX
Skills, Reputation, and Search
PDF
Keynote Peter Skomoroch - skills, reputation, and search
PDF
KEYNOTE: Skills, Reputation and Search
PDF
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
PDF
DataEngConf SF16 - Methods for Content Relevance at LinkedIn
PPTX
Learning to Rank Personalized Search Results in Professional Networks
PPTX
Personalizing Search at LinkedIn
PPTX
Aiinpractice2017deepaklongversion
PPTX
Computing Professional Identity for the Economic Graph
PDF
Women In Big Data Meetup 2020, Hosted by LinkedIn
PDF
Recommending job ads to people
PDF
Quarterly Product Release Webinar: Q1 Edition
PDF
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
PPTX
OSINT - Linkedin Search Slides
PPTX
Stanford 2014 Tech Entrepreneurship Slides - Talent, Opportunity, and Data @ ...
PDF
Large-scale Social Recommendation Systems: Challenges and Opportunity
PDF
Recsys2016 Tutorial by Xavier and Deepak
Machine Learning for Search at LinkedIn
Search Ranking Across Heterogeneous Information Sources
SCP_lnkd_Presentation_kdd2014.pptx
Skills, Reputation, and Search
Keynote Peter Skomoroch - skills, reputation, and search
KEYNOTE: Skills, Reputation and Search
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
DataEngConf SF16 - Methods for Content Relevance at LinkedIn
Learning to Rank Personalized Search Results in Professional Networks
Personalizing Search at LinkedIn
Aiinpractice2017deepaklongversion
Computing Professional Identity for the Economic Graph
Women In Big Data Meetup 2020, Hosted by LinkedIn
Recommending job ads to people
Quarterly Product Release Webinar: Q1 Edition
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
OSINT - Linkedin Search Slides
Stanford 2014 Tech Entrepreneurship Slides - Talent, Opportunity, and Data @ ...
Large-scale Social Recommendation Systems: Challenges and Opportunity
Recsys2016 Tutorial by Xavier and Deepak
Ad

Recently uploaded (20)

PPTX
Primary and secondary sources, and history
DOC
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
PPTX
Hydrogel Based delivery Cancer Treatment
PPTX
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
PPTX
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
PPTX
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
PPTX
Human Mind & its character Characteristics
PDF
oil_refinery_presentation_v1 sllfmfls.pdf
PPTX
fundraisepro pitch deck elegant and modern
PPTX
Learning-Plan-5-Policies-and-Practices.pptx
PPTX
ART-APP-REPORT-FINctrwxsg f fuy L-na.pptx
PPTX
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
PPTX
English-9-Q1-3-.pptxjkshbxnnxgchchxgxhxhx
PPTX
Emphasizing It's Not The End 08 06 2025.pptx
PPTX
Effective_Handling_Information_Presentation.pptx
PDF
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
PPTX
nose tajweed for the arabic alphabets for the responsive
PPTX
Anesthesia and it's stage with mnemonic and images
PPTX
Intro to ISO 9001 2015.pptx wareness raising
PPTX
Introduction-to-Food-Packaging-and-packaging -materials.pptx
Primary and secondary sources, and history
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
Hydrogel Based delivery Cancer Treatment
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
Human Mind & its character Characteristics
oil_refinery_presentation_v1 sllfmfls.pdf
fundraisepro pitch deck elegant and modern
Learning-Plan-5-Policies-and-Practices.pptx
ART-APP-REPORT-FINctrwxsg f fuy L-na.pptx
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
English-9-Q1-3-.pptxjkshbxnnxgchchxgxhxhx
Emphasizing It's Not The End 08 06 2025.pptx
Effective_Handling_Information_Presentation.pptx
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
nose tajweed for the arabic alphabets for the responsive
Anesthesia and it's stage with mnemonic and images
Intro to ISO 9001 2015.pptx wareness raising
Introduction-to-Food-Packaging-and-packaging -materials.pptx

Next generation linked in talent search

  • 1. Next-Generation LinkedIn Talent Search Ryan Wu Staff Machine Learning Scientist & Tech Lead LinkedIn Corporation
  • 2. Outline Mission & Product Expertise Search Candidate Discovery via Similar Profiles Modeling Search By Example
  • 3. 3 • 200+ countries and territories • 2+ new members per second
  • 4. 4 ● Dual Roles of Search ○ Enable talent discover opportunity ○ Help companies to search for the right talent
  • 5. Standardized Member Profile At its core, LinkedIn is a digital representation of the business world—a collection of the people, companies, educations, skills, jobs, and the connections between them. Our products use this data to connect members with relevant information, contacts, content, and opportunities. The success of these products relies on data standardization, our ability to understand user data and to effectively make use of this information.
  • 7. Agenda Mission & Product Expertise Search Candidate Discovery via Similar Profiles Modeling Search By Example
  • 8. Expertise (Skill) Search at LinkedIn ▪ Skills – 104’s of standardized skills – Members get endorsed for skills listed on their profile. – Represent professional expertise. 8
  • 9. Searching by Skills ▪ Unique challenges to LinkedIn expertise Search – Scale: 450M members x 35K standardized skills – Sparsity of skills in profiles – Personalization 9 …
  • 10. Skill Reputation Scores [BigData’15] 10 ▪ Decision Maker: searcher ▪ Record: Professional career ▪ Skill reputation: member expertise on a skill
  • 11. Estimating Skill Reputation 11 Endorse profile browsemap ? .85 .45 ? ? .35 ? .42 ? ? ? .05 Members Skills P(expert| member, skill) Supervised Learning algorithm
  • 12. Estimating Skill Reputation 12 Endorse profile browse map ? .85 .45 ? ? .35 ? .42 ? ? ? .05 Members Skills 0.5 1 0.7 0 0 0.6 0.1 0 0.2 0.3 0.5 0.5 0.7 0.2 Members Skills Each row is a representation of a member in latent space Each column represents a skill in latent space Matrix Factorization
  • 13. Estimating Skill Reputation 13 Endorse profile browsemap ? .85 .45 ? ? .35 ? .42 ? .02 ? ? Members Skills 0.5 1 0.7 0 0 0.6 0.1 0 0.2 0.3 0.5 0.5 0.7 0.2 Members Skills .6 .85 .45 .14 .21 .35 .3 .42 .12 .02 .03 .05 Members Skills Fill in unknown cells in the original matrix
  • 14. Agenda Mission & Product Expertise Search Candidate Discovery via Similar Profiles Modeling Search By Example
  • 15. Similar People How you rank for profile views People You May Hire Lead Recommendations Similar Profiles Recommender
  • 16. Title : Software Engineer, Research Engineer, Research Assistant Specialty : Machine Learning, Data Analysis, Hadoop, Networks Company: Cisco, Linkedin, Penn State Summary: Software Engineer, Research Engineer, Machine Learning, Data Analysis, Networks, Research Assistant How to model a profile with career trajectory? - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: Networks - Company: Penn State - Title: Research Assistant - Duration: (2006.9-2010.6) Keywords Profile Model Sequence Profile Model
  • 17. How to match two profiles ? Title : Software Engineer, RA Specialty: ML, Networks… Company: Cisco, Linkedin, Penn State Summary: Software Engineer ML, Networks, … Title : Software Engineer, Ph.D. Specialty: ML, DM, Mobile… Company: Yahoo, Linkedin, Intel, Dartmouth Summary: Software Engineer ML, DM, Mobile … - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: ML - Company: Linkedin - Title: Software Engineer - Duration: (2012.7-2013.3) - Summary: Mobile - Company: Intel - Title: Software Engineer - Duration: (2010.5-2012.3) - Summary: Hadoop, DM - Company: Yahoo - Title: Research Scientist - Duration: (2008.8-2010.4) Similar Profiles Similar Career Paths ?
  • 18. Set of Positions - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: Networks - Company: Penn State - Title: Research Assistant - Duration: (2006.9-2010.6) - Summary: ML - Company: Linkedin - Title: Software Engineer - Duration: (2012.7-2013.3) - Summary: Mobile - Company: Intel - Title: Software Engineer - Duration: (2010.5-2012.3) - Summary: Hadoop, DM - Company: Yahoo - Title: Research Scientist - Duration: (2008.8-2010.4) • Profile 2 - Summary: Sensor - Company: Dartmouth - Title: Ph.D. Student - Duration: (2002.9-2008.7) • Profile 1
  • 19. Sequence of Positions • Profile 1 - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: Networks - Company: Penn State - Title: Research Assistant - Duration: (2006.9-2010.6) • Profile 2 - Summary: ML - Company: Linkedin - Title: Software Engineer - Duration: (2012.7-2013.3) - Summary: Mobile - Company: Intel - Title: Software Engineer - Duration: (2010.5-2012.3) - Summary: Hadoop, DM - Company: Yahoo - Title: Research Scientist - Duration: (2008.8-2010.4) - Summary: Sensor - Company: Dartmouth - Title: Ph.D. Student - Duration: (2002.9-2008.7)
  • 20. Agenda Mission & Product Expertise Search Candidate Discovery via Similar Profiles Modeling Search By Example
  • 22. Challenges in Search by Example 1. Given a set of users determine a. Most relevant skills b. Most relevant titles + seniority c. Related Companies 1. Maintaining high level of personalization for the user
  • 23. Architecture : Search by Ideal Candidate The first part in the function (f1) estimates how a result r is relevant to query q and searcher s, as in the standard personalized search. The second part (f2) aims to guarantee a direct similarity between a result and input ideal candidates (IC). λ is a parameter controlling decay rate
  • 24. Some of the contributing features 1. Skills reputation 2. Career Similarity 3. Related Companies
  • 25. References 1. Search by Ideal Candidates: Next Generation of Talent Search at LinkedIn, Ha-Thuc, Viet; Xu, Ye; Pradeep Kanduri, Satya; Wu, Xianren; Dialani, Vijay; Yan, Yan; Gupta, Abhishek; Sinha, Shakti, WWW 2015 2. Personalized expertise search at linkedin, V. Ha-Thuc, G. Venkataraman, M. Rodriguez, S. Sinha, S. Sundaram, and L. Guo, In Proceedings of the 4th IEEE International Conference Big Data (BigData), IEEE, 2015. 3. Modeling Professional Similarity by mining Professional Career Trajectories, Y. Xu; Z. Li; A. Gupta; A. Bugdayci; A. Bhasin, In Proceedings of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, IEEE, 2014. Thanks to my collaborators: 1. Viet Ha-Thuc 2. Ye Xu 3. Satya Kanduri 4. Vijay Dialani 5. Yan Yan

Editor's Notes

  • #4: LinkedIn is the world’s largest professional network, with more than 400M members from all over the world. Its mission: “To connect the world's professionals to make them more productive and successful”. Our member base is growing fast with more than 2 members joining the network every second.
  • #5: Our vision is to create the first economic graph digitally mapping the global economy to connect talent with opportunity. In this vision, search plays a big role. On one hand, search helps members to discover opportunity available to them. On the other hand, companies could use search to find the right talent for them.Thus, search plays a central role at LinkedIn. There is a huge variety in profiles found on LinkedIn and we have identified millions of companies, thousands of titles and thousands of skills that can be used to locate a professional that one may intend to hire.
  • #7: Given the role of LinkedIn search, we provide multiple search products specializing for different use-cases. The first product is flagship search, which is shown on the left side. This is open to all of our members to find job opportunities, to search for people to connect with or to discover content on LinkedIn to develop their careers. We also offer premium products like Sales Navigator and Recruiter Search. Sales Navigator is to help members discover sales opportunities and help them connect with the opportunities Recruiter search is to help corporation around the world find the right talent for them.
  • #9: Search on LinkedIn has two aspects: find and be found. The former is how the system retrieves and ranks results given a query. The later is how our members show case their professional expertise in order to be found. On LinkedIn a typical way to do that is via skills. Here are typical skills of bigdata engineer. There are about 40K std skills on the site. As you can see in the picture, members can endorse their connections on some certain skills. So, if you get endorsement from your peers or managers or from experts on the skills, that’s a strong signal to represent your professional expertise. Given the skills on our eco-system, one of the unique value propositions of LinkedIn is the ability to search by skills.
  • #10: The problem of expertise search on LinkedIn has a set of unique challenges. The first is scale. We have 450MM x 35K, which is by far bigger than expert finding task in the previous research, e.g., the task in TREC. Thus, it’s challenging to infer expertise level of each member of every skill he or she might have The second issue that a member typically doesn’t list all of the skills they know on profile ... It is crucial to infer expertise on the skills that members might have but don’t explicitly list Finally, Linkedin search is highly personalized. For instance, … searchers do not only care of expertise of results. So, the search ranking function has to combine those things together.
  • #11: In our case, the decision maker is a searcher with a job position in mind looking for candidates. The record of each candidate is his professional career represented by a LinkeIn profile, including … Reputation reflects the level of expertise of the candidate on certain skill. Eventually, similar FICO scores, which is a source that lenders look at to make a judgment whether to give a loan, our goal for skill reputation scores is one the factors (not the only one) that the searcher counts on to make a judgment on whether to hire the candidate.
  • #12: Given the goal, here is our approach. In the first step, we use a supervised learning algorithm combining a bunch of signals on the site such as … to infer scores for a subset of (member, skill) pairs. The score means … At this step, the matrix is very sparse since we can be quite certain only for a small percentage of the pairs.
  • #13: Once we have the initial matrix, we factorize it. This steps is quite similar to collaborative filtering and LSI, for those of you who are familiar with them. Our member-skill matrix is similar to Netfllix movie rating matrix or document-term matrix in LSI, where a cell is frequency of a term in a document. After factorization, each row in the member matrix could be understood as an unnormalized distribution over latent topics, where a latent topic is a cluster of related skills. Similarly, a column in skill matrix represent a skill in the latent topic space.
  • #14: Then, we compute dot-product between member latent matrix and skill latent matrix to reconstruct the original member-skill matrix as well as fill in the unknown cells Take skill co-occurrence patterns to infer missing skills: members knowing “Big Data” are also likely to know “Hadoop”
  • #16: LinkedIn maintains the professional profile for over 300 million members. With such a large number of profiles, high-quality profile discovery becomes a challenging problem. To solve the problem, we developed Similar Profiles recommender , which is a fundemantal technology of linkedin’s people recommendations systems. In addition to serving as an independent product, it also powers various profile discovery systems in Linkedin, for example, People You May Want to Hire is a personalized candidate discovery engine for recruiters that takes into account all of the context with regards to recruiting activities Similar Profiles Recommender System helps recruiters and hiring managers discover other similar talent by pivoting of a model user profile.
  • #17: Representation of profiles: IN current similar profile recommender, one profile is modeled as a set of fields, such as tile, specialty and company. For each field, we extract the keywords and create a vector of keywords to represent the field Obviously , this model is not able to capture temporal information, not to mention to represent a career path. A natual way to solve this problem is to keep the sequence of work positions. And for each position, we can describe its different dimensions such as The timeline is a sequence of work positions for people’s. Each position is a particular job experience, containing time duration, title information, along with key words summary.
  • #18: No need to explain the solution. Just focus on the problem. We use keywords based matching methods to find similarity scores for field-pairs. An overall similarity score is calculated using a weighted linear combination of these scores. Then this overall score is normalized using a logit function, so that it can be interpreted as a probability of being similar. Here, weights reflect the relative importance of field pairs that are matched. These weights are learned by fitting a logistic regression model on train- ing data obtained from active recruiter usage of Similar Profiles product.
  • #19: Considering each career path can also be modeled as a set of position, we can find optimal match of pairs of positions from two sets.
  • #20: We first point out the different between the set based and sequence base. Generally speaking, given two sequences of nodes (profiles), we conduct sequence alignment to calculate the similarity between them Each node here is a representation of one par- ticular work experience. In sequence alignment algorithm [16], the sequence level sim- ilarity is measured by calculating the sum of the optimal align- ment of node pairs.
  • #22: Search by Ideal Candidate: Retains the ability to switch between the ideal candidate and the search by criteria flows Ability to abstract intent from the list of candidates Uses latent features to automatically expand the list of companies and list of industries and skills Takes Skill reputation and title correlation into account.