Skills, Reputation, and Search
Pete Skomoroch
Principal Data Scientist, LinkedIn
Vision: Create Economic Opportunity for Every Professional
2
Location
LinkedIn: The Professional Profile of Record
©2012 LinkedIn Corporation. All Rights Reserved. 3
200+MMembers 200M Member
Profiles
LinkedIn Search: Connecting Talent with Opportunity
4
Skills Correlated with the Job Title “Data Scientist”
5
Skills Related to “Big Data”
6
Information Retrieval
7
Soul Retrieval
8
9
Lucene on LinkedIn
10
Lucene Endorsement Graph
11
Solr on LinkedIn
12
Solr Endorsement Graph
13
Reputation: Building the Endorsement Graph
14
15
Viral Growth: 1 Billion Endorsements in 5 Months
How Did We Gather this Data?
16
1. Desire + Social Proof
2. Viral Loops + Network Effects
3. Data Foundation + Recommendation Algorithms
17
1) Desire & Social Proof
A
endorses
B
B
notified
B “accepts”
endorsement
B
endorses
C
B
endorses
D
Endorsement
recommendations
Email NotificationNews Feed
2) Viral Loops & Network Effects
3) Data Foundation: Skills & Suggested Skills
19
Data Foundation: LinkedIn Skills
20
Social Tagging Accelerates Adoption
Suggested
endorsements
Skill recommendations
Skill marketing
©2012 LinkedIn Cororation. All Rights Reserved.
Virality only
Outline
22
Skill discovery
Skill tagging
Skill recommendations
Suggested endorsements
Skill Discovery: Unsupervised Topics from Profiles
23
Extract
Topic Clustering & Phrase Sense Disambiguation
24
Deduplication Signals from Mechanical Turk
25
Sample Task for Mechanical Turk Workers
26
Skill Phrase Deduplication
27
Outline
28
Skill discovery
Skill tagging
Skill recommendations
Suggested endorsements
Lead designer and engineer for the implementation of a user-
centric, fully-configurable UI for data aggregation and reporting.
Developed over 20 SaaS custom applications using Python,
Javascript and RoR.
Tagging Skill Phrases
 Tagging: Extract potential skill phrases from text
 Standardize unambiguous phrase variants
29
JavaScript RoR SaaS Python
ror
rubyonrails
ruby on rails development
ruby rails
ruby on rail
Ruby on Rails
Document
(ex: Profile)
Tokenization
Skills Tagger
Phrases
(up to 6 words)
Skills Classifier
Skills
(unordered)
Skills
(ranked by relevance)
Outline
30
Skill discovery
Skill tagging
Skill recommendations
Suggested endorsements
Skill Inference
 How suggested/inferred skills work:
– The skill likelihood is a conditional model
– Probabilities are combined using a Naïve Bayes
Classifier
 If you are an engineer at Apple, you probably know
about iPhone Development.
31
Profile
Extract
attributes
- Company ID
- Title ID
- Groups ID
- Industry ID
- …
Skills Classifier
Skills
(ranked by likelihood)
Feature
Vectors
Skills, Reputation, and Search
Skills, Reputation, and Search
Skills, Reputation, and Search
Skills, Reputation, and Search
Skills, Reputation, and Search
Skill Recommendations for Your LinkedIn Profile
37
49% Conversion
4% Conversion
Outline
38
Skill discovery
Skill tagging
Skill recommendations
Suggested endorsements
Social Tagging via Skill Endorsements
39
Social Tagging Accelerates Adoption
Skill endorsements
Skill recommendations
Skill marketing
©2012 LinkedIn Cororation. All Rights Reserved.
Data Amplifies Desire
41
1. Desire + Social Proof
2. Viral Loops + Network Effects
3. Data Catalyst + Recommendation Algorithms
Over 58 Million Profiles are now Tagged with Skills
42
All This Data Flows Back Into Our Lucene Index
43
Helping us Connect Talent & Opportunity
44
Location
Questions?
We’re hiring: data.linkedin.com
@peteskomoroch
©2012 LinkedIn Corporation. All Rights Reserved. 45
CONTACT
Pete Skomoroch
@peteskomoroch
http://data.linkedin.com

More Related Content

PPTX
SF Data Science: Developing Data Products
PPTX
Developing Data Products
PPTX
Building Competitive Moats With Data
PPTX
LinkedIn Data Products
PDF
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
PPTX
Bg wesleyan liberal arts to silicon valley oct 2016
PDF
Big Data Ecosystem @ LinkedIn
PPTX
Data Science at LinkedIn - Data-Driven Products & Insights
SF Data Science: Developing Data Products
Developing Data Products
Building Competitive Moats With Data
LinkedIn Data Products
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg wesleyan liberal arts to silicon valley oct 2016
Big Data Ecosystem @ LinkedIn
Data Science at LinkedIn - Data-Driven Products & Insights

What's hot (20)

PDF
Chief Data Officer Agenda Webinar: How CDOs Should Work with Lawyers
PDF
Artificial Intelligence Beyond Theory & Concepts - Our AI Summer Academy Empo...
PDF
Democratizing Intelligence - Sri Ambati, CEO & Co-Founder, H2O.ai
PDF
1. The Importance of Graphs in Government
PDF
E2.0 fmw for apps ro 2010 11-30 v.02
PPTX
Tamm & kitt
PDF
2017 06-14-getting started with data science
PDF
Sql Relay Nottingham Keynote Oct 7th 2015
PPT
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...
PDF
Kush stats alpha
PDF
Is Data Scientist still the Sexiest job of the 21st century?
PDF
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
PDF
Data & Services / Service Lab London
PDF
II-SDV 2012 Merging Information from Structured and Unstructured Information ...
PPT
Employment trends 2011 by zylog
PDF
II-SDV 2012 Actionable Intelligence for the Whole Enterprise
PDF
Ds.ai applied ai-workshop-
PPTX
Colossal Data for Dramatic Effect
PDF
Getting started in ds (july 17) atlanta
PDF
Conversational Architecture, CAVE Language, Data Stewardship
Chief Data Officer Agenda Webinar: How CDOs Should Work with Lawyers
Artificial Intelligence Beyond Theory & Concepts - Our AI Summer Academy Empo...
Democratizing Intelligence - Sri Ambati, CEO & Co-Founder, H2O.ai
1. The Importance of Graphs in Government
E2.0 fmw for apps ro 2010 11-30 v.02
Tamm & kitt
2017 06-14-getting started with data science
Sql Relay Nottingham Keynote Oct 7th 2015
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...
Kush stats alpha
Is Data Scientist still the Sexiest job of the 21st century?
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
Data & Services / Service Lab London
II-SDV 2012 Merging Information from Structured and Unstructured Information ...
Employment trends 2011 by zylog
II-SDV 2012 Actionable Intelligence for the Whole Enterprise
Ds.ai applied ai-workshop-
Colossal Data for Dramatic Effect
Getting started in ds (july 17) atlanta
Conversational Architecture, CAVE Language, Data Stewardship
Ad

Similar to Skills, Reputation, and Search (20)

PDF
Keynote Peter Skomoroch - skills, reputation, and search
PDF
KEYNOTE: Skills, Reputation and Search
PDF
Talent Bin
PDF
Talentbin Sales Deck
PPTX
Find the 'Unfindable' with TalentBin by Monster!
PPTX
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
PDF
LinkedIn Data Cleansing-converted.pdf
PPTX
Sla canada student nov 25 2021
PPTX
Next generation linked in talent search
PPTX
Linked in corporate presentation [2aug'12]
PPTX
Free tips march 2013 staffing deck
PDF
Startds9.19.17sd
PDF
Data sci sd-11.6.17
PPTX
Free tips march 2013 staffing deck
PPTX
Using LinkedIn for Job Search
PDF
Getstarteddssd12717sd
PPT
Agency tips finalv3 092013
PPT
LinkedIn as the Ultimate Weapon for Business [Josef Kadlec]
PPTX
Gaps May 2011 Presentation1
PPT
LinkedIn Frisco Connect Career Search Network
Keynote Peter Skomoroch - skills, reputation, and search
KEYNOTE: Skills, Reputation and Search
Talent Bin
Talentbin Sales Deck
Find the 'Unfindable' with TalentBin by Monster!
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
LinkedIn Data Cleansing-converted.pdf
Sla canada student nov 25 2021
Next generation linked in talent search
Linked in corporate presentation [2aug'12]
Free tips march 2013 staffing deck
Startds9.19.17sd
Data sci sd-11.6.17
Free tips march 2013 staffing deck
Using LinkedIn for Job Search
Getstarteddssd12717sd
Agency tips finalv3 092013
LinkedIn as the Ultimate Weapon for Business [Josef Kadlec]
Gaps May 2011 Presentation1
LinkedIn Frisco Connect Career Search Network
Ad

More from Peter Skomoroch (13)

PPTX
Bridging the AI Gap: Building Stakeholder Support
PDF
Managing Machines: The New AI Dev Stack
PDF
Product Management for AI
PDF
Executive Briefing: Why managing machines is harder than you think
PPT
O'Reilly Strata: Distilling Data Exhaust
PPTX
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
PDF
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
PDF
Street Fighting Data Science
PDF
Data Mashups -Data Science Summit
KEY
Geo Analytics Tutorial - Where 2.0 2011
PDF
Rapid Data Exploration With Hadoop
PDF
Prototyping Data Intensive Apps: TrendingTopics.org
PDF
Elasticwulf Pycon Talk
Bridging the AI Gap: Building Stakeholder Support
Managing Machines: The New AI Dev Stack
Product Management for AI
Executive Briefing: Why managing machines is harder than you think
O'Reilly Strata: Distilling Data Exhaust
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
Street Fighting Data Science
Data Mashups -Data Science Summit
Geo Analytics Tutorial - Where 2.0 2011
Rapid Data Exploration With Hadoop
Prototyping Data Intensive Apps: TrendingTopics.org
Elasticwulf Pycon Talk

Recently uploaded (20)

PDF
STKI Israel Market Study 2025 version august
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
DOCX
Basics of Cloud Computing - Cloud Ecosystem
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PDF
Convolutional neural network based encoder-decoder for efficient real-time ob...
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
PDF
OpenACC and Open Hackathons Monthly Highlights July 2025
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PDF
Credit Without Borders: AI and Financial Inclusion in Bangladesh
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
PDF
Flame analysis and combustion estimation using large language and vision assi...
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPTX
Microsoft Excel 365/2024 Beginner's training
PPTX
Modernising the Digital Integration Hub
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
STKI Israel Market Study 2025 version august
Module 1.ppt Iot fundamentals and Architecture
Taming the Chaos: How to Turn Unstructured Data into Decisions
Basics of Cloud Computing - Cloud Ecosystem
Improvisation in detection of pomegranate leaf disease using transfer learni...
Convolutional neural network based encoder-decoder for efficient real-time ob...
Custom Battery Pack Design Considerations for Performance and Safety
OpenACC and Open Hackathons Monthly Highlights July 2025
Final SEM Unit 1 for mit wpu at pune .pptx
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
Credit Without Borders: AI and Financial Inclusion in Bangladesh
A contest of sentiment analysis: k-nearest neighbor versus neural network
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
Flame analysis and combustion estimation using large language and vision assi...
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Microsoft Excel 365/2024 Beginner's training
Modernising the Digital Integration Hub
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...

Skills, Reputation, and Search