SlideShare a Scribd company logo
Data Mashups
  May 12, 2011
  Data Scientist Summit
                          Turning Data Exhaust into
  Pete Skomoroch
  LinkedIn                                 Insights
  @peteskomoroch
We have an explosion of data

               • DataWrangling
               • InfoChimps
               • Data.gov
               • Factual
               • SimpleGeo
And the tools to make sense of it

                • Hadoop
                • NoSQL
                •R
                • Python
                • Mechanical Turk
Diverse datasets = better signal
Data Mashups -Data Science Summit
Data Mashups -Data Science Summit
Find a meaningful problem

                                             • Identify pain points
                                             • Work on stuff that
                                               matters

                                             • Focus on
                                               underutilized data

 http://www.flickr.com/photos/aloshbennett/
Trendingtopics.org @hourlytrends
LinkedIn Skills
The best mashups are actionable

               • Reveal patterns
               • Enable predictions
               • Recommendations
Mashup: Skills & Cities
Yuba City, California: 21.3% Unemployment
Ames, Iowa: 4.7% Unemployment
Make data mashups work for you
 • Open Data = powerful mashups
 • Mashup > sum of its parts
 • Focus on meaningful problems
 • Actionable mashups are better

More Related Content

PPT
O'Reilly Strata: Distilling Data Exhaust
PDF
Journey of The Connected Enterprise - Knowledge Graphs - Smart Data
PDF
Intro to Data Science
PDF
Harnessing search engines for KM
PDF
Building a New Platform for Customer Analytics
PDF
Unlock your Big Data with Analytics and BI on Office 365
PDF
2017 06-14-getting started with data science
PDF
Claudia Gold: Learning Data Science Online
O'Reilly Strata: Distilling Data Exhaust
Journey of The Connected Enterprise - Knowledge Graphs - Smart Data
Intro to Data Science
Harnessing search engines for KM
Building a New Platform for Customer Analytics
Unlock your Big Data with Analytics and BI on Office 365
2017 06-14-getting started with data science
Claudia Gold: Learning Data Science Online

What's hot (20)

PDF
Thinkful DC - Intro to Data Science
PDF
Thinkful - Intro to Data Science - Washington DC
PPTX
Presentation at Bio IT World West: To AI or Not to AI, Presented by Simon Tay...
PPTX
UCSD: Building a Big Data Culture - It Takes a Village
PPTX
Twitter in One Hour (or Less) for Lawyers
PDF
The Rise of the CDO in Today's Enterprise
PDF
Be a Data Scientist in 8 steps!
PDF
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...
PPTX
Fasten you seatbelt and listen to the Data Steward
PPTX
How to use your data science team: Becoming a data-driven organization
PDF
Clare Corthell: Learning Data Science Online
PDF
Data Visualization: A Quick Tour for Data Science Enthusiasts
PPTX
Graph Thinking: Why it Matters
PPTX
How To Become a Data Scientist in Iran Marketplace
PPTX
Plenary Keynote Intro at Bio IT World West - Diane Burley, Lucidworks VP Content
PDF
Visual Personal Branding & Infographic Resumes
PPTX
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
PDF
Intro to Python for Data Science
PDF
Illustrating Graphs Visually through Neo4j Bloom
PDF
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Thinkful DC - Intro to Data Science
Thinkful - Intro to Data Science - Washington DC
Presentation at Bio IT World West: To AI or Not to AI, Presented by Simon Tay...
UCSD: Building a Big Data Culture - It Takes a Village
Twitter in One Hour (or Less) for Lawyers
The Rise of the CDO in Today's Enterprise
Be a Data Scientist in 8 steps!
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...
Fasten you seatbelt and listen to the Data Steward
How to use your data science team: Becoming a data-driven organization
Clare Corthell: Learning Data Science Online
Data Visualization: A Quick Tour for Data Science Enthusiasts
Graph Thinking: Why it Matters
How To Become a Data Scientist in Iran Marketplace
Plenary Keynote Intro at Bio IT World West - Diane Burley, Lucidworks VP Content
Visual Personal Branding & Infographic Resumes
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Intro to Python for Data Science
Illustrating Graphs Visually through Neo4j Bloom
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Ad

Similar to Data Mashups -Data Science Summit (20)

PDF
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
PDF
Big databigideasit4bc
PPTX
Big Data and the Art of Data Science
PDF
Practical Applications of Visual Analytics
PPTX
Strata Online_road_to_enterprise_data_2011
PDF
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
PPTX
Value Mining: How Entity Extraction Informs Analysis
PDF
Rapid Data Exploration With Hadoop
PDF
Big Data and NoSQL in Microsoft-Land
PPTX
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
PPTX
Ml pluss ejan2013
PDF
Data science
PPT
InfiniteGraph Presentation from Oct 21, 2010 DBTA Webcast
PDF
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
PDF
Pivotal Data Warehouse in the Age of Digital Transformation
PDF
Getting started in data science (4:3)
PDF
Getting started in data science (4:3)
PPT
Big Data = Big Decisions
PDF
NYC Data Web (static version) - A Semantic, Open Public Data Exchange for NYC
PDF
Getting Started in Data Science
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big databigideasit4bc
Big Data and the Art of Data Science
Practical Applications of Visual Analytics
Strata Online_road_to_enterprise_data_2011
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
Value Mining: How Entity Extraction Informs Analysis
Rapid Data Exploration With Hadoop
Big Data and NoSQL in Microsoft-Land
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
Ml pluss ejan2013
Data science
InfiniteGraph Presentation from Oct 21, 2010 DBTA Webcast
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
Pivotal Data Warehouse in the Age of Digital Transformation
Getting started in data science (4:3)
Getting started in data science (4:3)
Big Data = Big Decisions
NYC Data Web (static version) - A Semantic, Open Public Data Exchange for NYC
Getting Started in Data Science
Ad

More from Peter Skomoroch (14)

PPTX
Bridging the AI Gap: Building Stakeholder Support
PDF
Managing Machines: The New AI Dev Stack
PDF
Product Management for AI
PDF
Executive Briefing: Why managing machines is harder than you think
PPTX
Building Competitive Moats With Data
PPTX
SF Data Science: Developing Data Products
PPTX
Skills, Reputation, and Search
PPTX
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
PPTX
Developing Data Products
PDF
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
PDF
Street Fighting Data Science
KEY
Geo Analytics Tutorial - Where 2.0 2011
PDF
Prototyping Data Intensive Apps: TrendingTopics.org
PDF
Elasticwulf Pycon Talk
Bridging the AI Gap: Building Stakeholder Support
Managing Machines: The New AI Dev Stack
Product Management for AI
Executive Briefing: Why managing machines is harder than you think
Building Competitive Moats With Data
SF Data Science: Developing Data Products
Skills, Reputation, and Search
LinkedIn Endorsements: Reputation, Virality, and Social Tagging
Developing Data Products
Practical Problem Solving with Data - Onlab Data Conference, Tokyo
Street Fighting Data Science
Geo Analytics Tutorial - Where 2.0 2011
Prototyping Data Intensive Apps: TrendingTopics.org
Elasticwulf Pycon Talk

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Cloud computing and distributed systems.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Big Data Technologies - Introduction.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Encapsulation_ Review paper, used for researhc scholars
Cloud computing and distributed systems.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
NewMind AI Monthly Chronicles - July 2025
20250228 LYD VKU AI Blended-Learning.pptx
Big Data Technologies - Introduction.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
NewMind AI Weekly Chronicles - August'25 Week I
Understanding_Digital_Forensics_Presentation.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Data Mashups -Data Science Summit