SlideShare a Scribd company logo
THINK BREADTH,
NOT DEPTH
Seong Lee
O P E N
D A T A
S C I E N C E
C O N F E R E N C E_
BOSTON 2015
@opendatasci
THINK
BREADTH,
NOT DEPTH
HOW NEW DATA GENERATES NEW ALPHA
OUR AGENDA
• My Story With Quantitative Investing
• The Problem Of Small Fish Big Pond
• The Solution: Big Fish Small Pond
• How This Applies To Quantitative Finance
• How You Can Do This Too
• Conclusion
MY STORY
• I work with algorithms, clients, and ideas
• My job is to combine all three into one finished product
• Everyone has different ideas but wants the same thing
• Alpha but everyone uses the same data
• Wall Street Analyst Estimates – PEAD
• Build something that beats the market?
THE PROBLEM
• Too many people compete in the same arena
• Small fish in a big pond
• Finance: Wall Street
• Business: Samsung, Apple
THE SOLUTION
• Where no one is, is the best place for you to succeed
• Wall Street: Don’t go to MS, go to the boutique firm next
door
• Business: A smartphone for 3rd world toxin detection in
water systems
• AKA new data sources
REAL LIFE EXAMPLE
• EVERYONE uses wall street analyst earnings estimates
• Wall Street vs. Crowd (Estimize)
• Estimize crowd predictions are 67% more accurate
• Algorithm perform 2x better on Crowd vs. Wall Street
WALLSTREET
CROWDSOURCED
AVOIDING THE TWITTER LEAK
• April 28, 2015 Twitter earnings were leaked
• 20% drop in market cap
• $5 billion drop in market cap
• Accern (News Sentiment) analyzed +100,000 articles
• 1 day before leak, Accern shorted Twitter
• SunTrust downgraded Twitter -> Strong Negative
sentiment
AVOIDING THE TWITTER LEAK
HOW YOU CAN DO THIS
TOO
• Kimono Labs
• API for scraping websites
• Quandl
• Thousands and thousands of datasets
• Social Media
• CO Everywhere, Ground Signal
• Location based social media information
CONCLUSION
• Think breadth and diversity
• Email: slee@quantopian.com

More Related Content

PDF
Breadth vs. Depth
PDF
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...
PDF
Structuring reading for breadth, depth and lifelong learning to support the c...
PDF
Lee Hecht Harrison Startup
PDF
SMU Starting a Business | Social Media | Digital Strategy | Startup
PPTX
Univeristy of illinois finance presentation
PPTX
Changes in Venture Capital + Building 500 Startups (Istanbul, Sept 2013)
PDF
Digitalization and Innovation - Today and Tomorrow
Breadth vs. Depth
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...
Structuring reading for breadth, depth and lifelong learning to support the c...
Lee Hecht Harrison Startup
SMU Starting a Business | Social Media | Digital Strategy | Startup
Univeristy of illinois finance presentation
Changes in Venture Capital + Building 500 Startups (Istanbul, Sept 2013)
Digitalization and Innovation - Today and Tomorrow

Similar to Think Breadth, Not Depth (20)

PDF
Social Media Simplified: The Benefits and Getting Started
PDF
Event Marketing Strategies - 8 Social Media Tactics to Ignite Ticket Sales
PPTX
Different types of startups, markets and whys
PPT
Lean Startup Challenge Boston
PDF
Sustaining open data innovations
PDF
Disruption, Collaboration & Love
PDF
Alternative Sources of Funding - Entrepreneurship 101
PDF
Ten Inventions that will Revolutionize Retail
PPTX
Alternative Business Models: open-source, crowd funding and tokenisation
PDF
Changes in Venture Capital & Building 500 Startups (Sao Paulo, Sept 2013)
PPTX
Good Idea, Bad Startup (UCLA ECON 106E)
PPTX
Market Study workshop for Startup Weekend Liege
PPTX
Tcm concept discovery stage introduction
PDF
Product Market Fit
PDF
Building 500 Startups: #500STRONG
PDF
Ver 1.10 the venture capital ecosystem feb 2015
PDF
Level Seven - Big data in interactive marketing
PPTX
Startup Next Seattle - Product Market Fit by Joanna Lord
PDF
From an idea to a Startup
PDF
Investing in Tech Startups & Building Startup Ecosystems (PreMoney Miami, Mar...
Social Media Simplified: The Benefits and Getting Started
Event Marketing Strategies - 8 Social Media Tactics to Ignite Ticket Sales
Different types of startups, markets and whys
Lean Startup Challenge Boston
Sustaining open data innovations
Disruption, Collaboration & Love
Alternative Sources of Funding - Entrepreneurship 101
Ten Inventions that will Revolutionize Retail
Alternative Business Models: open-source, crowd funding and tokenisation
Changes in Venture Capital & Building 500 Startups (Sao Paulo, Sept 2013)
Good Idea, Bad Startup (UCLA ECON 106E)
Market Study workshop for Startup Weekend Liege
Tcm concept discovery stage introduction
Product Market Fit
Building 500 Startups: #500STRONG
Ver 1.10 the venture capital ecosystem feb 2015
Level Seven - Big data in interactive marketing
Startup Next Seattle - Product Market Fit by Joanna Lord
From an idea to a Startup
Investing in Tech Startups & Building Startup Ecosystems (PreMoney Miami, Mar...
Ad

More from odsc (20)

PPT
Understanding the Chief Data Officer
PPTX
Machine-In-The-Loop for Knowledge Discovery
PPT
API Driven Development
PPTX
Mobile technology Usage by Humanitarian Programs: A Metadata Analysis
PPTX
Productionizing Deep Learning From the Ground Up
PPT
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive
PPT
Data Science at Dow Jones: Monetizing Data, News and Information
PDF
Spark, Python and Parquet
PPTX
Building a Predictive Analytics Solution with Azure ML
PPT
Beyond Names
PPT
How Woman are Conquering the S&P 500
PPTX
Domain Expertise and Unstructured Data
PPTX
Kaggle The Home of Data Science
PPT
Open Source Tools & Data Science Competitions
PPT
Machine Learning with scikit-learn
PPT
Bridging the Gap Between Data and Insight using Open-Source Tools
PDF
Top 10 Signs of the Textpocalypse
PPTX
The Art of Data Science
PPTX
Frontiers of Open Data Science Research
PPTX
Feature Engineering
Understanding the Chief Data Officer
Machine-In-The-Loop for Knowledge Discovery
API Driven Development
Mobile technology Usage by Humanitarian Programs: A Metadata Analysis
Productionizing Deep Learning From the Ground Up
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive
Data Science at Dow Jones: Monetizing Data, News and Information
Spark, Python and Parquet
Building a Predictive Analytics Solution with Azure ML
Beyond Names
How Woman are Conquering the S&P 500
Domain Expertise and Unstructured Data
Kaggle The Home of Data Science
Open Source Tools & Data Science Competitions
Machine Learning with scikit-learn
Bridging the Gap Between Data and Insight using Open-Source Tools
Top 10 Signs of the Textpocalypse
The Art of Data Science
Frontiers of Open Data Science Research
Feature Engineering
Ad

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
A Presentation on Artificial Intelligence
PDF
KodekX | Application Modernization Development
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Machine learning based COVID-19 study performance prediction
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
NewMind AI Monthly Chronicles - July 2025
Mobile App Security Testing_ A Comprehensive Guide.pdf
Spectral efficient network and resource selection model in 5G networks
A Presentation on Artificial Intelligence
KodekX | Application Modernization Development
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
Diabetes mellitus diagnosis method based random forest with bat algorithm
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Reach Out and Touch Someone: Haptics and Empathic Computing
Machine learning based COVID-19 study performance prediction
Per capita expenditure prediction using model stacking based on satellite ima...
Network Security Unit 5.pdf for BCA BBA.
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
20250228 LYD VKU AI Blended-Learning.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Unlocking AI with Model Context Protocol (MCP)
CIFDAQ's Market Insight: SEC Turns Pro Crypto
“AI and Expert System Decision Support & Business Intelligence Systems”
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
NewMind AI Monthly Chronicles - July 2025

Think Breadth, Not Depth

  • 1. THINK BREADTH, NOT DEPTH Seong Lee O P E N D A T A S C I E N C E C O N F E R E N C E_ BOSTON 2015 @opendatasci
  • 2. THINK BREADTH, NOT DEPTH HOW NEW DATA GENERATES NEW ALPHA
  • 3. OUR AGENDA • My Story With Quantitative Investing • The Problem Of Small Fish Big Pond • The Solution: Big Fish Small Pond • How This Applies To Quantitative Finance • How You Can Do This Too • Conclusion
  • 4. MY STORY • I work with algorithms, clients, and ideas • My job is to combine all three into one finished product • Everyone has different ideas but wants the same thing • Alpha but everyone uses the same data • Wall Street Analyst Estimates – PEAD • Build something that beats the market?
  • 5. THE PROBLEM • Too many people compete in the same arena • Small fish in a big pond • Finance: Wall Street • Business: Samsung, Apple
  • 6. THE SOLUTION • Where no one is, is the best place for you to succeed • Wall Street: Don’t go to MS, go to the boutique firm next door • Business: A smartphone for 3rd world toxin detection in water systems • AKA new data sources
  • 7. REAL LIFE EXAMPLE • EVERYONE uses wall street analyst earnings estimates • Wall Street vs. Crowd (Estimize) • Estimize crowd predictions are 67% more accurate • Algorithm perform 2x better on Crowd vs. Wall Street
  • 10. AVOIDING THE TWITTER LEAK • April 28, 2015 Twitter earnings were leaked • 20% drop in market cap • $5 billion drop in market cap • Accern (News Sentiment) analyzed +100,000 articles • 1 day before leak, Accern shorted Twitter • SunTrust downgraded Twitter -> Strong Negative sentiment
  • 12. HOW YOU CAN DO THIS TOO • Kimono Labs • API for scraping websites • Quandl • Thousands and thousands of datasets • Social Media • CO Everywhere, Ground Signal • Location based social media information
  • 13. CONCLUSION • Think breadth and diversity • Email: slee@quantopian.com

Editor's Notes

  • #5: And from working with all of these different clients, I’ve realized that everyone has different ideas. Most of these different ideas are generally small tweaks on existing methods The question is not a yes/no answer, but what will give me the highest probability of beating JP Morgan? Morgan Stanley? Renaissance Establish the PEAD first An extremely well documented phenomon that happens on the earnings announcement date So a companuy like apple will announe earnings about four times a year When I first started my job at Quantopian, Jess Stauth, our VP of Quant Strategy, came to me with the challenge of writing a PEAD algorithm
  • #6: Too many people are competing in the same arena Take Wall Street -> Boston College graduate competing against Harvard Graduate Take Business -> HTC or Nokia competing against AAPL and Samsung Finally, something that actually relevant -> let’s look at a bar If you’re a small business, you need to compete in a different arena
  • #7: The world of quantitative finance, there are two different ways to compete in this arena: mathematics, data Does it seem weird that we have the solution in different slides -> can we make them all congruent? E.g. put this slide towards the end where wall street, business, and life are afterwards?
  • #8: Explain WHY the crowd is unconventional Sticking point – Need to talk about the algorithm itself briefly? Looking at an earnings surprise, buy/sell and hold position for 5 days.
  • #13: This should be extremely specific for different types of life situations