SlideShare a Scribd company logo
Query-time Nonparametric
Regression with Temporally
Bounded Models
Gus Heck & David Smiley
#Activate18 #ActivateSearch
Agenda
• Presenters
• Query time Nonparametric Regression
• Demo – Suggesting tagged images
• Time Routed Aliases
• Demo – Creating a Time Routed Alias
Presenters
Patrick (Gus) Heck
• Solr Contributor since 2013
• Consulting since 2012
• Enterprise Search since
2010
• Apache Ant contributor
2003-2004
• Web Applications since 2003
David Smiley
• Lucene/Solr Committer
(PMC)
• Consulting
• Author of first book on Solr
• Presentations & Training
In the beginning..
Dave Mackey - Vision for an online coaching
platform leveraging machine learning
technology to help companies help their
middle managers.
Engaged me first as Chief Architect and later
as CTO to bring this vision to reality.
It worked, but was not funded...
Brief Overview of Simply Coached
Simply Coached provided online career coaching
system
Key Goal was connecting users with relevant
curated content
Relevance was determined by
• Identifying topics the user is interested in by
observing and learning from click-throughs
• Several other customized metrics
Learning What Interests the User
Goals:
• Suggest things of interest, things the user is willing
to learn
• Make predictions based on the user’s behavior
• Unique prediction per user
• Predictions based on entire user base
• Avoid calcification of the predictive model, old data
needs to be sunset regularly.
Modeling Candidate - Neural Nets
Simply Coached started right at the dawn of
the “3rd wave” of neural networks
Limitations of Neural Networks
• They don’t adapt well to a changing
problem
• We would need to retrain predictive
network regularly
• Require massive data for training
Modeling Candidate - Regression
Not as “cool” as deep learning, but more suitable
• Can converge faster with less data
• But, most techniques are assuming that the
residuals are normally distributed
• Many of our features will be binary
• Normality is right out the window
I decided to look for a Non-Parametric Regression
technique
Non-Parametric Multiplicative Regression
• Can predict continuous variable
• Good for sorting/ranking
• Kernel smoother, presence/absence
kernels already well known
• Non-parametric
Nice discrete sub-calculations...
Used in Habitat Ecology for predicting habitat suitability
https://ir.library.oregonstate.edu/concern/defaults/z029p910z
NPMR Equations
Performance across multiple factors (features)
Local Mean estimator
Example kernels continuous or
presence/absence:
Key realization: the products and sums for i,j can
pre precalculated:
I called these pre-calculated portions “Partials” and
added temporal metadata
But wait... What are we predicting?
Needed a continuous statistic to predict
Thesis: Given a good teaser etc., the user
will click more rapidly on more interesting
articles.
Suggest documents in order of predicted
reaction time
Demo
Three users:
• Sporty Black likes black sports cars
• Cooper Blue likes blue coupes
• Racy Yeller likes yellow formula cars
System was trained by presenting ~20 sets of 9
randomly selected cars and clicking with
varying speed
Images copyright Andrew Mauldin, used with permission
Streaming
Expression!
As mentioned,
NPMR prediction
was only part of the
overall product
It occupies the top
line of Boxes A1 and
B1
The actual demo expression
Streaming Expressions
The final sorted list of documents can be calculated
The bold portion is the final summation
Continuously running, recalculating
sums/products once a minute
Included traditional document indexing
(article scanner) and also streaming
expression update()
(send_per_user_sum_update)
By end, running continuously for months
at a time unattended.
http://www.jesterj.org for more info about
JesterJ
Indexing With JesterJ
Adapting and Scaling
• Adapt at query time with filters, could also
filter in other ways with additional metadata
• Four Dimensions:
1. Content to recommend - constant set
2. Users - per user pricing to the rescue
3. Activity - price for average activity level
4. Time - data accumulates over time
Time Routed Aliases are a solution to #4
Time Routed Aliases
David Smiley
Have a Lot of Timestamped Data?
And you need keyword search and/or analytics
(faceting/aggregations) capabilities.
Examples:
• Logs
• Sensor data (IoT)
• Social media posts
Characteristics: tons of docs, continuously flowing in, limited
retention
Strategies
Hash Partitioned:
• One collection, hash routed shards (built-in)
Time Partitioned:
• One collection, time routed shards (DIY)
• Time partitioned collections (DIY)
• Time partitioned collections via TRAs (built-in)
“Partitioned” = “Routed” = “Organized”
One Collection, Hash Routed Shards
Hash on ID, even distribution
router.name=compositeId
+ Easy (default)
+ High write throughput
- Deleted dead-weight
- Poor realtime search
- Queries execute everywhere
- Inflexible sizing
- … thus Expensive (uniform
hardware requirements)
Deleted docs
Live docs
See DocExpirationUpdateProcessorFactory
shard1 shard4 shard7
2 5 8
9
Time Partitioned Data
Generally better...
• New data always goes to most recent partition
• Variable or equal sized (depends on approach)
• Addresses all negatives of hash routing
• Write throughput can be addressed by sharding within partition
• Opportunities to “optimize” aged indexes
• Flexibility in assigning partitions to better or cheaper hardware
• … all leads to cost savings
How?...
2017-01 2017-02 2017-03 2017-04
2017-05
2017-06 2017-07 2017-08
2017-09
Time Partitioned Data
Implementation Strategies:
• One Collection, router.name=implicit
• See this sample code: (DIY) https://github.com/cga-harvard/hhypermap-
bop/tree/master/bop-core/solr-
plugins/src/main/java/edu/harvard/gis/hhypermap/bop/solrplugins by me
• Multiple Collections
• See this blog: (DIY)
http://blog.cloudera.com/blog/2013/10/collection-aliasing-near-real-time-search-for-
really-big-data/ by Mark Miller
• TRAs… (built-in)
2017-01 2017-02 2017-03 2017-04
2017-05
2017-06 2017-07 2017-08
2017-09
About Collection Aliases
SolrCloud supports collection aliases
Aliases point to one or more collections
Ex: Alias “tra-demo” → 2017-09, 2017-08, 2017-07, ...
2017-01 2017-02 2017-03 2017-04
2017-05
2017-06 2017-07 2017-08
2017-09
Time Routed Aliases
Aliases have new tricks up their sleeves…
• Aliases now have metadata (API to read & edit)
• Can create a “time routed alias” w/ first collection
• Collections in a TRA can
• Route update requests to the correct collections
• Adds/deletes collections automatically
• data driven
• just-in-time or preemptively
Mutable configuration, mostly
New
in Solr 7.3
TRA Creation (using V2 API)
curl http://localhost:8983/api/c -H
'Content-type:application/json' -d '{
"create-alias":{
"name": "tra-demo",
"router": {
"name": "time",
"field": "evt_dt",
"start": "2018-01-01T00:00:00Z",
"interval": "+1DAY",
"autoDeleteAge": "-2DAY"
},
"create-collection": {
"config": "_default",
"numShards": 2,
"maxShardsPerNode": 2
}
}
}'
TRAs, the fine print
TODOs
• Size capped (e.g. 5M docs / collection)
• Query routing to subset of collections
• Better “auto-scaling” tie-ins
• “optimize” of older collections
• TRA deletion, ease of use
TRAs may not be for everyone
• Partitioning is strict and must be adjacent (no gaps)
• Doesn’t work with CDCR
Thank you!
Gus Heck & David Smiley
#Activate18 #ActivateSearch

More Related Content

PPTX
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...
PPTX
Vectors in Search - Towards More Semantic Matching
PPTX
AI from your data lake: Using Solr for analytics
PPTX
H2O World - Clustering & Feature Extraction on Text - Seth Redmore
PDF
Neo4j in Depth
PPTX
Big Data Analytics 2: Leveraging Customer Behavior to Enhance Relevancy in Pe...
PPTX
Neo4j - graph database for recommendations
PPTX
Using Neo4j from Java
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...
Vectors in Search - Towards More Semantic Matching
AI from your data lake: Using Solr for analytics
H2O World - Clustering & Feature Extraction on Text - Seth Redmore
Neo4j in Depth
Big Data Analytics 2: Leveraging Customer Behavior to Enhance Relevancy in Pe...
Neo4j - graph database for recommendations
Using Neo4j from Java

What's hot (20)

PDF
20141216 graph database prototyping ams meetup
PDF
Neo4j Training Cypher
PDF
Intro to graphs for HR analytics
PPTX
Data Day Seattle 2015: Sarah Guido
PDF
Getting started with Graph Databases & Neo4j
PDF
Bootstrapping Recommendations with Neo4j
PDF
Intro to Cypher
PPTX
The openCypher Project - An Open Graph Query Language
PDF
Neo4j Fundamentals
PDF
Exploring Direct Concept Search - Steve Rowe, Lucidworks
PPTX
Building Recommendation Platforms with Hadoop
PDF
Data modeling with neo4j tutorial
PDF
Solr for Data Science
PPT
Neo4J : Introduction to Graph Database
PPTX
Introducing Neo4j graph database
PPTX
Windy City DB - Recommendation Engine with Neo4j
PPTX
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
PDF
Using graphs for recommendations
PDF
Sparking Science up with Research Recommendations
PDF
How Graph Databases efficiently store, manage and query connected data at s...
20141216 graph database prototyping ams meetup
Neo4j Training Cypher
Intro to graphs for HR analytics
Data Day Seattle 2015: Sarah Guido
Getting started with Graph Databases & Neo4j
Bootstrapping Recommendations with Neo4j
Intro to Cypher
The openCypher Project - An Open Graph Query Language
Neo4j Fundamentals
Exploring Direct Concept Search - Steve Rowe, Lucidworks
Building Recommendation Platforms with Hadoop
Data modeling with neo4j tutorial
Solr for Data Science
Neo4J : Introduction to Graph Database
Introducing Neo4j graph database
Windy City DB - Recommendation Engine with Neo4j
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Using graphs for recommendations
Sparking Science up with Research Recommendations
How Graph Databases efficiently store, manage and query connected data at s...
Ad

Similar to Query-time Nonparametric Regression with Temporally Bounded Models - Patrick Heck, Needham Software & David Smiley, D W Smiley LLC (20)

PPTX
Interactive Analytics in Human Time
PPT
Realtime search at Yammer
PPT
Real Time Search at Yammer
PPT
Real-time Search at Yammer - By Aleksandrovsky Boris
PPTX
June 2014 HUG: Interactive analytics over hadoop
PPTX
OrientDB - Time Series and Event Sequences - Codemotion Milan 2014
PPTX
Il tempo vola: rappresentare e manipolare sequenze di eventi e time series co...
PDF
Time Series With OrientDB - Fosdem 2015
PDF
Efficient Similarity Computation for Collaborative Filtering in Dynamic Envir...
PDF
Reflected intelligence evolving self-learning data systems
PPTX
Approaching real-time-hadoop
PPTX
Summit EU Machine Learning
PPT
Mr bi amrp
PDF
SDSC18 and DSATL Meetup March 2018
PDF
Practical Data Science Workshop - Recommendation Systems - Collaborative Filt...
PPTX
Austin Scales- Clickstream Analytics at Bazaarvoice
PDF
Advanced Data Modeling with Apache Cassandra
PDF
Scalable Time-Versioning Support for Property Graph Databases
PDF
MongoDB for Analytics
KEY
rorosyd - Test Driven Search Development
Interactive Analytics in Human Time
Realtime search at Yammer
Real Time Search at Yammer
Real-time Search at Yammer - By Aleksandrovsky Boris
June 2014 HUG: Interactive analytics over hadoop
OrientDB - Time Series and Event Sequences - Codemotion Milan 2014
Il tempo vola: rappresentare e manipolare sequenze di eventi e time series co...
Time Series With OrientDB - Fosdem 2015
Efficient Similarity Computation for Collaborative Filtering in Dynamic Envir...
Reflected intelligence evolving self-learning data systems
Approaching real-time-hadoop
Summit EU Machine Learning
Mr bi amrp
SDSC18 and DSATL Meetup March 2018
Practical Data Science Workshop - Recommendation Systems - Collaborative Filt...
Austin Scales- Clickstream Analytics at Bazaarvoice
Advanced Data Modeling with Apache Cassandra
Scalable Time-Versioning Support for Property Graph Databases
MongoDB for Analytics
rorosyd - Test Driven Search Development
Ad

More from Lucidworks (20)

PDF
Search is the Tip of the Spear for Your B2B eCommerce Strategy
PDF
Drive Agent Effectiveness in Salesforce
PPTX
How Crate & Barrel Connects Shoppers with Relevant Products
PPTX
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
PPTX
Connected Experiences Are Personalized Experiences
PDF
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
PPTX
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
PPTX
Preparing for Peak in Ecommerce | eTail Asia 2020
PPTX
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
PPTX
AI-Powered Linguistics and Search with Fusion and Rosette
PDF
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
PPTX
Webinar: Smart answers for employee and customer support after covid 19 - Europe
PDF
Smart Answers for Employee and Customer Support After COVID-19
PPTX
Applying AI & Search in Europe - featuring 451 Research
PPTX
Webinar: Accelerate Data Science with Fusion 5.1
PDF
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
PPTX
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
PPTX
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
PPTX
Webinar: Building a Business Case for Enterprise Search
PPTX
Why Insight Engines Matter in 2020 and Beyond
Search is the Tip of the Spear for Your B2B eCommerce Strategy
Drive Agent Effectiveness in Salesforce
How Crate & Barrel Connects Shoppers with Relevant Products
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
Connected Experiences Are Personalized Experiences
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
Preparing for Peak in Ecommerce | eTail Asia 2020
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
AI-Powered Linguistics and Search with Fusion and Rosette
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
Webinar: Smart answers for employee and customer support after covid 19 - Europe
Smart Answers for Employee and Customer Support After COVID-19
Applying AI & Search in Europe - featuring 451 Research
Webinar: Accelerate Data Science with Fusion 5.1
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
Webinar: Building a Business Case for Enterprise Search
Why Insight Engines Matter in 2020 and Beyond

Recently uploaded (20)

PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPT
Teaching material agriculture food technology
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
cuic standard and advanced reporting.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
KodekX | Application Modernization Development
PDF
Encapsulation theory and applications.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Electronic commerce courselecture one. Pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Modernizing your data center with Dell and AMD
PPTX
MYSQL Presentation for SQL database connectivity
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Teaching material agriculture food technology
The Rise and Fall of 3GPP – Time for a Sabbatical?
CIFDAQ's Market Insight: SEC Turns Pro Crypto
cuic standard and advanced reporting.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation_ Review paper, used for researhc scholars
Advanced methodologies resolving dimensionality complications for autism neur...
Mobile App Security Testing_ A Comprehensive Guide.pdf
KodekX | Application Modernization Development
Encapsulation theory and applications.pdf
Spectral efficient network and resource selection model in 5G networks
Unlocking AI with Model Context Protocol (MCP)
Electronic commerce courselecture one. Pdf
Machine learning based COVID-19 study performance prediction
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Modernizing your data center with Dell and AMD
MYSQL Presentation for SQL database connectivity
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Query-time Nonparametric Regression with Temporally Bounded Models - Patrick Heck, Needham Software & David Smiley, D W Smiley LLC

  • 1. Query-time Nonparametric Regression with Temporally Bounded Models Gus Heck & David Smiley #Activate18 #ActivateSearch
  • 2. Agenda • Presenters • Query time Nonparametric Regression • Demo – Suggesting tagged images • Time Routed Aliases • Demo – Creating a Time Routed Alias
  • 3. Presenters Patrick (Gus) Heck • Solr Contributor since 2013 • Consulting since 2012 • Enterprise Search since 2010 • Apache Ant contributor 2003-2004 • Web Applications since 2003 David Smiley • Lucene/Solr Committer (PMC) • Consulting • Author of first book on Solr • Presentations & Training
  • 4. In the beginning.. Dave Mackey - Vision for an online coaching platform leveraging machine learning technology to help companies help their middle managers. Engaged me first as Chief Architect and later as CTO to bring this vision to reality. It worked, but was not funded...
  • 5. Brief Overview of Simply Coached Simply Coached provided online career coaching system Key Goal was connecting users with relevant curated content Relevance was determined by • Identifying topics the user is interested in by observing and learning from click-throughs • Several other customized metrics
  • 6. Learning What Interests the User Goals: • Suggest things of interest, things the user is willing to learn • Make predictions based on the user’s behavior • Unique prediction per user • Predictions based on entire user base • Avoid calcification of the predictive model, old data needs to be sunset regularly.
  • 7. Modeling Candidate - Neural Nets Simply Coached started right at the dawn of the “3rd wave” of neural networks Limitations of Neural Networks • They don’t adapt well to a changing problem • We would need to retrain predictive network regularly • Require massive data for training
  • 8. Modeling Candidate - Regression Not as “cool” as deep learning, but more suitable • Can converge faster with less data • But, most techniques are assuming that the residuals are normally distributed • Many of our features will be binary • Normality is right out the window I decided to look for a Non-Parametric Regression technique
  • 9. Non-Parametric Multiplicative Regression • Can predict continuous variable • Good for sorting/ranking • Kernel smoother, presence/absence kernels already well known • Non-parametric Nice discrete sub-calculations... Used in Habitat Ecology for predicting habitat suitability https://ir.library.oregonstate.edu/concern/defaults/z029p910z
  • 10. NPMR Equations Performance across multiple factors (features) Local Mean estimator Example kernels continuous or presence/absence: Key realization: the products and sums for i,j can pre precalculated: I called these pre-calculated portions “Partials” and added temporal metadata
  • 11. But wait... What are we predicting? Needed a continuous statistic to predict Thesis: Given a good teaser etc., the user will click more rapidly on more interesting articles. Suggest documents in order of predicted reaction time
  • 12. Demo Three users: • Sporty Black likes black sports cars • Cooper Blue likes blue coupes • Racy Yeller likes yellow formula cars System was trained by presenting ~20 sets of 9 randomly selected cars and clicking with varying speed Images copyright Andrew Mauldin, used with permission
  • 13. Streaming Expression! As mentioned, NPMR prediction was only part of the overall product It occupies the top line of Boxes A1 and B1
  • 14. The actual demo expression
  • 15. Streaming Expressions The final sorted list of documents can be calculated The bold portion is the final summation
  • 16. Continuously running, recalculating sums/products once a minute Included traditional document indexing (article scanner) and also streaming expression update() (send_per_user_sum_update) By end, running continuously for months at a time unattended. http://www.jesterj.org for more info about JesterJ Indexing With JesterJ
  • 17. Adapting and Scaling • Adapt at query time with filters, could also filter in other ways with additional metadata • Four Dimensions: 1. Content to recommend - constant set 2. Users - per user pricing to the rescue 3. Activity - price for average activity level 4. Time - data accumulates over time Time Routed Aliases are a solution to #4
  • 19. Have a Lot of Timestamped Data? And you need keyword search and/or analytics (faceting/aggregations) capabilities. Examples: • Logs • Sensor data (IoT) • Social media posts Characteristics: tons of docs, continuously flowing in, limited retention
  • 20. Strategies Hash Partitioned: • One collection, hash routed shards (built-in) Time Partitioned: • One collection, time routed shards (DIY) • Time partitioned collections (DIY) • Time partitioned collections via TRAs (built-in) “Partitioned” = “Routed” = “Organized”
  • 21. One Collection, Hash Routed Shards Hash on ID, even distribution router.name=compositeId + Easy (default) + High write throughput - Deleted dead-weight - Poor realtime search - Queries execute everywhere - Inflexible sizing - … thus Expensive (uniform hardware requirements) Deleted docs Live docs See DocExpirationUpdateProcessorFactory shard1 shard4 shard7 2 5 8 9
  • 22. Time Partitioned Data Generally better... • New data always goes to most recent partition • Variable or equal sized (depends on approach) • Addresses all negatives of hash routing • Write throughput can be addressed by sharding within partition • Opportunities to “optimize” aged indexes • Flexibility in assigning partitions to better or cheaper hardware • … all leads to cost savings How?... 2017-01 2017-02 2017-03 2017-04 2017-05 2017-06 2017-07 2017-08 2017-09
  • 23. Time Partitioned Data Implementation Strategies: • One Collection, router.name=implicit • See this sample code: (DIY) https://github.com/cga-harvard/hhypermap- bop/tree/master/bop-core/solr- plugins/src/main/java/edu/harvard/gis/hhypermap/bop/solrplugins by me • Multiple Collections • See this blog: (DIY) http://blog.cloudera.com/blog/2013/10/collection-aliasing-near-real-time-search-for- really-big-data/ by Mark Miller • TRAs… (built-in) 2017-01 2017-02 2017-03 2017-04 2017-05 2017-06 2017-07 2017-08 2017-09
  • 24. About Collection Aliases SolrCloud supports collection aliases Aliases point to one or more collections Ex: Alias “tra-demo” → 2017-09, 2017-08, 2017-07, ... 2017-01 2017-02 2017-03 2017-04 2017-05 2017-06 2017-07 2017-08 2017-09
  • 25. Time Routed Aliases Aliases have new tricks up their sleeves… • Aliases now have metadata (API to read & edit) • Can create a “time routed alias” w/ first collection • Collections in a TRA can • Route update requests to the correct collections • Adds/deletes collections automatically • data driven • just-in-time or preemptively Mutable configuration, mostly New in Solr 7.3
  • 26. TRA Creation (using V2 API) curl http://localhost:8983/api/c -H 'Content-type:application/json' -d '{ "create-alias":{ "name": "tra-demo", "router": { "name": "time", "field": "evt_dt", "start": "2018-01-01T00:00:00Z", "interval": "+1DAY", "autoDeleteAge": "-2DAY" }, "create-collection": { "config": "_default", "numShards": 2, "maxShardsPerNode": 2 } } }'
  • 27. TRAs, the fine print TODOs • Size capped (e.g. 5M docs / collection) • Query routing to subset of collections • Better “auto-scaling” tie-ins • “optimize” of older collections • TRA deletion, ease of use TRAs may not be for everyone • Partitioning is strict and must be adjacent (no gaps) • Doesn’t work with CDCR
  • 28. Thank you! Gus Heck & David Smiley #Activate18 #ActivateSearch