SlideShare a Scribd company logo
Proximity Targeting
at Scale
Presented By :
Rohan Raj ( Lead Engineer, MIQ )
2
Agenda
Proximity Targeting
Tech Ecosystem,
Challenges &
Optimizations
Results
3
4
Activating Marketing Intelligence
through AiQ
AiQ is our technology that provides
modular, API-based analytics
services to rapidly build data solutions
for successful real-time business
outcomes.
As a result, we take a process that
might normally take a couple of
weeks and reduce time-to-value to a
couple of hours.
CONNECT
DISCOVER
ACTION
Onboard, unify and store any dataset,
making data organized, useful and
meaningful
Perform advanced analytics and process
datasets for insights & algorithmic
deployment
Export decisions to your
marketing technology platform
AiQ
5
Daily Scale
genda
80 Billion Ad
Impressions
5000+
Strategies
10+TB
Data
900,000
CPU mins
1000+
Campaigns
750 million
users
6
7
Proximity Targeting & Hyperlocal
8
11:00 am
Your Location
9:00 am
Coffee Shop
10:00 am
Competitor
Location 1
Customer Data - Without a Story
[random & disconnected]
6:30 pm
Gym
1:00 pm
Competitor
Location 2
9
11:00 am
Your Location
9:00 am
Coffee Shop
10:00 am
Competitor
Location 1
1:00 pm
Competitor
Location 2
6:30 pm
Gym
Coffee enthusiast and
maybe stays in the vicinity of
the coffee shop to pick it up
before he goes on with his
day
User visited the Client
location at 11am along with
a bunch of other competitor
locations, so looks like he was
actively shopping
Competitors who
offer similar
products like the
client who
customers also visit
Time when he visits the
gym if we wanted to
capture time to target
users to consume fitness
related products. He
travels 10km to get to the
gym from the coffee shop
SATURDAY
Customer Data - With a Story
10
Motion - Capabilities
User Journeys Audience Targeting Visit Trends : Unique, Repeat
Precise Uplift Rank Store (Footfall insights) Competitor conquesting
11
12
Big Data Processing Ecosystem
13
Major Technical Challenges how
we overcame these
14
Geocoding
OLC
● It’s designed to be used as a street
address. (similar to phone numbers)
● Not able to provide needed precision,
when radii around POI varies widely.
● Nearby places don’t have shared
prefixes.
● Looks Like this - 87G8Q257+5QP
Geohash
● Scalable and efficient for
programmatic usages and joins.
● Provides the needed precisions when
size of POIs or target physical stores
vary enormously.
● Hierarchical
● Nearby places have shared prefixes.
● Looks Like this - dr5ru7v
15
Managing Parallelism - Skew
cost is involved
16
Managing Parallelism - Overheads
17
Latency numbers (Humanized) ….
18
Optimizations- Joins, Shuffles & More
Registration Make Model Engine_size
AB12CDE Ford Fiesta 1.0
FG23HIJ Ford Fiesta 1.1
KL34MNO Ford Fiesta 1.5
PQ45RST Nissan Qashqai 1.6
UV56WXY Hyundai i20 1.4
ZA67BCD Ford Mondeo 2.0
Make Model Engine_size Price
Ford Fiesta 1.0 10110
Ford Fiesta 1.1 2500
Ford Fiesta 1.5 13653
Ford Fiesta 1.4 16700
Ford Fiesta 1.2 8965
Ford Fiesta 1.6 7320
Nissan Qashqai 1.5 14567
Nissan Qashqai 1.6 11432
Partition 1
Partition 2
Partition 3
Partition 4
...
A
B
19
❑ Shuffle partitions
○ spark.sql.shuffle.partitions = 200
❑ Broadcast joins
❑ Salting
❑ Databricks skew.hints df.hint("skew", "col1")
○ df.hint("skew", ["col1","col2"])
○ df.hint("skew", "col1", "value")
from pyspark.sql.functions import broadcast
result = broadcast(A).join(B,["join_col"],"left")
Optimizations- Joins, Shuffles & More
20
Cluster optimization & Persistence
● Spot Nodes
● Heterogeneous clusters
● EBS-autoscaling
21
Results
20K $ → 4.5K $
Cost
21 Hours →3 Hours
Time
30 M $ / yearly
Revenue generated
Cluster
optimizations
shuffle
partitions
geohashingskew & join
optimization
22
❑ https://medium.com/miq-tech-and-analytics/geohash-vs-open-location-codes-understanding-the-tw
o-geocoding-techniques-in-advertising-space-8002452201c8
❑ https://medium.com/miq-tech-and-analytics/understanding-customer-lifetime-value-in-retail-62eb62
f44994
❑ https://medium.com/miq-tech-and-analytics/decision-trees-optimizations-for-programmatic-media-buying-e
77e4410bfe8
❑ https://medium.com/miq-tech-and-analytics/meet-jarvis-79ad50f3ccb8
❑ https://medium.com/miq-tech-and-analytics/https-medium-com-nagaraj-mediaiq-how-we-use-s3-select-for-
schema-validation-and-filtering-data-at-miq-52cf036bf9be
Further Readings ...
23
Thank You
It is a long established fact that a reader will be distracted by the readable
content of a page when looking at its layout.

More Related Content

PDF
From the Road to the Field Go Mobile with SAP Hybris Service Cloud
PDF
Facilitate Operation Efficiency with SAP Hybris Cloud for Customer
PPTX
Hub16: Workforce planning at Tableau: Finding time compression and process ag...
PDF
ASUG Annual Conference BI Community Brochure
PDF
Amsterdam Data Portal
 
PDF
Mastering SAP Technologies Keynote - Simon Dale - Making Sense of the New SAP
PPTX
Ben Morgan - AGI Asset Management SIG (Sep 2013)
PDF
Two go by_sap_presentation-en
From the Road to the Field Go Mobile with SAP Hybris Service Cloud
Facilitate Operation Efficiency with SAP Hybris Cloud for Customer
Hub16: Workforce planning at Tableau: Finding time compression and process ag...
ASUG Annual Conference BI Community Brochure
Amsterdam Data Portal
 
Mastering SAP Technologies Keynote - Simon Dale - Making Sense of the New SAP
Ben Morgan - AGI Asset Management SIG (Sep 2013)
Two go by_sap_presentation-en

What's hot (20)

PDF
IoT on SAP HANA Cloud Platform
PDF
Connect SAP Cloud Platform with SAP Hybris Commerce
PDF
Effortless HVAC simulation using ApacheHVAC
PPTX
SAP Integration for non developers Teched 2017
PPT
PDF
How Productive is Your Organization?
PDF
Push the Customer Experience Beyond Expectations
PDF
6RS Overview RILA Conference 2018
PPTX
Overview Deploy in Cloud - Dynamic 365 FinOps
PDF
5 Critical elements in budgeting
PDF
Infographic- Accelerate your Innovation and growth with an OEM partnership
PDF
A Guide to a Successful SAP Hybris Commerce Cloud Project
PPTX
SM Energy and Akili Discuss How to Accelerate Your New Asset Assimilation
PPTX
Dish Supply Chain: Journey to Collaborative Planning
PDF
Embrace And Extend Your Analytics
PDF
SAP Services | SAP Services in Houston | SAP Services &Support
PPTX
Maximize SAP Ariba Solution ROI Through Optimized Governance, Compliance, and...
PDF
Productionizing Data Science at Experience
PDF
Nothing Is Too Big Or Too Complex For 360Suite — Unlock The Door To Your Upgr...
PDF
Unlock Smart Driving with SAP Hybris Solution Innovations
IoT on SAP HANA Cloud Platform
Connect SAP Cloud Platform with SAP Hybris Commerce
Effortless HVAC simulation using ApacheHVAC
SAP Integration for non developers Teched 2017
How Productive is Your Organization?
Push the Customer Experience Beyond Expectations
6RS Overview RILA Conference 2018
Overview Deploy in Cloud - Dynamic 365 FinOps
5 Critical elements in budgeting
Infographic- Accelerate your Innovation and growth with an OEM partnership
A Guide to a Successful SAP Hybris Commerce Cloud Project
SM Energy and Akili Discuss How to Accelerate Your New Asset Assimilation
Dish Supply Chain: Journey to Collaborative Planning
Embrace And Extend Your Analytics
SAP Services | SAP Services in Houston | SAP Services &Support
Maximize SAP Ariba Solution ROI Through Optimized Governance, Compliance, and...
Productionizing Data Science at Experience
Nothing Is Too Big Or Too Complex For 360Suite — Unlock The Door To Your Upgr...
Unlock Smart Driving with SAP Hybris Solution Innovations
Ad

Similar to Proximity Targeting at Scale using Big Data Platforms (20)

PDF
How to Convert Your SAP BusinessObjects Unused Licenses to SAP Analytics Cloud
PPTX
SAP Database Platform, ASE & IoT Roadmap
PPTX
SML_Presentation_SD5_MM_QM_PP_SD_FI_EXCE[1].pptx
PDF
Faurecia's roadmap to digitalization
PPTX
Anypoint DataGraph - Consume & Re-use your APIs faster | MuleSoft Mysore Meet...
PPTX
3 Benefits of Multi-Temperature Data Management for Data Analytics
PPTX
SAP’s Intelligent Enterprise Strategy
PDF
Inawsidom - Data Journey
PPTX
Sap Leonardo - what is it, and why would I want one?
PPT
SAP-ERP By Satya Kiran
PDF
KTern.AI-SAP-DXaaS-Workshop-PLAN
PDF
CWIN17 India / sap i dc - automations in the ad world - 27sep2017- anil lakhan
PPTX
DXC ESO for SAP Client Event presentation
PDF
SAP S4 HANA Cloud Solution Overview1.pdf
PDF
Russia CEE-SECR 2014 - Leveraging SAP Hana
PDF
Achieving Business Value by Fusing Hadoop and Corporate Data
PPTX
Strategic Procurement at BASF: Transforming Strategic Supplier Management at ...
PPTX
Hub16: VMware – Enabling business modeling and sales planning with Anaplan
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
Leading in Digital Manufacturing
How to Convert Your SAP BusinessObjects Unused Licenses to SAP Analytics Cloud
SAP Database Platform, ASE & IoT Roadmap
SML_Presentation_SD5_MM_QM_PP_SD_FI_EXCE[1].pptx
Faurecia's roadmap to digitalization
Anypoint DataGraph - Consume & Re-use your APIs faster | MuleSoft Mysore Meet...
3 Benefits of Multi-Temperature Data Management for Data Analytics
SAP’s Intelligent Enterprise Strategy
Inawsidom - Data Journey
Sap Leonardo - what is it, and why would I want one?
SAP-ERP By Satya Kiran
KTern.AI-SAP-DXaaS-Workshop-PLAN
CWIN17 India / sap i dc - automations in the ad world - 27sep2017- anil lakhan
DXC ESO for SAP Client Event presentation
SAP S4 HANA Cloud Solution Overview1.pdf
Russia CEE-SECR 2014 - Leveraging SAP Hana
Achieving Business Value by Fusing Hadoop and Corporate Data
Strategic Procurement at BASF: Transforming Strategic Supplier Management at ...
Hub16: VMware – Enabling business modeling and sales planning with Anaplan
Upgrade and Innovation Strategies for SAP ERP Customers
Leading in Digital Manufacturing
Ad

More from Tech Triveni (20)

PDF
UI Dev in Big data world using open source
PDF
Why should a Java programmer shifts towards Functional Programming Paradigm
PDF
Reactive - Is it really a Magic Pill?
PDF
Let’s go reactive with JAVA
PDF
Tackling Asynchrony with Kotlin Coroutines
PDF
Programmatic Ad Tracking: Let the power of Reactive Microservices do talking
PDF
Let's refine your Scala Code
PDF
Supercharged imperative programming with Haskell and Functional Programming
PDF
Observability at scale with Neural Networks: A more proactive approach
PDF
Semi-Supervised Insight Generation from Petabyte Scale Text Data
PDF
Finding the best solution for Image Processing
PDF
Effecting Pure Change - How anything ever gets done in functional programming...
PDF
Becoming a Functional Programmer - Harit Himanshu (Nomis Solutions)
PDF
Live coding session on AI / ML using Google Tensorflow (Python) - Tanmoy Deb ...
PDF
Distributing the SMACK stack - Kubernetes VS DCOS - Sahil Sawhney (Knoldus Inc.)
PDF
Blue Pill / Red Pill : The Matrix of thousands of data streams - Himanshu Gup...
PDF
UX in Big Data Analytics - Paramjit Jolly (Guavus)
PDF
Simplified Scala Monads And Transformation - Harmeet Singh (Knoldus Inc.)
PDF
Micro Frontends Architecture - Jitendra kumawat (Guavus)
PDF
Apache CarbonData+Spark to realize data convergence and Unified high performa...
UI Dev in Big data world using open source
Why should a Java programmer shifts towards Functional Programming Paradigm
Reactive - Is it really a Magic Pill?
Let’s go reactive with JAVA
Tackling Asynchrony with Kotlin Coroutines
Programmatic Ad Tracking: Let the power of Reactive Microservices do talking
Let's refine your Scala Code
Supercharged imperative programming with Haskell and Functional Programming
Observability at scale with Neural Networks: A more proactive approach
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Finding the best solution for Image Processing
Effecting Pure Change - How anything ever gets done in functional programming...
Becoming a Functional Programmer - Harit Himanshu (Nomis Solutions)
Live coding session on AI / ML using Google Tensorflow (Python) - Tanmoy Deb ...
Distributing the SMACK stack - Kubernetes VS DCOS - Sahil Sawhney (Knoldus Inc.)
Blue Pill / Red Pill : The Matrix of thousands of data streams - Himanshu Gup...
UX in Big Data Analytics - Paramjit Jolly (Guavus)
Simplified Scala Monads And Transformation - Harmeet Singh (Knoldus Inc.)
Micro Frontends Architecture - Jitendra kumawat (Guavus)
Apache CarbonData+Spark to realize data convergence and Unified high performa...

Recently uploaded (20)

PPTX
Spectroscopy.pptx food analysis technology
PDF
KodekX | Application Modernization Development
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPT
Teaching material agriculture food technology
PDF
cuic standard and advanced reporting.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Spectroscopy.pptx food analysis technology
KodekX | Application Modernization Development
Programs and apps: productivity, graphics, security and other tools
sap open course for s4hana steps from ECC to s4
Digital-Transformation-Roadmap-for-Companies.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation_ Review paper, used for researhc scholars
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Teaching material agriculture food technology
cuic standard and advanced reporting.pdf
Approach and Philosophy of On baking technology
Understanding_Digital_Forensics_Presentation.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Unlocking AI with Model Context Protocol (MCP)
MYSQL Presentation for SQL database connectivity
Reach Out and Touch Someone: Haptics and Empathic Computing
Profit Center Accounting in SAP S/4HANA, S4F28 Col11

Proximity Targeting at Scale using Big Data Platforms

  • 1. Proximity Targeting at Scale Presented By : Rohan Raj ( Lead Engineer, MIQ )
  • 3. 3
  • 4. 4 Activating Marketing Intelligence through AiQ AiQ is our technology that provides modular, API-based analytics services to rapidly build data solutions for successful real-time business outcomes. As a result, we take a process that might normally take a couple of weeks and reduce time-to-value to a couple of hours. CONNECT DISCOVER ACTION Onboard, unify and store any dataset, making data organized, useful and meaningful Perform advanced analytics and process datasets for insights & algorithmic deployment Export decisions to your marketing technology platform AiQ
  • 5. 5 Daily Scale genda 80 Billion Ad Impressions 5000+ Strategies 10+TB Data 900,000 CPU mins 1000+ Campaigns 750 million users
  • 6. 6
  • 8. 8 11:00 am Your Location 9:00 am Coffee Shop 10:00 am Competitor Location 1 Customer Data - Without a Story [random & disconnected] 6:30 pm Gym 1:00 pm Competitor Location 2
  • 9. 9 11:00 am Your Location 9:00 am Coffee Shop 10:00 am Competitor Location 1 1:00 pm Competitor Location 2 6:30 pm Gym Coffee enthusiast and maybe stays in the vicinity of the coffee shop to pick it up before he goes on with his day User visited the Client location at 11am along with a bunch of other competitor locations, so looks like he was actively shopping Competitors who offer similar products like the client who customers also visit Time when he visits the gym if we wanted to capture time to target users to consume fitness related products. He travels 10km to get to the gym from the coffee shop SATURDAY Customer Data - With a Story
  • 10. 10 Motion - Capabilities User Journeys Audience Targeting Visit Trends : Unique, Repeat Precise Uplift Rank Store (Footfall insights) Competitor conquesting
  • 11. 11
  • 13. 13 Major Technical Challenges how we overcame these
  • 14. 14 Geocoding OLC ● It’s designed to be used as a street address. (similar to phone numbers) ● Not able to provide needed precision, when radii around POI varies widely. ● Nearby places don’t have shared prefixes. ● Looks Like this - 87G8Q257+5QP Geohash ● Scalable and efficient for programmatic usages and joins. ● Provides the needed precisions when size of POIs or target physical stores vary enormously. ● Hierarchical ● Nearby places have shared prefixes. ● Looks Like this - dr5ru7v
  • 15. 15 Managing Parallelism - Skew cost is involved
  • 18. 18 Optimizations- Joins, Shuffles & More Registration Make Model Engine_size AB12CDE Ford Fiesta 1.0 FG23HIJ Ford Fiesta 1.1 KL34MNO Ford Fiesta 1.5 PQ45RST Nissan Qashqai 1.6 UV56WXY Hyundai i20 1.4 ZA67BCD Ford Mondeo 2.0 Make Model Engine_size Price Ford Fiesta 1.0 10110 Ford Fiesta 1.1 2500 Ford Fiesta 1.5 13653 Ford Fiesta 1.4 16700 Ford Fiesta 1.2 8965 Ford Fiesta 1.6 7320 Nissan Qashqai 1.5 14567 Nissan Qashqai 1.6 11432 Partition 1 Partition 2 Partition 3 Partition 4 ... A B
  • 19. 19 ❑ Shuffle partitions ○ spark.sql.shuffle.partitions = 200 ❑ Broadcast joins ❑ Salting ❑ Databricks skew.hints df.hint("skew", "col1") ○ df.hint("skew", ["col1","col2"]) ○ df.hint("skew", "col1", "value") from pyspark.sql.functions import broadcast result = broadcast(A).join(B,["join_col"],"left") Optimizations- Joins, Shuffles & More
  • 20. 20 Cluster optimization & Persistence ● Spot Nodes ● Heterogeneous clusters ● EBS-autoscaling
  • 21. 21 Results 20K $ → 4.5K $ Cost 21 Hours →3 Hours Time 30 M $ / yearly Revenue generated Cluster optimizations shuffle partitions geohashingskew & join optimization
  • 22. 22 ❑ https://medium.com/miq-tech-and-analytics/geohash-vs-open-location-codes-understanding-the-tw o-geocoding-techniques-in-advertising-space-8002452201c8 ❑ https://medium.com/miq-tech-and-analytics/understanding-customer-lifetime-value-in-retail-62eb62 f44994 ❑ https://medium.com/miq-tech-and-analytics/decision-trees-optimizations-for-programmatic-media-buying-e 77e4410bfe8 ❑ https://medium.com/miq-tech-and-analytics/meet-jarvis-79ad50f3ccb8 ❑ https://medium.com/miq-tech-and-analytics/https-medium-com-nagaraj-mediaiq-how-we-use-s3-select-for- schema-validation-and-filtering-data-at-miq-52cf036bf9be Further Readings ...
  • 23. 23 Thank You It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout.