SlideShare a Scribd company logo
Data Science Company 
DataScience for e-commerce 
Infofarm - Seminar 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
25/11/2014
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Agenda 
• About us 
• What is Data Science? 
e-commerce vs Data Science vs BigData 
• Example Data Science applications in e-commerce 
some inspiration to see your opportunities… 
• Applying Data Science 
how to get started with all this?
About us 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Speakers 
• Niels Trescinski 
e-commerce Consultant 
– Fenego (Intershop) 
– Elision (Hybris) 
• Günther Van Roey 
Technical (IT) Consultant 
– InfoFarm (BigData & Data Science) 
– XT-i (software development and integration) 
– PHPro (website development)
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
InfoFarm - Team 
• Mixed skills team 
– 2 Data Scientist 
• Mathematics 
• Statistics 
– 4 BigData Consultants 
– 1 Infra specialist 
– n Cronos colleagues 
with various background 
• Certifications 
– CCDH - Cloudera Certified Hadoop Developer 
– CCAD - Cloudera Certified Hadoop Administrator 
– OCJP – Oracle Certified Java Programmer
InfoFarm + Fenego & Elision – e-commerce! 
Highly focused on 
e-commerce 
Business Knowledge 
Highly focused on 
Data Science and 
Big Data 
Technical Knowledge 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Introduction: what is Data Science? 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
What is data science? 
• Data Scientist: “A person who is better at statistics than 
any software engineer and better at software 
engineering than any statistician” 
- Josh Wills 
• “Getting meaning from data” 
Finding patterns (data mining) 
• Complementing business 
knowledge with figures
Data Science & Big Data 
• Relevance for e-commerce - use data to: 
– Increment conversion 
– Increment operational efficiency 
– Understand your customers’ needs 
– Make better offers 
– Make better recommendations 
– … 
• Many successful online business thank their position to 
smart data usage: 
– Google was the first search engine that didn’t index by keyword 
– Amazon is the e-commerce leader thanks to BigData 
– NetFlix is a world leader in personalized recommendations 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Data Science & Big Data 
• Most of us don’t run a business like the ones referred to in 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
stereotypical Big Data cases 
• Big Data does not necessarily means or requires much data 
• Data Science is very affordable to companies of all sizes 
• Typical Data Science projects are 10’s of man-days of work
Data Science & Big Data 
• Non-structured data: weblogs, social media content, … 
• Secondary use of data sources is the key 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
– eg: Weblogs 
• Are there to log webserver activity 
• But can also tell you how people find, compare and choose products! 
– eg: ERP / Cash register software 
• Prints bills 
• But can also tell you what products are typically bought together in a shop 
• Many data is present, valuable information is hidden in it!
Topics not covered in this seminar 
• Very interesting topics that we will gladly 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
elaborate upon another time: 
– Statistical Tools (R, SPSS, …) 
– Mathematical models 
– Machine Learning Techniques (Clustering, Classification, …) 
– BigData Tools & Platforms (Hadoop, Spark, …) 
– Data processing tools (Pig, Hive, …)
Example Data Science applications: 
#1: Recommendations 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Recommendations – Why? How? 
– Why? 
• Attempt to cross-sell or up-sell 
• Provide customers with alternatives that might please them even more 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
– Traditional approach 
• No recommendations at all 
• Products in the same category 
• Manually managed cross-selling opportunities per product 
– Why are these approaches fundamentally flawed? 
• They all start from the seller perspective, not the customer! 
• “We know what you should be buying” 
• Manual recommendations are too costly and time-consuming to 
maintain – even impossible with large catalogs
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Recommendations 
– Online vs Offline 
• Main focus on online, but why? 
• Who knows best what products to recommend? 
• Learn from your data, don’t take decisions based on a feeling. 
– Time based recommendations 
• Recommend or cross sell different products depending on 
– season? 
– holiday? 
– weather? 
– Customer based recommendations 
• Learn from your customers and their past. 
• Android vs iOS smartphones.
Showing (too) 
similar products? 
No color alternatives? 
No glossy/matte 
alternatives? 
No product 
Recommendations – Traditional approach 
recommendations 
at all 
(Link to category 
without match with 
specific product) 
Which roller would be 
appropriate? 
No primer + paint 
combo? 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Recommendations – what does Amazon do? 
Cross-selling 
as realized with other 
(similar?) customers 
Starts from customer 
point of view! 
Recommendations 
based on perceived 
customer journeys 
Re-use the product 
comparisons that 
previous customers 
did! 
DATA 
DRIVEN! 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Recommendations – Other ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Data Science ideas 
– “x % of the people who looked at this item eventually bought product X or Y” 
– Get cross-selling information from ERP in the physical shops and let this feed the 
webshop recommendations! 
– Similar product in different price ranges 
(“best-buy alternative”, “deluxe alternative”) 
– ... 
• This is very achievable for a webshop of any size 
– Just generate ideas, and test to see what actually increases sales! 
• Secondary use of various kinds of non-structured data = BigData ! 
– Weblogs of e-commerce site (use to deduct customer journeys) 
– ERP info with bills and/or invoices (use to deduct cross-selling in physical shops) 
– Product information (product categorization, …)
Example Data Science applications: 
#2: Physical stores vs webshop 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Impact physical store on online? 
– Are online sales higher when physical store is nearby? 
– Where to open a new store? 
– How to approach your customers to motivate 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Impact physical shops - Why bother? 
• Determine strength of online brand vs physical brand 
– Is online sales driven by brand awareness? 
– Or is there quite a balance between the two? 
– Omni-channel strategy? 
• Know what would be the impact of opening/closing a 
physical shop, also on the online business 
– Support management decisions with facts & figures 
• Depends heavily on sector/product/case/… 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Impact physical shops - example 
• Analysis for a retailer: Physical shops vs online sales 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Impact physical shops - example 
• Impact of opening a physical shop on local online sales 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
(brand awareness?)
Impact physical shops – now what? 
• Use this correlation information: 
– As extra input for determining new shop locations 
– Publish folders focusing on online in non-covered areas 
– Use popup-stores to get brand awareness and drive online sales 
– Discounts per region 
– Google Adwords campaigns focusing on regions with limited 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
brand presence 
– Customer segmentation based on this information
Example Data Science applications: 
#3: Dynamic Pricing 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Dynamic prices 
– End of life products? 
– Relevancy of products. 
– (Local) competition. 
– Customer!
Dynamic Prices – some ideas 
• Auto-combination special offers based on cross-selling 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
info 
• Monitor stock & manage promotions accordingly 
– Example: stock of calendars in December 
(value decreases over time…) 
– Example: Customer history: needs incentive to buy? 
Why not give a small 
discount if bought 
together? 
Testing will show if 
and for which 
products and 
customers this 
increases revenue!
Dynamic Prices – some ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Pricing vs competition 
scraping competition websites 
• Analysis of tenders vs deals 
– What type of deals do we typically win, and which not? 
= Data mining on CRM data! 
– How can we optimize our chances to make a deal? 
Which tenders should we invest in? What offer should we make? 
• Remark: in B2C scenarios, can be difficult / unwanted to 
use dynamic prices. Mind the legal impact!
Example Data Science applications: 
#4: Personalized offerings 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Personalized offering 
– Loyal (online) customer vs new customers. 
– Browsing habits and patterns. 
– Spending patterns. 
– Personalized discounts and/or content?
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Personalized offerings 
• Customer should be central in the webshop 
– Provide a truly personalized shopping experience 
– Like high-end physical shops with personal approach to VIP 
customers 
• Gather data about your customer 
– Surfing history – what products where looked at? How long? … 
– What products were bought? When? 
– Brand preference? 
– Product-segment preference? (budget, high-end, best-buy?) 
– Abandoned shopping carts 
• Take action based on information mined from this data 
– Triggered e-mails, personal recommendations, …
Personalized offerings – some ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Imply social media 
– Are there any connections of our customer that wrote product 
recommendations that might convince him to buy? 
– Do we know the shopping behaviour of some of the customers’ 
connections? Are they in line with his/hers? Can we use this to 
make better recommendations? 
• Anticipate customer behaviour 
– Use all customer contact moments 
eg: if customer calls customer service, they should know what 
products the customer was looking at during his last visit to the 
webshop 
– Prediction model (surfing behaviour vs % deal making) 
eg: 
Low chance? Go to checkout immediately. 
High chance? Offer extra cross-selling opportunities
Example Data Science applications: 
#5: Gather external data 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Gather external data, zoom & magnify 
– Explore search trends within Google. 
– Detect what is hot on social media. 
– Magnify to the results and set clear goals/actions. 
– Take action! 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Gather and use external data 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• example: how to sell a Smartwatch? 
– It’s a new product, how to market it effectively? 
– eg: SEO in line with trending topics on twitter, facebook posts, … 
– eg: SEO in line with used search terms 
• Added value: combining external data sources with own data 
• Some ideas 
– Find and follow your contacts on LinkedIn 
previous/future employers of your contacts may be great prospects for 
your B2B business! 
– Use weather info to adapt the featured product offering 
Data Science exercise: do we find any correlation between the weather 
and the product sales figures?
Example Data Science applications: 
#6: Anticipatory shipping 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Anticipatory shipping 
– Patent pending by Amazon. 
– Ships an order before it is placed. 
– Order history, search, wish list and click behaviour!
Anticipatory shipping 
• High-tech? Actually not complex at all … 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Steps: 
– Gather many info on past orders 
(customer info, country, product info, price, product group, 
product combinations, time of day, season, …) 
– Build a prediction model predicting “cancelled or not” based on 
all this information 
– Assess the quality of the model by training it with 90% of your 
historical orders and testing it with 10% of your historical orders 
– Pass each new order’s info and predict the likelihood of this 
order getting cancelled (0 .. 100%) and act accordingly
Example Data Science applications: 
#7: Customer Service optimizations 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Customer service 
– Losing sales/conversion/money by poor customer service. 
– Optimize information for all communication channels. 
– Which issues are your customers concerned with? 
– Allocate resources better!
Customer Service – Some Ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Text mining 
– Mood analysis: detect negative messages on social media, forum, … 
Put TODO on action list of customer care to contact with certain priority 
– Auto-classification of e-mails, letters, messages: 
Is this e-mail a question or a complaint? 
Is it about the quality of the product or financial (wrong invoice, …)? 
Automatic routing of messages to the right person! (operational optimization) 
• Social media 
– Social media status of customer (scoring based on profile) 
What’s would be the impact of this customer being unhappy about our service? 
• Omnichannel insights 
– What did this customer buy of look at? 
– How did he rate the last bought products? 
– Which contacts (mail, phone, …) did we have and what seems to be the most 
effective deal trigger?
Applying Data Science 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Applying Data Science 
• Data Science does not replace business knowledge 
– Need to find balance between the two 
– Confirm or deny assumed business knowledge 
– Detect changing trends early (customer behaviour, …) 
• Not a development cycle, rather exploratory process: 
– Formulate hypotheses 
– Data mining and modeling 
– A/B testing (test new idea on x % of your customers/products/…) 
– Conclusions: did the test group show better conversion? 
– Rollout or cancel and start over! 
• Potential issues 
– Privacy law and other legal restrictions 
– Feedback loops, information leakage, wrong assumptions 
eg: trying to gather customer preferences when an order could as well have 
been a gift to someone else (perfume, …)
Questions? 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye

More Related Content

PPTX
Applied Data Science for E-Commerce
PPTX
Big Data Analytics with Hadoop
PPTX
3 pillars of big data : structured data, semi structured data and unstructure...
PPTX
bigquery.pptx
PPTX
Big Data
PPTX
Business Intelligence
DOCX
Mcq tableau.docx
PDF
Disaster Recovery for Multi-Region Apache Kafka Ecosystems at Uber
Applied Data Science for E-Commerce
Big Data Analytics with Hadoop
3 pillars of big data : structured data, semi structured data and unstructure...
bigquery.pptx
Big Data
Business Intelligence
Mcq tableau.docx
Disaster Recovery for Multi-Region Apache Kafka Ecosystems at Uber

What's hot (20)

PDF
Spark DataFrames and ML Pipelines
PDF
Introduction To Data Science
PPTX
Presentation on Big Data
PPTX
Data visualization
PDF
Tools and techniques for data science
PDF
Introduction to Data Science and Analytics
PPTX
What is big data?
PDF
Big Data: Its Characteristics And Architecture Capabilities
PPTX
Tableau slideshare
PPTX
Big data-ppt
PPTX
Introduction to Data Visualization
PPTX
PPT on Data Science Using Python
PPTX
Introduction of data science
PPTX
Big Data Analytics
PPTX
Big Data PPT by Rohit Dubey
PDF
Introduction to Graph Databases
PDF
Overview of recommender system
PPTX
Centralised and distributed database
PPTX
Big data analytics
Spark DataFrames and ML Pipelines
Introduction To Data Science
Presentation on Big Data
Data visualization
Tools and techniques for data science
Introduction to Data Science and Analytics
What is big data?
Big Data: Its Characteristics And Architecture Capabilities
Tableau slideshare
Big data-ppt
Introduction to Data Visualization
PPT on Data Science Using Python
Introduction of data science
Big Data Analytics
Big Data PPT by Rohit Dubey
Introduction to Graph Databases
Overview of recommender system
Centralised and distributed database
Big data analytics
Ad

Viewers also liked (20)

PPTX
Data mining with Google analytics
PDF
Data Science and Machine Learning for eCommerce and Retail
PDF
Applying machine learning to product categorization
PDF
Magento scalability from the trenches (Meet Magento Sweden 2016)
PDF
How Data Science can increase Ecommerce profits
PDF
Surprising failure factors when implementing eCommerce and Omnichannel eBusiness
PDF
Omnichannel Customer Experience
PDF
Machine Learning in Ecommerce
DOCX
Boosting conversion rates on ecommerce using deep learning algorithms
PDF
E-commerce product classification with deep learning
PPTX
Digital analytics with R - Sydney Users of R Forum - May 2015
PDF
Marketing Analytics with R Lifting Campaign Success Rates
PPTX
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PPTX
Google Analytics Data Mining with R
PDF
Big Data in Retail - Examples in Action
PPTX
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerce
PPTX
Data Driven Decisions seminar
PPTX
Boosting big data with apache spark
PPTX
Big Data with Apache Hadoop
PPTX
SF Data Science: Developing Data Products
Data mining with Google analytics
Data Science and Machine Learning for eCommerce and Retail
Applying machine learning to product categorization
Magento scalability from the trenches (Meet Magento Sweden 2016)
How Data Science can increase Ecommerce profits
Surprising failure factors when implementing eCommerce and Omnichannel eBusiness
Omnichannel Customer Experience
Machine Learning in Ecommerce
Boosting conversion rates on ecommerce using deep learning algorithms
E-commerce product classification with deep learning
Digital analytics with R - Sydney Users of R Forum - May 2015
Marketing Analytics with R Lifting Campaign Success Rates
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Google Analytics Data Mining with R
Big Data in Retail - Examples in Action
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerce
Data Driven Decisions seminar
Boosting big data with apache spark
Big Data with Apache Hadoop
SF Data Science: Developing Data Products
Ad

Similar to Data Science for e-commerce (20)

PPTX
Harvesting business Value with Data Science
PPTX
Introduction to (Big) Data Science
PPTX
User-Centered Information Architecture for e-Commerce
PDF
User-Centered Information Architecture for e-commerce
PDF
Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...
PPTX
Schneider 1. E-Business Across World & Time.pptx
PPTX
Air Force Research Laboratory Small Business Hub Collider - Small Business Ma...
PPTX
6 steps to create a data-driven content strategy
PDF
Data Science for Digital Commerce
PPTX
Content Marketing For Building And Construction Products
PDF
Qhuba office depot schelle - seminar omni channel marketing 2014-11-25 brief
PDF
Online Industry - New Era
PDF
Big Data in e-Commerce
PDF
Business Intelligence Projects Portfolio
PPT
New Marketing for the New Economy - Kotler
PPT
Motarme NUI Galway Technology Marketing Seminar Dec 2012
PDF
Machine Learning for retail and ecommerce
PPT
Research retail software products
PPTX
AI in Retail
PDF
IDM Assignment revision certificate Nov '11
Harvesting business Value with Data Science
Introduction to (Big) Data Science
User-Centered Information Architecture for e-Commerce
User-Centered Information Architecture for e-commerce
Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...
Schneider 1. E-Business Across World & Time.pptx
Air Force Research Laboratory Small Business Hub Collider - Small Business Ma...
6 steps to create a data-driven content strategy
Data Science for Digital Commerce
Content Marketing For Building And Construction Products
Qhuba office depot schelle - seminar omni channel marketing 2014-11-25 brief
Online Industry - New Era
Big Data in e-Commerce
Business Intelligence Projects Portfolio
New Marketing for the New Economy - Kotler
Motarme NUI Galway Technology Marketing Seminar Dec 2012
Machine Learning for retail and ecommerce
Research retail software products
AI in Retail
IDM Assignment revision certificate Nov '11

Recently uploaded (20)

PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Business Acumen Training GuidePresentation.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to machine learning and Linear Models
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPT
Miokarditis (Inflamasi pada Otot Jantung)
Business Ppt On Nestle.pptx huunnnhhgfvu
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Business Acumen Training GuidePresentation.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Clinical guidelines as a resource for EBP(1).pdf
ISS -ESG Data flows What is ESG and HowHow
STUDY DESIGN details- Lt Col Maksud (21).pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
IB Computer Science - Internal Assessment.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Quality review (1)_presentation of this 21
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to machine learning and Linear Models
IBA_Chapter_11_Slides_Final_Accessible.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Miokarditis (Inflamasi pada Otot Jantung)

Data Science for e-commerce

  • 1. Data Science Company DataScience for e-commerce Infofarm - Seminar Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 25/11/2014
  • 2. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Agenda • About us • What is Data Science? e-commerce vs Data Science vs BigData • Example Data Science applications in e-commerce some inspiration to see your opportunities… • Applying Data Science how to get started with all this?
  • 3. About us Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 4. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Speakers • Niels Trescinski e-commerce Consultant – Fenego (Intershop) – Elision (Hybris) • Günther Van Roey Technical (IT) Consultant – InfoFarm (BigData & Data Science) – XT-i (software development and integration) – PHPro (website development)
  • 5. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be InfoFarm - Team • Mixed skills team – 2 Data Scientist • Mathematics • Statistics – 4 BigData Consultants – 1 Infra specialist – n Cronos colleagues with various background • Certifications – CCDH - Cloudera Certified Hadoop Developer – CCAD - Cloudera Certified Hadoop Administrator – OCJP – Oracle Certified Java Programmer
  • 6. InfoFarm + Fenego & Elision – e-commerce! Highly focused on e-commerce Business Knowledge Highly focused on Data Science and Big Data Technical Knowledge Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 7. Introduction: what is Data Science? Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 8. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be What is data science? • Data Scientist: “A person who is better at statistics than any software engineer and better at software engineering than any statistician” - Josh Wills • “Getting meaning from data” Finding patterns (data mining) • Complementing business knowledge with figures
  • 9. Data Science & Big Data • Relevance for e-commerce - use data to: – Increment conversion – Increment operational efficiency – Understand your customers’ needs – Make better offers – Make better recommendations – … • Many successful online business thank their position to smart data usage: – Google was the first search engine that didn’t index by keyword – Amazon is the e-commerce leader thanks to BigData – NetFlix is a world leader in personalized recommendations Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 10. Data Science & Big Data • Most of us don’t run a business like the ones referred to in Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be stereotypical Big Data cases • Big Data does not necessarily means or requires much data • Data Science is very affordable to companies of all sizes • Typical Data Science projects are 10’s of man-days of work
  • 11. Data Science & Big Data • Non-structured data: weblogs, social media content, … • Secondary use of data sources is the key Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be – eg: Weblogs • Are there to log webserver activity • But can also tell you how people find, compare and choose products! – eg: ERP / Cash register software • Prints bills • But can also tell you what products are typically bought together in a shop • Many data is present, valuable information is hidden in it!
  • 12. Topics not covered in this seminar • Very interesting topics that we will gladly Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be elaborate upon another time: – Statistical Tools (R, SPSS, …) – Mathematical models – Machine Learning Techniques (Clustering, Classification, …) – BigData Tools & Platforms (Hadoop, Spark, …) – Data processing tools (Pig, Hive, …)
  • 13. Example Data Science applications: #1: Recommendations Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 14. Recommendations – Why? How? – Why? • Attempt to cross-sell or up-sell • Provide customers with alternatives that might please them even more Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be – Traditional approach • No recommendations at all • Products in the same category • Manually managed cross-selling opportunities per product – Why are these approaches fundamentally flawed? • They all start from the seller perspective, not the customer! • “We know what you should be buying” • Manual recommendations are too costly and time-consuming to maintain – even impossible with large catalogs
  • 15. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Recommendations – Online vs Offline • Main focus on online, but why? • Who knows best what products to recommend? • Learn from your data, don’t take decisions based on a feeling. – Time based recommendations • Recommend or cross sell different products depending on – season? – holiday? – weather? – Customer based recommendations • Learn from your customers and their past. • Android vs iOS smartphones.
  • 16. Showing (too) similar products? No color alternatives? No glossy/matte alternatives? No product Recommendations – Traditional approach recommendations at all (Link to category without match with specific product) Which roller would be appropriate? No primer + paint combo? Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 17. Recommendations – what does Amazon do? Cross-selling as realized with other (similar?) customers Starts from customer point of view! Recommendations based on perceived customer journeys Re-use the product comparisons that previous customers did! DATA DRIVEN! Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 18. Recommendations – Other ideas Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Data Science ideas – “x % of the people who looked at this item eventually bought product X or Y” – Get cross-selling information from ERP in the physical shops and let this feed the webshop recommendations! – Similar product in different price ranges (“best-buy alternative”, “deluxe alternative”) – ... • This is very achievable for a webshop of any size – Just generate ideas, and test to see what actually increases sales! • Secondary use of various kinds of non-structured data = BigData ! – Weblogs of e-commerce site (use to deduct customer journeys) – ERP info with bills and/or invoices (use to deduct cross-selling in physical shops) – Product information (product categorization, …)
  • 19. Example Data Science applications: #2: Physical stores vs webshop Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 20. Impact physical store on online? – Are online sales higher when physical store is nearby? – Where to open a new store? – How to approach your customers to motivate Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 21. Impact physical shops - Why bother? • Determine strength of online brand vs physical brand – Is online sales driven by brand awareness? – Or is there quite a balance between the two? – Omni-channel strategy? • Know what would be the impact of opening/closing a physical shop, also on the online business – Support management decisions with facts & figures • Depends heavily on sector/product/case/… Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 22. Impact physical shops - example • Analysis for a retailer: Physical shops vs online sales Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 23. Impact physical shops - example • Impact of opening a physical shop on local online sales Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be (brand awareness?)
  • 24. Impact physical shops – now what? • Use this correlation information: – As extra input for determining new shop locations – Publish folders focusing on online in non-covered areas – Use popup-stores to get brand awareness and drive online sales – Discounts per region – Google Adwords campaigns focusing on regions with limited Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be brand presence – Customer segmentation based on this information
  • 25. Example Data Science applications: #3: Dynamic Pricing Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 26. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Dynamic prices – End of life products? – Relevancy of products. – (Local) competition. – Customer!
  • 27. Dynamic Prices – some ideas • Auto-combination special offers based on cross-selling Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be info • Monitor stock & manage promotions accordingly – Example: stock of calendars in December (value decreases over time…) – Example: Customer history: needs incentive to buy? Why not give a small discount if bought together? Testing will show if and for which products and customers this increases revenue!
  • 28. Dynamic Prices – some ideas Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Pricing vs competition scraping competition websites • Analysis of tenders vs deals – What type of deals do we typically win, and which not? = Data mining on CRM data! – How can we optimize our chances to make a deal? Which tenders should we invest in? What offer should we make? • Remark: in B2C scenarios, can be difficult / unwanted to use dynamic prices. Mind the legal impact!
  • 29. Example Data Science applications: #4: Personalized offerings Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 30. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Personalized offering – Loyal (online) customer vs new customers. – Browsing habits and patterns. – Spending patterns. – Personalized discounts and/or content?
  • 31. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Personalized offerings • Customer should be central in the webshop – Provide a truly personalized shopping experience – Like high-end physical shops with personal approach to VIP customers • Gather data about your customer – Surfing history – what products where looked at? How long? … – What products were bought? When? – Brand preference? – Product-segment preference? (budget, high-end, best-buy?) – Abandoned shopping carts • Take action based on information mined from this data – Triggered e-mails, personal recommendations, …
  • 32. Personalized offerings – some ideas Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Imply social media – Are there any connections of our customer that wrote product recommendations that might convince him to buy? – Do we know the shopping behaviour of some of the customers’ connections? Are they in line with his/hers? Can we use this to make better recommendations? • Anticipate customer behaviour – Use all customer contact moments eg: if customer calls customer service, they should know what products the customer was looking at during his last visit to the webshop – Prediction model (surfing behaviour vs % deal making) eg: Low chance? Go to checkout immediately. High chance? Offer extra cross-selling opportunities
  • 33. Example Data Science applications: #5: Gather external data Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 34. Gather external data, zoom & magnify – Explore search trends within Google. – Detect what is hot on social media. – Magnify to the results and set clear goals/actions. – Take action! Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 35. Gather and use external data Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • example: how to sell a Smartwatch? – It’s a new product, how to market it effectively? – eg: SEO in line with trending topics on twitter, facebook posts, … – eg: SEO in line with used search terms • Added value: combining external data sources with own data • Some ideas – Find and follow your contacts on LinkedIn previous/future employers of your contacts may be great prospects for your B2B business! – Use weather info to adapt the featured product offering Data Science exercise: do we find any correlation between the weather and the product sales figures?
  • 36. Example Data Science applications: #6: Anticipatory shipping Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 37. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Anticipatory shipping – Patent pending by Amazon. – Ships an order before it is placed. – Order history, search, wish list and click behaviour!
  • 38. Anticipatory shipping • High-tech? Actually not complex at all … Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Steps: – Gather many info on past orders (customer info, country, product info, price, product group, product combinations, time of day, season, …) – Build a prediction model predicting “cancelled or not” based on all this information – Assess the quality of the model by training it with 90% of your historical orders and testing it with 10% of your historical orders – Pass each new order’s info and predict the likelihood of this order getting cancelled (0 .. 100%) and act accordingly
  • 39. Example Data Science applications: #7: Customer Service optimizations Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 40. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Customer service – Losing sales/conversion/money by poor customer service. – Optimize information for all communication channels. – Which issues are your customers concerned with? – Allocate resources better!
  • 41. Customer Service – Some Ideas Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Text mining – Mood analysis: detect negative messages on social media, forum, … Put TODO on action list of customer care to contact with certain priority – Auto-classification of e-mails, letters, messages: Is this e-mail a question or a complaint? Is it about the quality of the product or financial (wrong invoice, …)? Automatic routing of messages to the right person! (operational optimization) • Social media – Social media status of customer (scoring based on profile) What’s would be the impact of this customer being unhappy about our service? • Omnichannel insights – What did this customer buy of look at? – How did he rate the last bought products? – Which contacts (mail, phone, …) did we have and what seems to be the most effective deal trigger?
  • 42. Applying Data Science Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 43. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Applying Data Science • Data Science does not replace business knowledge – Need to find balance between the two – Confirm or deny assumed business knowledge – Detect changing trends early (customer behaviour, …) • Not a development cycle, rather exploratory process: – Formulate hypotheses – Data mining and modeling – A/B testing (test new idea on x % of your customers/products/…) – Conclusions: did the test group show better conversion? – Rollout or cancel and start over! • Potential issues – Privacy law and other legal restrictions – Feedback loops, information leakage, wrong assumptions eg: trying to gather customer preferences when an order could as well have been a gift to someone else (perfume, …)
  • 44. Questions? Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye