SlideShare a Scribd company logo
Takashi Umeda (梅田卓志)
@umekoumeda
Oct. 20th , 2012
Purchase prediction
by statistical analysis
楽天技術研究所
Rakuten Institute of Technology
Value Proposition
Third Reality
Vision Tokyo & NY
& Paris
Strategic R&D organization for Rakuten
Biography
3
• Takashi Umeda
• Twitter : @umekoumeda
Profile
Work
Purchase prediction >>> Users’ Benefits
Prediction of purchase interval
Seasonality forecasting
Preference prediction
Biography
4
• Takashi Umeda
• Twitter : @umekoumeda
Profile
Work
Purchase prediction >>> Users’ Benefits
Prediction of purchase interval
Seasonality forecasting
Preference prediction
Objective
• Predict the users’ purchase interval
• Focus on non-durable goods
5
Example of application
30 days 30 days 30 days
Past FuturePast Future
Buy Buy Buy Buy
Example of application
30 days 30 days 30 days
Past FuturePast Future
Buy Buy Buy Buy
Remind!
We can remind users
just before next purchase
30 days
そろそろ、買い時では?
Users’ benefits
Prevent users from forgetting to purchase
Empty
I forgot to
purchase!
It’s time to
purchase!
Few
NG OK
Notification
How can we predict
purchase interval ?
Data set
Purchase history in rice category
Target users :
Users purchasing over 4 times in one year
Pick Up
Example of users with only fixed intervals
BUY
30 days
BUY BUY
31 days 29 days
BUY
Past Future
BUY
30 days
BUY BUY
31 days 29 days
BUY
Past Future
All purchase intervals are fixed. It’s about 30 days.
We call those users as “Users with only fixed intervals”
Example of users with only fixed intervals
Coverage of users with only fixed intervals
Target
Users with only fixed intervals
(Predictable users)
About 11%
• Target : Users purchasing over 4 times in 1 year
Too low coverage !
Not practical !
Example of users with a few outlier intervals
BUY
31days 60 days 29 days
BUY BUY BUY
30 days
BUY
BUY
31days 60 days 29 days
BUY BUY BUY
30 days
BUY
Most of purchase intervals are fixed.
It’s about 30 days.
Example of users with a few outlier intervals
BUY
31days 60 days 29 days
BUY BUY BUY
30 days
BUY
A few intervals are outlier intervals
Example of users with a few outlier intervals
BUY
31days 60 days 29 days
BUY BUY BUY
30 days
• There are a lots of users with
many fixed and a few outlier intervals
• We call those users as
“Users with a few outlier intervals”
BUY
Example of users with a few outlier intervals
BUY
31days 60 days 29 days
BUY BUY BUY
30 days
BUY
Cause for the outlier interval
Why outlier intervals happened
?
31days 60 days 29 days30 days
Cause for the outlier interval
5kg 10kg 5kg 5kg 5kg
• If consumer purchased more, interval had been longer
• This type of users account for 22 %
Coverage of users with a few outlier intervals
Predictable users
47%
Users with a few outlier intervals
36%
• Target : Users purchasing over 4 times in 1 year
Target
Users with only fixed intervals
11%
Trends in any other categories
Predictable users exist in not only rice category
but also any other categories.
Ratio of predictable users
33.4 % 55.3 % 46.1 % 43.2 %
Items which we should show
57%
In the reminding system, it’s better to show
the item which has been purchased before.
Users repeatedly purchase
the same item at the same shop
Users repeatedly purchase
different items
- Items sold at the same shop (10%)
- Same priced items (14%)
43%
Items which we should show
21%
In the reminding system, it’s better to show
various kinds of items.
Users repeatedly purchase
the same item at the same shop
Users repeatedly purchase
different items
- Items sold at the same shop (32%)
- Same priced items (31%)
79%
Summary
There are many predictable users
47% in the rice category
We can remind users at the right moment !
It makes users happy !
Prevent users from forgetting to purchase item
Message
There are many
Fixed interval users
It make users happy !
• In rice category,
those users account for 47%.
• Many categories have same trends
By using detected fixed interval,
We can remind users
just before next purchase
Users can avoid from forgetting to
purchase regular buying items
Purchase Prediction
Users’ Benefits
Message
There are many
Fixed interval users
It make users happy !
• In rice category,
those users account for 47%.
• Many categories have same trends
By using detected fixed interval,
We can remind users
just before next purchase
Users can avoid from forgetting to
purchase regular buying items
Purchase Prediction
Users’ Benefits
If you come up with any idea,
feel free to tweet via twitter
@umekoumeda

More Related Content

PPTX
Consumer adoption process
PDF
Gingernet bonbon ppt
PPT
解決正確的問題 - 如何讓數據發揮影響力?
PPT
Machine Learning and Statistical Analysis
PPTX
Intelligent Electronic Commerce Service Based on Understanding of User Behaviors
PDF
Latent Semantic Transliteration using Dirichlet Mixture
PPTX
[RakutenTechConf2013][C-4_3] Our Goals and Activities at Rakuten Institute o...
PPTX
[RakutenTechConf2013] [LT] Giving Life to your IDEAS to Survive in Evolving Era
Consumer adoption process
Gingernet bonbon ppt
解決正確的問題 - 如何讓數據發揮影響力?
Machine Learning and Statistical Analysis
Intelligent Electronic Commerce Service Based on Understanding of User Behaviors
Latent Semantic Transliteration using Dirichlet Mixture
[RakutenTechConf2013][C-4_3] Our Goals and Activities at Rakuten Institute o...
[RakutenTechConf2013] [LT] Giving Life to your IDEAS to Survive in Evolving Era

Viewers also liked (7)

PPTX
[RakutenTechConf2013] [C4-1] Text detection in product images
PDF
Unsupervised Extraction of Attributes and Their Values from Product Description
PDF
Scaling and High Performance Storage System: LeoFS
PDF
Latent Class Transliteration based on Source Language Origin
PPTX
[RakutenTechConf2013] [D-3_2] Counting Big Data by Streaming Algorithms
PPTX
[RakutenTechConf2013] [C-4_2] Building Structured Data from Product Descriptions
PDF
The Egison Programming Language
[RakutenTechConf2013] [C4-1] Text detection in product images
Unsupervised Extraction of Attributes and Their Values from Product Description
Scaling and High Performance Storage System: LeoFS
Latent Class Transliteration based on Source Language Origin
[RakutenTechConf2013] [D-3_2] Counting Big Data by Streaming Algorithms
[RakutenTechConf2013] [C-4_2] Building Structured Data from Product Descriptions
The Egison Programming Language
Ad

Similar to Purchase prediction by statistical analysis (統計技術を用いた商品購買予測) (20)

PPTX
International management of future group ppt
PPTX
Consumer Behaviour final latest.pptx
PPTX
HTPM Project.pptx
PPT
Mm unit 2 point 1
PPT
Mm unit 2 point 1
PPT
Mm unit 2 point 1
PPTX
Consumer behavior - Decision making & Types
PPTX
Shoppers' stop tragetting the young
PPTX
Shoppers Stop Targetting the young
PPT
Market basket analysis
PPTX
Unit 3 marketing management
PPTX
Consumer Behavior Introduction and Understanding
PPTX
Consumer Behavior.pptx
PPTX
GTM Strategy for E-commerce Website into third layer clothing
PPTX
Siam turakij research presentation
PPT
demand forecasting Latest.ppt demand forecasting ppt
PPTX
Marketing Essentials for MBA - Part 1
PPTX
Consumer behavior and advertising research
PPTX
Presentation on marketing
PPTX
Demand forecasting ppt
International management of future group ppt
Consumer Behaviour final latest.pptx
HTPM Project.pptx
Mm unit 2 point 1
Mm unit 2 point 1
Mm unit 2 point 1
Consumer behavior - Decision making & Types
Shoppers' stop tragetting the young
Shoppers Stop Targetting the young
Market basket analysis
Unit 3 marketing management
Consumer Behavior Introduction and Understanding
Consumer Behavior.pptx
GTM Strategy for E-commerce Website into third layer clothing
Siam turakij research presentation
demand forecasting Latest.ppt demand forecasting ppt
Marketing Essentials for MBA - Part 1
Consumer behavior and advertising research
Presentation on marketing
Demand forecasting ppt
Ad

More from Rakuten Group, Inc. (20)

PDF
EPSS (Exploit Prediction Scoring System)モニタリングツールの開発
PPTX
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
PDF
楽天における安全な秘匿情報管理への道のり
PDF
What Makes Software Green?
PDF
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
PDF
DataSkillCultureを浸透させる楽天の取り組み
PDF
大規模なリアルタイム監視の導入と展開
PDF
楽天における大規模データベースの運用
PDF
楽天サービスを支えるネットワークインフラストラクチャー
PDF
楽天の規模とクラウドプラットフォーム統括部の役割
PDF
Rakuten Services and Infrastructure Team.pdf
PDF
The Data Platform Administration Handling the 100 PB.pdf
PDF
Supporting Internal Customers as Technical Account Managers.pdf
PDF
Making Cloud Native CI_CD Services.pdf
PDF
How We Defined Our Own Cloud.pdf
PDF
Travel & Leisure Platform Department's tech info
PDF
Travel & Leisure Platform Department's tech info
PDF
OWASPTop10_Introduction
PDF
Introduction of GORA API Group technology
PDF
100PBを越えるデータプラットフォームの実情
EPSS (Exploit Prediction Scoring System)モニタリングツールの開発
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
楽天における安全な秘匿情報管理への道のり
What Makes Software Green?
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
DataSkillCultureを浸透させる楽天の取り組み
大規模なリアルタイム監視の導入と展開
楽天における大規模データベースの運用
楽天サービスを支えるネットワークインフラストラクチャー
楽天の規模とクラウドプラットフォーム統括部の役割
Rakuten Services and Infrastructure Team.pdf
The Data Platform Administration Handling the 100 PB.pdf
Supporting Internal Customers as Technical Account Managers.pdf
Making Cloud Native CI_CD Services.pdf
How We Defined Our Own Cloud.pdf
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
OWASPTop10_Introduction
Introduction of GORA API Group technology
100PBを越えるデータプラットフォームの実情

Purchase prediction by statistical analysis (統計技術を用いた商品購買予測)

  • 1. Takashi Umeda (梅田卓志) @umekoumeda Oct. 20th , 2012 Purchase prediction by statistical analysis
  • 2. 楽天技術研究所 Rakuten Institute of Technology Value Proposition Third Reality Vision Tokyo & NY & Paris Strategic R&D organization for Rakuten
  • 3. Biography 3 • Takashi Umeda • Twitter : @umekoumeda Profile Work Purchase prediction >>> Users’ Benefits Prediction of purchase interval Seasonality forecasting Preference prediction
  • 4. Biography 4 • Takashi Umeda • Twitter : @umekoumeda Profile Work Purchase prediction >>> Users’ Benefits Prediction of purchase interval Seasonality forecasting Preference prediction
  • 5. Objective • Predict the users’ purchase interval • Focus on non-durable goods 5
  • 6. Example of application 30 days 30 days 30 days Past FuturePast Future Buy Buy Buy Buy
  • 7. Example of application 30 days 30 days 30 days Past FuturePast Future Buy Buy Buy Buy Remind! We can remind users just before next purchase 30 days そろそろ、買い時では?
  • 8. Users’ benefits Prevent users from forgetting to purchase Empty I forgot to purchase! It’s time to purchase! Few NG OK Notification
  • 9. How can we predict purchase interval ?
  • 10. Data set Purchase history in rice category Target users : Users purchasing over 4 times in one year Pick Up
  • 11. Example of users with only fixed intervals BUY 30 days BUY BUY 31 days 29 days BUY Past Future
  • 12. BUY 30 days BUY BUY 31 days 29 days BUY Past Future All purchase intervals are fixed. It’s about 30 days. We call those users as “Users with only fixed intervals” Example of users with only fixed intervals
  • 13. Coverage of users with only fixed intervals Target Users with only fixed intervals (Predictable users) About 11% • Target : Users purchasing over 4 times in 1 year Too low coverage ! Not practical !
  • 14. Example of users with a few outlier intervals BUY 31days 60 days 29 days BUY BUY BUY 30 days BUY
  • 15. BUY 31days 60 days 29 days BUY BUY BUY 30 days BUY Most of purchase intervals are fixed. It’s about 30 days. Example of users with a few outlier intervals
  • 16. BUY 31days 60 days 29 days BUY BUY BUY 30 days BUY A few intervals are outlier intervals Example of users with a few outlier intervals
  • 17. BUY 31days 60 days 29 days BUY BUY BUY 30 days • There are a lots of users with many fixed and a few outlier intervals • We call those users as “Users with a few outlier intervals” BUY Example of users with a few outlier intervals
  • 18. BUY 31days 60 days 29 days BUY BUY BUY 30 days BUY Cause for the outlier interval Why outlier intervals happened ?
  • 19. 31days 60 days 29 days30 days Cause for the outlier interval 5kg 10kg 5kg 5kg 5kg • If consumer purchased more, interval had been longer • This type of users account for 22 %
  • 20. Coverage of users with a few outlier intervals Predictable users 47% Users with a few outlier intervals 36% • Target : Users purchasing over 4 times in 1 year Target Users with only fixed intervals 11%
  • 21. Trends in any other categories Predictable users exist in not only rice category but also any other categories. Ratio of predictable users 33.4 % 55.3 % 46.1 % 43.2 %
  • 22. Items which we should show 57% In the reminding system, it’s better to show the item which has been purchased before. Users repeatedly purchase the same item at the same shop Users repeatedly purchase different items - Items sold at the same shop (10%) - Same priced items (14%) 43%
  • 23. Items which we should show 21% In the reminding system, it’s better to show various kinds of items. Users repeatedly purchase the same item at the same shop Users repeatedly purchase different items - Items sold at the same shop (32%) - Same priced items (31%) 79%
  • 24. Summary There are many predictable users 47% in the rice category We can remind users at the right moment ! It makes users happy ! Prevent users from forgetting to purchase item
  • 25. Message There are many Fixed interval users It make users happy ! • In rice category, those users account for 47%. • Many categories have same trends By using detected fixed interval, We can remind users just before next purchase Users can avoid from forgetting to purchase regular buying items Purchase Prediction Users’ Benefits
  • 26. Message There are many Fixed interval users It make users happy ! • In rice category, those users account for 47%. • Many categories have same trends By using detected fixed interval, We can remind users just before next purchase Users can avoid from forgetting to purchase regular buying items Purchase Prediction Users’ Benefits If you come up with any idea, feel free to tweet via twitter @umekoumeda