SlideShare a Scribd company logo
1
ML in Proptech
-
Concept to Production
2
- Established 2018
- Milestones
- Future
Hello, who dis?
- Dev
- Data Eng @ scale
- FP monkey
Boyan Demir
- XP with SL, now into RL
- Armed bandit
- Afraid of commit(ment)
- Every adventure starts with a leap of faith
- Fun with pipelines
- A look inside the data
- Recommender Systems tl;dr
- How to solve it
- How to ship it
- Results
- Lessons learned
The Story Today
3
44
Real Estate Inside/Out
- Old business, old tech
- Riding a giant wave
- Resistance to using tools
…but the customer
is changing
- Need information abundancy
- Expect being served 24/7
- It’s all about the me me me
5
It all starts with
dubious design…
(SQL/Normalization)
…goes through
a phase of magic…
(Clojure/Kafka)
…into the data lake
6
Data Exploration
- Lack of culture around the
importance of keeping a record
- Register the absolute minimum
required
- Free input is the devil
- Single day journey
- Migration from obsolete systems
without normalization
7
Meme slide goes here / It’s
time for MACHINE LEARNING*
*if on presentation - it cures cancer and fights aliens
8
How Netflix(Youtube) do(n’t) it
- Needs data. Lots of it
- Cold start problem
- Recs based on user behaviour
CF
- Cold start for new users
- Feature based similarity
- Model per user, not a shared one
CB
9
The Recommender we need,
but not the one we deserve
9
- Constant influx of new users
- Very long repeat customer cycle
- Perishable items
- Limited interaction between
user/items
- Buy/Rent are very different paths
- There is no such dataset(yet)
How to solve it
10
Here We Discuss
The Future
11
Microlibs
Pattern
-
Notebooks to Library
ML Cores
(serialized)
HTTP
to he max
How to ship it
12
The Effect
- Served ~ 17000 customers, via 4
channels(mailing, sms, chatbot, mobile)
- ~35% open rate of communications with an
average around 2.4 clicks per open
- ~16% re-activation rate (inquiry/viewing)
13
Lessons Learned
- Integrate high in the stack
- Data quality >>>> Model complexity
- Business value does not need the
most advanced tech
- Digitalization should ideally
precede advanced ML
- Available tools will not always fit
every specific problem
- It’s hard being a fast company in a
slow business world 14
THE
END
BOYAN@GAIDA.AI + D.TONCHEV@GAIDA.AI 15

More Related Content

PDF
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016
PDF
Being a Data Science Product Manager
DOCX
How an E-commerce Brand Increased Conversions by 40% with AI-Powered Product ...
PPTX
Personalization using Machine Learning
PPTX
AI Development Services for Smarter, Future-Ready Solutions | SunTec India Cl...
PPTX
Ezml Stanford 2015
PDF
MongoDB.local Austin 2018: Building Intelligent Apps with MongoDB & Google Cloud
PDF
ADV Slides: Data Curation for Artificial Intelligence Strategies
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016
Being a Data Science Product Manager
How an E-commerce Brand Increased Conversions by 40% with AI-Powered Product ...
Personalization using Machine Learning
AI Development Services for Smarter, Future-Ready Solutions | SunTec India Cl...
Ezml Stanford 2015
MongoDB.local Austin 2018: Building Intelligent Apps with MongoDB & Google Cloud
ADV Slides: Data Curation for Artificial Intelligence Strategies

Similar to ML in Proptech - Concept to Production (20)

PDF
Machine Learning for retail and ecommerce
PPTX
HacktoberFestPune - DSC MESCOE x DSC PVGCOET
PDF
Deep learning for e-commerce: current status and future prospects
PPTX
Interview for saby upadhyay
PPTX
Interview for saby upadhyay
PPTX
Building Intelligent Apps with MongoDB and Google Cloud - Jane Fine
PDF
Recoding the Customer Experience
PPTX
MongoDB.local Sydney 2019: Building Intelligent Apps with MongoDB & Google Cloud
PDF
How to make data science products
PPTX
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
PDF
productionising-recommenders
PDF
Strategic AI Integration in Engineering Teams
PPTX
Ria Sankar on Building AI Products
PDF
Deep Link Analytics Empowered by AI + Graph + Verticals
PDF
Frequently Bought Together Recommendations Based on Embeddings
DOCX
Profile based recommendation for Airbnb users-Project report
PDF
Ai in marketing: How Business & Marketing can benefit from adopting AI & ML
PDF
MongoDB.local DC 2018: Building Intelligent Apps with MongoDB & Google Cloud
PPTX
[DSC Europe 23][AICommerce]Ratko Nikolic Fashion-forward Transforming E-Comme...
PDF
Using AI for Ecommerce Analytics
Machine Learning for retail and ecommerce
HacktoberFestPune - DSC MESCOE x DSC PVGCOET
Deep learning for e-commerce: current status and future prospects
Interview for saby upadhyay
Interview for saby upadhyay
Building Intelligent Apps with MongoDB and Google Cloud - Jane Fine
Recoding the Customer Experience
MongoDB.local Sydney 2019: Building Intelligent Apps with MongoDB & Google Cloud
How to make data science products
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
productionising-recommenders
Strategic AI Integration in Engineering Teams
Ria Sankar on Building AI Products
Deep Link Analytics Empowered by AI + Graph + Verticals
Frequently Bought Together Recommendations Based on Embeddings
Profile based recommendation for Airbnb users-Project report
Ai in marketing: How Business & Marketing can benefit from adopting AI & ML
MongoDB.local DC 2018: Building Intelligent Apps with MongoDB & Google Cloud
[DSC Europe 23][AICommerce]Ratko Nikolic Fashion-forward Transforming E-Comme...
Using AI for Ecommerce Analytics
Ad

More from Data Science Society (20)

PDF
[Data Meetup] Data Science in Finance - Factor Models in Finance
PDF
[Data Meetup] Data Science in Finance - Building a Quant ML pipeline
PPTX
[Data Meetup] Data Science in Journalism - Tanbih, QCRI and MIT
PPTX
Computer Vision in Real Estate
PPTX
Lessons Learned: Linked Open Data implemented in 2 Use Cases
PPT
AI methods for localization in noisy environment
PPTX
Object Identification and Detection Hackathon Solution
PPTX
Data Science for Open Innovation in SMEs and Large Corporations
PDF
Air Pollution in Sofia - Solution through Data Science by Kiwi team
PPTX
Machine Learning in Astrophysics
PPTX
#AcademiaDatathon Finlists' Solution of Crypto Datathon Case
PPTX
Coreference Extraction from Identric’s Documents - Solution of Datathon 2018
PDF
DNA Analytics - What does really goes into Sausages - Datathon2018 Solution
PDF
Relationships between research tasks and data structure (basic methods and a...
PDF
Data science tools - A.Marchev and K.Haralampiev
PDF
Problems of Application of Machine Learning in the CRM - panel
PDF
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
PDF
Intelligent Question Answering Using the Wisdom of the Crowd, Preslav Nakov
PDF
Master class Hristo Hadjitchonev - Aubg
PPTX
Open Data reveals corruption practices - case from Datathon 2017
[Data Meetup] Data Science in Finance - Factor Models in Finance
[Data Meetup] Data Science in Finance - Building a Quant ML pipeline
[Data Meetup] Data Science in Journalism - Tanbih, QCRI and MIT
Computer Vision in Real Estate
Lessons Learned: Linked Open Data implemented in 2 Use Cases
AI methods for localization in noisy environment
Object Identification and Detection Hackathon Solution
Data Science for Open Innovation in SMEs and Large Corporations
Air Pollution in Sofia - Solution through Data Science by Kiwi team
Machine Learning in Astrophysics
#AcademiaDatathon Finlists' Solution of Crypto Datathon Case
Coreference Extraction from Identric’s Documents - Solution of Datathon 2018
DNA Analytics - What does really goes into Sausages - Datathon2018 Solution
Relationships between research tasks and data structure (basic methods and a...
Data science tools - A.Marchev and K.Haralampiev
Problems of Application of Machine Learning in the CRM - panel
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
Intelligent Question Answering Using the Wisdom of the Crowd, Preslav Nakov
Master class Hristo Hadjitchonev - Aubg
Open Data reveals corruption practices - case from Datathon 2017
Ad

Recently uploaded (20)

PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
Transcultural that can help you someday.
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Database Infoormation System (DBIS).pptx
PPT
Quality review (1)_presentation of this 21
PDF
Business Analytics and business intelligence.pdf
PPTX
Leprosy and NLEP programme community medicine
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Qualitative Qantitative and Mixed Methods.pptx
Supervised vs unsupervised machine learning algorithms
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
IBA_Chapter_11_Slides_Final_Accessible.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Transcultural that can help you someday.
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Reliability_Chapter_ presentation 1221.5784
Clinical guidelines as a resource for EBP(1).pdf
ISS -ESG Data flows What is ESG and HowHow
oil_refinery_comprehensive_20250804084928 (1).pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Database Infoormation System (DBIS).pptx
Quality review (1)_presentation of this 21
Business Analytics and business intelligence.pdf
Leprosy and NLEP programme community medicine
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Introduction to Knowledge Engineering Part 1
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Qualitative Qantitative and Mixed Methods.pptx

ML in Proptech - Concept to Production

  • 2. 2 - Established 2018 - Milestones - Future Hello, who dis? - Dev - Data Eng @ scale - FP monkey Boyan Demir - XP with SL, now into RL - Armed bandit - Afraid of commit(ment)
  • 3. - Every adventure starts with a leap of faith - Fun with pipelines - A look inside the data - Recommender Systems tl;dr - How to solve it - How to ship it - Results - Lessons learned The Story Today 3
  • 4. 44 Real Estate Inside/Out - Old business, old tech - Riding a giant wave - Resistance to using tools …but the customer is changing - Need information abundancy - Expect being served 24/7 - It’s all about the me me me
  • 5. 5 It all starts with dubious design… (SQL/Normalization) …goes through a phase of magic… (Clojure/Kafka) …into the data lake
  • 6. 6 Data Exploration - Lack of culture around the importance of keeping a record - Register the absolute minimum required - Free input is the devil - Single day journey - Migration from obsolete systems without normalization
  • 7. 7 Meme slide goes here / It’s time for MACHINE LEARNING* *if on presentation - it cures cancer and fights aliens
  • 8. 8 How Netflix(Youtube) do(n’t) it - Needs data. Lots of it - Cold start problem - Recs based on user behaviour CF - Cold start for new users - Feature based similarity - Model per user, not a shared one CB
  • 9. 9 The Recommender we need, but not the one we deserve 9 - Constant influx of new users - Very long repeat customer cycle - Perishable items - Limited interaction between user/items - Buy/Rent are very different paths - There is no such dataset(yet)
  • 10. How to solve it 10
  • 11. Here We Discuss The Future 11
  • 12. Microlibs Pattern - Notebooks to Library ML Cores (serialized) HTTP to he max How to ship it 12
  • 13. The Effect - Served ~ 17000 customers, via 4 channels(mailing, sms, chatbot, mobile) - ~35% open rate of communications with an average around 2.4 clicks per open - ~16% re-activation rate (inquiry/viewing) 13
  • 14. Lessons Learned - Integrate high in the stack - Data quality >>>> Model complexity - Business value does not need the most advanced tech - Digitalization should ideally precede advanced ML - Available tools will not always fit every specific problem - It’s hard being a fast company in a slow business world 14