#1 New York, #1 Software, #7 Overall Rank, #1 in 2015: Deloitte’s Technology Fast 500
#1 in 2015: Crain's New York Fast 50
MODEL DEPLOYMENT MADE EASY WITH PMML
We understand how to connect data to human needs to create fulfilling mobile moments
Our solutions
Advertising Platform, Digital Content, Social Data Platform, Virtual Reality
Data team
20% of employees are data experts
Holding BSc, MSc and PhD degrees, intelligence units
8 data scientists
About me
Physics Ph.D
Superconductor-to-insulator transition in ultrathin films
Converted to data science
1.5 years at StartApp
Optimization (ad -> user)
Deployment
Data
Real time
2.5B data requests served daily
200K/s model evaluations
25 milliseconds response time
15TB of data per day
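As a rough sanity check, the figures above can be related to each other with simple arithmetic. This is a back-of-envelope sketch, not a measurement: it assumes traffic is spread evenly over the day, which real traffic is not, so peak rates will be higher than the average computed here.

```python
# Back-of-envelope check of the traffic figures quoted on the slide.
SECONDS_PER_DAY = 24 * 60 * 60

requests_per_day = 2.5e9        # "2.5B requests served daily" (from the slide)
evaluations_per_second = 200e3  # "200K/s model evaluations" (from the slide)

# Assumption: traffic is uniform over the day; real traffic has peaks.
avg_requests_per_second = requests_per_day / SECONDS_PER_DAY
avg_evaluations_per_request = evaluations_per_second / avg_requests_per_second

print(f"average requests/s: {avg_requests_per_second:,.0f}")
print(f"model evaluations per request: {avg_evaluations_per_request:.1f}")
```

Under that assumption, each request triggers several model evaluations on average, which would fit a flow where more than one candidate ad is scored per request.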
Real-time platform work
Request -> Enrichment -> Candidate ads -> Scoring -> Delivery
Total 25 ms
Our data scientist workflow
Explore data -> Build feature set -> Train model -> Deploy model -> Measure
The gap
Platform developer
Explore data -> Build feature set -> Train model -> Deploy model -> Measure
Data scientist
Explain (again)
Using a developer
Pros
Knows system architecture
Can tailor code to scale
May be a nice guy!
Cons
Usually won’t know models
Translation of model proves to be difficult
Long time to production
What if something doesn’t work? Who’s to fix it?
Our challenges
HIGH MODEL UPDATE FREQUENCY: weekly, daily, hourly
HIGH THROUGHPUT: 200K requests/second
LOW LATENCY: max. 25 ms end-to-end
VARIOUS MODELS: LR, RF, etc.
MODEL SCORING STANDARD
What’s PMML, you ask?
Maintained continuously for 20 years
Industry-oriented
XML format
Declarative
Many models:
Ensembles (!)
Regressions
Trees
Neural networks
K-nearest neighbors
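To make "XML format" and "declarative" concrete, here is a minimal sketch of what a PMML (Predictive Model Markup Language) document can look like and how little is needed to score it. The model itself (two features `x1` and `x2`, their coefficients, the intercept) is a made-up toy, and the hand-written `score` function is only an illustration; it is not how JPMML or any production evaluator works, and it ignores most of the specification (for example the `exponent` attribute of `NumericPredictor`).

```python
import xml.etree.ElementTree as ET

# Hypothetical two-feature linear regression written as a PMML 4.3 document.
# The file only declares WHAT the model is; it contains no scoring code.
PMML_DOC = """
<PMML xmlns="http://www.dmg.org/PMML-4_3" version="4.3">
  <Header description="toy example"/>
  <DataDictionary numberOfFields="3">
    <DataField name="x1" optype="continuous" dataType="double"/>
    <DataField name="x2" optype="continuous" dataType="double"/>
    <DataField name="y" optype="continuous" dataType="double"/>
  </DataDictionary>
  <RegressionModel functionName="regression">
    <MiningSchema>
      <MiningField name="x1"/>
      <MiningField name="x2"/>
      <MiningField name="y" usageType="target"/>
    </MiningSchema>
    <RegressionTable intercept="0.5">
      <NumericPredictor name="x1" coefficient="2.0"/>
      <NumericPredictor name="x2" coefficient="-1.0"/>
    </RegressionTable>
  </RegressionModel>
</PMML>
"""

NS = {"pmml": "http://www.dmg.org/PMML-4_3"}


def score(pmml_text, row):
    """Evaluate the single RegressionTable of a toy PMML regression model."""
    root = ET.fromstring(pmml_text.strip())
    table = root.find("pmml:RegressionModel/pmml:RegressionTable", NS)
    result = float(table.get("intercept"))
    for predictor in table.findall("pmml:NumericPredictor", NS):
        result += float(predictor.get("coefficient")) * row[predictor.get("name")]
    return result


prediction = score(PMML_DOC, {"x1": 1.5, "x2": 2.0})
print(prediction)
```

Because the document is pure data, retraining the model only means shipping a new XML file; the scoring code on the platform side stays unchanged.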
Real-time platform work
Input -> Validation -> Pre-processing -> Model scoring -> Post-processing -> Output
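The stages above can be sketched as a chain of small functions. This is an illustrative sketch only: the feature names, valid ranges, weights and the logistic post-processing step are all hypothetical and not taken from the talk. In PMML these stages would be declared inside the model file itself (for example in the mining schema, transformations, and output elements) rather than written as code.

```python
import math

# Hypothetical feature bounds, standing in for the validation rules
# a PMML document would declare.
VALID_RANGE = {"age": (0.0, 120.0), "sessions": (0.0, 1e6)}

# Hypothetical logistic-regression parameters.
WEIGHTS = {"age": -0.02, "log_sessions": 0.8}
INTERCEPT = -1.0


def validate(row):
    """Validation: reject missing or out-of-range inputs."""
    for name, (low, high) in VALID_RANGE.items():
        if name not in row:
            raise ValueError(f"missing field: {name}")
        if not low <= row[name] <= high:
            raise ValueError(f"out of range: {name}={row[name]}")
    return row


def preprocess(row):
    """Pre-processing: derive the features the model was trained on."""
    return {"age": row["age"], "log_sessions": math.log1p(row["sessions"])}


def score(features):
    """Model scoring: linear combination of the derived features."""
    return INTERCEPT + sum(WEIGHTS[name] * features[name] for name in WEIGHTS)


def postprocess(margin):
    """Post-processing: map the raw margin to a probability."""
    return 1.0 / (1.0 + math.exp(-margin))


def evaluate(row):
    """Input -> Validation -> Pre-processing -> Scoring -> Post-processing -> Output."""
    return postprocess(score(preprocess(validate(row))))


print(evaluate({"age": 30.0, "sessions": 0.0}))
```

Keeping each stage a separate function mirrors the slide; the point of PMML is that all four stages travel together in one file, so the data scientist and the platform cannot drift apart on any of them.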
r2pmml JPMML
Transformers
More than meets the eye
When PMML is done
LIVE DEMO!
Siesta
The gap - closed
Platform developer, Data scientist
Explore data -> Build feature set -> Train model -> Deploy model -> Measure
THANK YOU :)
We’re hiring!
www.startapp.com/careers/
shachaf.poran@startapp.com

Editor's Notes

  • #4: Soda: social data platform. Ad platform: ad platform. Immerse: one of the first companies to embed ads in VR. Digital content: helps physical brands connect to the mobile digital world.