SlideShare a Scribd company logo
Developing Extensions for RapidMiner …rapidly 
November 17th, 2014 
Sabrina Kirstein
RapidMiner Company Overview 
2 
Easy-to-use, blazing fast, and very easy to integrate with any IT infrastructure 
Support from a thriving communityof contributors creating new extensions and applications 
Processes designed in RapidMiner can be one-click deployedto RapidMiner Server or RapidMiner Cloud 
A unique Marketplacefor independent developers to publish their innovative extensions 
RapidMiner delivers the power of predictive analytics to business users. No programming required. 
More than 60 connectors (incl. SAP, Hadoop, Cloud connectors like Twitter and Zapier) allowing easy access to structured and unstructured data.
RapidMiner History 
3 
Cloud 
•Cloud 
•Hadoop 
Business Source 
•Commercial Editions 
•Community Editions 
•Client and Server 
Open Source 
•Command Line 
•Initial Workbench 
Open Source 
•Complete Workbench 
•CommunityExtensions 
•Marketplace 
Community Growth 
2007 
2010 
2013 
2014 
5,000 
30,000 
150,000 
250,000
RapidMiner Metrics 
4 
60+ 
Employees 
Worldwide 
100+ 
Active Developers 
600+ 
Customers in over 50 Countries 
40,000+ 
Downloads per Month 
35,000+ 
Active Deployments with over 250,000 Users
Product Overview 
5
RapidMiner Studio 
•With access to over 1500 different operators, the Java-based visual environment of RapidMiner allows for rapid data mining process development 
6 
Visual Process Design Environment
Accelerators 
7 
Wizard 
•Selection of data and label (e.g. churn) column. 
•Label column contains missings values if unknown –those will be predicted 
Results 
•Predictions (individuals, churn predictions) 
•Descriptive model 
•Model accuracy and lift chart
RapidMiner Cloud Repository & Execution 
8
RapidMiner Server 
9 
The RapidMiner Server provides enterprise-wide process development and process to web- service conversion with dynamic dashboards and data visualizations.
Extensions and the Marketplace 
10 
http://marketplace.rapidminer.com
ExistingExtensions 
11 
Edda–Extensions for Binominal Text Classification 
Instance selection and Prototype based rules 
RapidMiner Finance and Economics Extension 
Multimedia Mining Extension
RapidMiner Finance and Economics Extension 
Edda–Extensions for Binominal Text Classification 
ExistingExtensions 
Confidential 
12 
Instance selection and Prototype based rules 
Multimedia Mining Extension
Linked Open Data Extension 
•Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 
•Goal: Predict the popularity of new books 
13 
…
Linked Open Data Extension 
•Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 
•Goal: Predict the popularity of new books 
14 
… 
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> 
PREFIX ontology: <http://dbpedia.org/ontology/> 
select distinct ?book ?author ?isbn?country ?abstract ?pages ?language 
where { 
?book rdf:typeontology:Book. 
?book ontology:author?author . 
?book ontology:abstract?abstract . 
?book ontology:isbn?isbn. 
?book ontology:numberOfPages?pages . 
?book ontology:language?language . 
?book ontology:country?country . 
}
Linked Open Data Extension 
•Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 
•Goal: Predict the popularity of new books 
15 
… 
…
Text-/Web-Mining Extensions 
16
Multimedia Mining Extension 
17
WhiBoExtension 
18
MLWizardExtension 
19 
1. Define data location 
2. Evaluation of different models
MLWizardExtension 
20 
3. Load the best model 
4. The process will be designed for you
HowtoextendRapidMiner Studio 
Confidential 21
HowtoextendRapidMiner Studio 
Confidential 22 
gitclone https://github.com/rapidminer/rapidminer-extension-tutorial.gitgradleinstallExtension 
•Live Demo: 
–Extension skeleton 
–Operators 
–Special data objects 
–Advanced Extension elements 
–Accelerators 
•Documentation 
http://www.rapidminer.com/documentation
HowtointegrateRapidMiner 
•By web services: 
23 
Web Service API 
1.Export process as a web 
service in RM Server 
2.Select output format 
(JSON, XML, PNG, …) 
3. 
•HTTP POST to that URL 
•Read process results from HTTP response 
or 
•<iframe> into other Website
HowtointegrateRapidMiner 
•OEM: 
24 
Java 
1.RapidMiner can be easily invoked 
2.Call RapidMiner.init() 
3.Use the code: 
Create processes, run processes or transform data
RapidMinerUSA 
RapidMiner, Inc. (Headquarters) 
10 Fawcett St 
Cambridge, MA 02138 
United States 
E-mailcontact-us@rapidminer.com 
Phone+1 -617 -401 -7708 
Fax+1 -617 -401 -7709 
THANK YOU 
25 
RapidMinerGermany 
RapidMinerGmbH 
StockumerStr. 475 
44227 Dortmund 
Germany 
E-mailcontact-de@rapidminer.com 
Phone+49 -231 -425 786 9-0 
Fax+49 -231 -425 786 9-9 
RapidMinerUK 
RapidMinerLtd. 
QuatroHouse, Frimley Road 
CamberleyGU16 7ER 
United Kingdom 
E-mailcontact-uk@rapidminer.com 
Phone+44 1276 804 426 
Fax+1 -617 -401 –7709 
www.rapidminer.com 
RapidMiner Hungary 
RapidMiner Kft 
Iparutca5 
1095 Budapest 
Hungary 
E-mailcontact-hu@rapidminer.com 
Phone+44 1276 804 426 
Fax+1 -617 -401 -7709

More Related Content

PPTX
RapidMiner: Introduction To Rapid Miner
PDF
Introduction to RapidMiner Studio V7
PPTX
M Chambers and RapidMiner Overview for Babson class
PPTX
Rapid miner
PDF
Fast Data processing with RFX
PPTX
Big Data at Tube: Events to Insights to Action
PDF
Request CCCB Services
PPTX
Data mining tools (R , WEKA, RAPID MINER, ORANGE)
RapidMiner: Introduction To Rapid Miner
Introduction to RapidMiner Studio V7
M Chambers and RapidMiner Overview for Babson class
Rapid miner
Fast Data processing with RFX
Big Data at Tube: Events to Insights to Action
Request CCCB Services
Data mining tools (R , WEKA, RAPID MINER, ORANGE)

What's hot (20)

PDF
Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...
PPTX
Accelerating Delivery of Data Products - The EBSCO Way
PDF
Scalable Data Management for Kafka and Beyond | Dan Rice, BigID
PPTX
IoFMT – Internet of Fleet Management Things
PDF
Converging Database Transactions and Analytics
PPTX
Implementing BigPetStore with Apache Flink
PDF
Scaling to Infinity - Open Source meets Big Data
PPTX
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
PPTX
Presto for apps deck varada prestoconf
PPTX
StreamSet ETL tool
PPTX
Mutable data @ scale
PPTX
Building the Foundation for a Latency-Free Life
PDF
II-SDV 2016 RightsDirect
PPTX
Hadoop data access layer v4.0
PDF
Testistanbul 2016 - Keynote: "Enterprise Challenges of Test Data" by Rex Black
PDF
Amundsen at Brex and Looker integration
PDF
MongoDB .local Houston 2019: Building an IoT Streaming Analytics Platform to ...
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
PDF
Дмитрий Попович "How to build a data warehouse?"
PPTX
Presto query optimizer: pursuit of performance
Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...
Accelerating Delivery of Data Products - The EBSCO Way
Scalable Data Management for Kafka and Beyond | Dan Rice, BigID
IoFMT – Internet of Fleet Management Things
Converging Database Transactions and Analytics
Implementing BigPetStore with Apache Flink
Scaling to Infinity - Open Source meets Big Data
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
Presto for apps deck varada prestoconf
StreamSet ETL tool
Mutable data @ scale
Building the Foundation for a Latency-Free Life
II-SDV 2016 RightsDirect
Hadoop data access layer v4.0
Testistanbul 2016 - Keynote: "Enterprise Challenges of Test Data" by Rex Black
Amundsen at Brex and Looker integration
MongoDB .local Houston 2019: Building an IoT Streaming Analytics Platform to ...
HBase Global Indexing to support large-scale data ingestion at Uber
Дмитрий Попович "How to build a data warehouse?"
Presto query optimizer: pursuit of performance
Ad

Viewers also liked (12)

ODP
Mining the Web of Linked Data with RapidMiner
PPTX
RapidMiner: Introduction To Rapid Miner
PPTX
RapidMiner: Setting Up A Process
PPT
Data mining tools
PPTX
Hadoop World 2011: Radoop: a Graphical Analytics Tool for Big Data - Gabor Ma...
PPTX
Data mining tools overall
PDF
RapidMiner, an entrance to explore MIMIC-III?
PPTX
Data Mining: Implementation of Data Mining Techniques using RapidMiner software
PPTX
ODP
Exploiting Linked Open Data as Background Knowledge in Data Mining
PPTX
RapidMiner: Rapid Miner Products
PPTX
Terminology Machine Learning
Mining the Web of Linked Data with RapidMiner
RapidMiner: Introduction To Rapid Miner
RapidMiner: Setting Up A Process
Data mining tools
Hadoop World 2011: Radoop: a Graphical Analytics Tool for Big Data - Gabor Ma...
Data mining tools overall
RapidMiner, an entrance to explore MIMIC-III?
Data Mining: Implementation of Data Mining Techniques using RapidMiner software
Exploiting Linked Open Data as Background Knowledge in Data Mining
RapidMiner: Rapid Miner Products
Terminology Machine Learning
Ad

Similar to Slides PAPIs.io'14 RapidMiner (20)

PDF
Sabrina Kirstein @ RapidMiner
PPTX
RAPIDMINER: Rapidminerproducts
PPTX
RAPIDMINER: Rapidminer products
PPTX
rabidminer_Teamddsfa dfasdfasd fadfas.pptx
PDF
Text Mining with Rapid Miner.
PDF
Text Mining with RapidMiner
PPTX
RapidMiner: Rapid Miner Products
PDF
RapidMiner - From Data Mining To Decision Making In One Platform.pdf
PDF
Cross Device Ad Targeting at Scale
PDF
Knime &amp; bioinformatics
PPT
RapidInsight for OpenNMS
PDF
Elastic Web Mining
PDF
Big Data with KNIME.pdf
PDF
Distributed Database practicals
PPTX
WEB MINING.
PPTX
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
PDF
Semantic Web Mining
KEY
Big data and APIs for PHP developers - SXSW 2011
PDF
YM-RMWisdom15 final
PDF
Key Considerations for Putting Hadoop in Production SlideShare
Sabrina Kirstein @ RapidMiner
RAPIDMINER: Rapidminerproducts
RAPIDMINER: Rapidminer products
rabidminer_Teamddsfa dfasdfasd fadfas.pptx
Text Mining with Rapid Miner.
Text Mining with RapidMiner
RapidMiner: Rapid Miner Products
RapidMiner - From Data Mining To Decision Making In One Platform.pdf
Cross Device Ad Targeting at Scale
Knime &amp; bioinformatics
RapidInsight for OpenNMS
Elastic Web Mining
Big Data with KNIME.pdf
Distributed Database practicals
WEB MINING.
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Semantic Web Mining
Big data and APIs for PHP developers - SXSW 2011
YM-RMWisdom15 final
Key Considerations for Putting Hadoop in Production SlideShare

Recently uploaded (20)

PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Getting Started with Data Integration: FME Form 101
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
A Presentation on Artificial Intelligence
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PPTX
TLE Review Electricity (Electricity).pptx
PPTX
Tartificialntelligence_presentation.pptx
PPTX
1. Introduction to Computer Programming.pptx
PDF
project resource management chapter-09.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
Mushroom cultivation and it's methods.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Approach and Philosophy of On baking technology
PDF
Hindi spoken digit analysis for native and non-native speakers
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Building Integrated photovoltaic BIPV_UPV.pdf
SOPHOS-XG Firewall Administrator PPT.pptx
Getting Started with Data Integration: FME Form 101
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
A Presentation on Artificial Intelligence
Group 1 Presentation -Planning and Decision Making .pptx
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
TLE Review Electricity (Electricity).pptx
Tartificialntelligence_presentation.pptx
1. Introduction to Computer Programming.pptx
project resource management chapter-09.pdf
Chapter 5: Probability Theory and Statistics
A comparative study of natural language inference in Swahili using monolingua...
A novel scalable deep ensemble learning framework for big data classification...
Mushroom cultivation and it's methods.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Approach and Philosophy of On baking technology
Hindi spoken digit analysis for native and non-native speakers

Slides PAPIs.io'14 RapidMiner

  • 1. Developing Extensions for RapidMiner …rapidly November 17th, 2014 Sabrina Kirstein
  • 2. RapidMiner Company Overview 2 Easy-to-use, blazing fast, and very easy to integrate with any IT infrastructure Support from a thriving communityof contributors creating new extensions and applications Processes designed in RapidMiner can be one-click deployedto RapidMiner Server or RapidMiner Cloud A unique Marketplacefor independent developers to publish their innovative extensions RapidMiner delivers the power of predictive analytics to business users. No programming required. More than 60 connectors (incl. SAP, Hadoop, Cloud connectors like Twitter and Zapier) allowing easy access to structured and unstructured data.
  • 3. RapidMiner History 3 Cloud •Cloud •Hadoop Business Source •Commercial Editions •Community Editions •Client and Server Open Source •Command Line •Initial Workbench Open Source •Complete Workbench •CommunityExtensions •Marketplace Community Growth 2007 2010 2013 2014 5,000 30,000 150,000 250,000
  • 4. RapidMiner Metrics 4 60+ Employees Worldwide 100+ Active Developers 600+ Customers in over 50 Countries 40,000+ Downloads per Month 35,000+ Active Deployments with over 250,000 Users
  • 6. RapidMiner Studio •With access to over 1500 different operators, the Java-based visual environment of RapidMiner allows for rapid data mining process development 6 Visual Process Design Environment
  • 7. Accelerators 7 Wizard •Selection of data and label (e.g. churn) column. •Label column contains missings values if unknown –those will be predicted Results •Predictions (individuals, churn predictions) •Descriptive model •Model accuracy and lift chart
  • 9. RapidMiner Server 9 The RapidMiner Server provides enterprise-wide process development and process to web- service conversion with dynamic dashboards and data visualizations.
  • 10. Extensions and the Marketplace 10 http://marketplace.rapidminer.com
  • 11. ExistingExtensions 11 Edda–Extensions for Binominal Text Classification Instance selection and Prototype based rules RapidMiner Finance and Economics Extension Multimedia Mining Extension
  • 12. RapidMiner Finance and Economics Extension Edda–Extensions for Binominal Text Classification ExistingExtensions Confidential 12 Instance selection and Prototype based rules Multimedia Mining Extension
  • 13. Linked Open Data Extension •Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 •Goal: Predict the popularity of new books 13 …
  • 14. Linked Open Data Extension •Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 •Goal: Predict the popularity of new books 14 … PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX ontology: <http://dbpedia.org/ontology/> select distinct ?book ?author ?isbn?country ?abstract ?pages ?language where { ?book rdf:typeontology:Book. ?book ontology:author?author . ?book ontology:abstract?abstract . ?book ontology:isbn?isbn. ?book ontology:numberOfPages?pages . ?book ontology:language?language . ?book ontology:country?country . }
  • 15. Linked Open Data Extension •Assume a rating system for books giving us an ISBN number and a rating from 1 to 5 •Goal: Predict the popularity of new books 15 … …
  • 19. MLWizardExtension 19 1. Define data location 2. Evaluation of different models
  • 20. MLWizardExtension 20 3. Load the best model 4. The process will be designed for you
  • 22. HowtoextendRapidMiner Studio Confidential 22 gitclone https://github.com/rapidminer/rapidminer-extension-tutorial.gitgradleinstallExtension •Live Demo: –Extension skeleton –Operators –Special data objects –Advanced Extension elements –Accelerators •Documentation http://www.rapidminer.com/documentation
  • 23. HowtointegrateRapidMiner •By web services: 23 Web Service API 1.Export process as a web service in RM Server 2.Select output format (JSON, XML, PNG, …) 3. •HTTP POST to that URL •Read process results from HTTP response or •<iframe> into other Website
  • 24. HowtointegrateRapidMiner •OEM: 24 Java 1.RapidMiner can be easily invoked 2.Call RapidMiner.init() 3.Use the code: Create processes, run processes or transform data
  • 25. RapidMinerUSA RapidMiner, Inc. (Headquarters) 10 Fawcett St Cambridge, MA 02138 United States E-mailcontact-us@rapidminer.com Phone+1 -617 -401 -7708 Fax+1 -617 -401 -7709 THANK YOU 25 RapidMinerGermany RapidMinerGmbH StockumerStr. 475 44227 Dortmund Germany E-mailcontact-de@rapidminer.com Phone+49 -231 -425 786 9-0 Fax+49 -231 -425 786 9-9 RapidMinerUK RapidMinerLtd. QuatroHouse, Frimley Road CamberleyGU16 7ER United Kingdom E-mailcontact-uk@rapidminer.com Phone+44 1276 804 426 Fax+1 -617 -401 –7709 www.rapidminer.com RapidMiner Hungary RapidMiner Kft Iparutca5 1095 Budapest Hungary E-mailcontact-hu@rapidminer.com Phone+44 1276 804 426 Fax+1 -617 -401 -7709