SlideShare a Scribd company logo
Text Classification
Chakrit Phain
R&D Softnix Technology Co.,Ltd,
Text classification With Rapid Miner
• Data Preparation
• Choosing Model
• Evaluation
• CSV
• JDBC
CSV Example
• Data Preparation
• Choosing Model
• Evaluation
• CSV
• JDBC
Separate by
• Data Preparation
• Choosing Model
• Evaluation
• CSV
• JDBC
Separate by
https://prestosql.io/download.html
• Data Preparation
• Choosing Model
• Evaluation
• CSV
• JDBC
Separate by
https://www.cdata.com/kb/tech/presto-jdbc-rapidminer.rst
• Data Preparation
• Choosing Model
• Evaluation
Model For Text Classification
•Multinomial Naïve Bayes (NB)
•Logistic Regression (LR)
•SVM (SVM)
•Stochastic Gradient Descent (SGD)
•k-Nearest-Neighbors (kNN)
•RandomForest (RF)
•Gradient Boosting (GB)
•XGBoost (the famous) (XGB)
•Adaboost
•Catboost
•LigthGBM
•ExtraTreesClassifier
https://towardsdatascience.com/model-selection-in-text-classification-ac13eedf6146
• Data Preparation
• Choosing Model
• Evaluation
Evaluate Metrics
• Precision
• Recall
• F1-Score
• AUC
• ROC
• Cohen’s Kappa
• Log Loss curve
• Accuracy train test curve
• False/True Positive rate curve
• Balanced Accuracy
• Zero-one Loss
• Explained Variance
https://towardsdatascience.com/model-selection-in-text-classification-ac13eedf6146
Rapid Miner Example
https://towardsdatascience.com/model-selection-in-text-classification-ac13eedf6146
Rapid Miner Example
Rapid Miner Example
Rapid Miner Example
Thank you

More Related Content

PPT
Data warehouse solutions
PPTX
Four NoSQL Databases You Should Know
PPTX
mongodb_Introduction
PDF
Nosql databases for the .net developer
PPTX
Ten Commandants For Picking NoSQL Database
PDF
RDFauthor (EKAW)
PDF
Heterogenous Persistence
PDF
Schema Agnostic Indexing with Azure DocumentDB
Data warehouse solutions
Four NoSQL Databases You Should Know
mongodb_Introduction
Nosql databases for the .net developer
Ten Commandants For Picking NoSQL Database
RDFauthor (EKAW)
Heterogenous Persistence
Schema Agnostic Indexing with Azure DocumentDB

What's hot (20)

PDF
Visualize your graph database
PPTX
Introduction to Apache HBase
PPTX
Deven s presentation
PDF
RDF Seminar Presentation
PPTX
NoSQL Roundup
PPTX
Basic Application Performance Optimization Techniques (Backend)
PDF
Deep Dive on ArangoDB
PDF
Supercharge your RDBMS with Elasticsearch
PPTX
PDF
Open Location Data and Linked Open Data
PPTX
PDF
HPTS 2011: The NoSQL Ecosystem
PPTX
Couchbase
PDF
Overview of no sql
PPT
No sql landscape_nosqltips
PDF
Oracle Week 2016 - Modern Data Architecture
PDF
introduction to Neo4j (Tabriz Software Open Talks)
PPTX
PPT
Schema Design
Visualize your graph database
Introduction to Apache HBase
Deven s presentation
RDF Seminar Presentation
NoSQL Roundup
Basic Application Performance Optimization Techniques (Backend)
Deep Dive on ArangoDB
Supercharge your RDBMS with Elasticsearch
Open Location Data and Linked Open Data
HPTS 2011: The NoSQL Ecosystem
Couchbase
Overview of no sql
No sql landscape_nosqltips
Oracle Week 2016 - Modern Data Architecture
introduction to Neo4j (Tabriz Software Open Talks)
Schema Design
Ad

Similar to Text classification With Rapid Miner (20)

PPTX
Practical Distributed Machine Learning Pipelines on Hadoop
PPTX
Women Who Code, Ground Floor
PDF
Using Spring with NoSQL databases (SpringOne China 2012)
PPTX
Relational to Graph - Import
PPTX
Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15
PDF
Building, Debugging, and Tuning Spark Machine Leaning Pipelines-(Joseph Bradl...
PPTX
Building Data Pipelines with Spark and StreamSets
PPTX
Elasticity vs. State? Exploring Kafka Streams Cassandra State Store
PDF
High Concurrency Architecture and Laravel Performance Tuning
PDF
Using SparkML to Power a DSaaS (Data Science as a Service) with Kiran Muglurm...
PDF
Microsoft R - Data Science at Scale
PPTX
NoSQL: Cassadra vs. HBase
PPTX
Navigating NoSQL in cloudy skies
PPTX
Operationalizing security data science for the cloud: Challenges, solutions, ...
PPTX
Presentation1.pptx
PDF
Building a SIMD Supported Vectorized Native Engine for Spark SQL
PDF
Developing polyglot persistence applications (SpringOne China 2012)
PDF
Modern Big Data Analytics Tools: An Overview
PPTX
Migration from Oracle to PostgreSQL: NEED vs REALITY
PPTX
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
Practical Distributed Machine Learning Pipelines on Hadoop
Women Who Code, Ground Floor
Using Spring with NoSQL databases (SpringOne China 2012)
Relational to Graph - Import
Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15
Building, Debugging, and Tuning Spark Machine Leaning Pipelines-(Joseph Bradl...
Building Data Pipelines with Spark and StreamSets
Elasticity vs. State? Exploring Kafka Streams Cassandra State Store
High Concurrency Architecture and Laravel Performance Tuning
Using SparkML to Power a DSaaS (Data Science as a Service) with Kiran Muglurm...
Microsoft R - Data Science at Scale
NoSQL: Cassadra vs. HBase
Navigating NoSQL in cloudy skies
Operationalizing security data science for the cloud: Challenges, solutions, ...
Presentation1.pptx
Building a SIMD Supported Vectorized Native Engine for Spark SQL
Developing polyglot persistence applications (SpringOne China 2012)
Modern Big Data Analytics Tools: An Overview
Migration from Oracle to PostgreSQL: NEED vs REALITY
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
Ad

More from Chakrit Phain (20)

PDF
LLM_PairProgramming.pdf
PPTX
Web scraping with php
PPTX
ChatGPT_Prompts.pptx
PDF
Sentence-BERT
PDF
AI_ML_Softnix.pdf
PPTX
Web Scraping with Python
PPTX
เปรียบเทียบ RPA Opensource
PPTX
PHP Bandwidth Shaping script
PPTX
PHP Explode & Preg_split Test
PPTX
Types of Big Data Analytics
PDF
Genetic Algorithm
PDF
Machine Learning Algorithm & Anomaly detection 2021
PPTX
Ai optimization Example
PPTX
Zabbix aws
PPTX
Anomaly Detection Technique
PPTX
Softnix Anomaly Detection Methods
PDF
Neo4j Graph Database และการประยุกตร์ใช้
PDF
Softnix how ml_work_0.1draft
PPTX
Shell Shock
PPTX
Neo4j introduction
LLM_PairProgramming.pdf
Web scraping with php
ChatGPT_Prompts.pptx
Sentence-BERT
AI_ML_Softnix.pdf
Web Scraping with Python
เปรียบเทียบ RPA Opensource
PHP Bandwidth Shaping script
PHP Explode & Preg_split Test
Types of Big Data Analytics
Genetic Algorithm
Machine Learning Algorithm & Anomaly detection 2021
Ai optimization Example
Zabbix aws
Anomaly Detection Technique
Softnix Anomaly Detection Methods
Neo4j Graph Database และการประยุกตร์ใช้
Softnix how ml_work_0.1draft
Shell Shock
Neo4j introduction

Recently uploaded (20)

PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Encapsulation theory and applications.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Machine learning based COVID-19 study performance prediction
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Modernizing your data center with Dell and AMD
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPT
Teaching material agriculture food technology
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
The Rise and Fall of 3GPP – Time for a Sabbatical?
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
NewMind AI Monthly Chronicles - July 2025
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Encapsulation theory and applications.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Machine learning based COVID-19 study performance prediction
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Spectral efficient network and resource selection model in 5G networks
Encapsulation_ Review paper, used for researhc scholars
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Modernizing your data center with Dell and AMD
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Teaching material agriculture food technology
Building Integrated photovoltaic BIPV_UPV.pdf
Review of recent advances in non-invasive hemoglobin estimation

Text classification With Rapid Miner