SlideShare a Scribd company logo
RapidMiner - Don’t Forget to Pack Text
Analytics on Your Data Exploration Journey
February 2018
About Basis Technology
2
● Expertise: find meaning in unstructured text
○ NLP provider
● Product: Rosette (text analytics)
○ both on-premise and in the ‘cloud’
● Gil Irizarry - Director of Engineering
What will you learn?
3
● Where to find the Rosette operator
● How to use NLP operators within RapidMiner Studio
● What types of insights can we gain from text
How to apply what you’ll learn
4
● All the operators are
available in the
RapidMiner Marketplace.
● Free tier in the public
Rosette API allows 10K
calls per month
Sample Use Cases
5
● How engaged are fans of different TV shows?
● How is technology covered in the NY Times?
● Does having a location in an Airbnb title affect number of reviews?
● What are the most recurring names in foreign news?
Assess product engagement via sentiment analysis
6
Assess product engagement via sentiment analysis
7
Assess Media Coverage
8
Assess Media Coverage
9
Use Entities to determine Correlation
10
Use Entities to determine Correlation
11
Translate and Deduplicate name list
12
Translate and Deduplicate name list
13

More Related Content

PDF
Cloud run - Serverless Containers Done Right
PPTX
Si Funding Ppt Main
PDF
The Art of Deploying Artifacts to Production With Confidence
PDF
Text Analytics 2009: User Perspectives on Solutions and Providers
PPTX
The Next-Generation SharePoint: Powered by Text Analytics
PPTX
The Next Generation SharePoint: Powered by Text Analytics
PDF
Paper id 26201475
PDF
Getting Started with Unstructured Data
Cloud run - Serverless Containers Done Right
Si Funding Ppt Main
The Art of Deploying Artifacts to Production With Confidence
Text Analytics 2009: User Perspectives on Solutions and Providers
The Next-Generation SharePoint: Powered by Text Analytics
The Next Generation SharePoint: Powered by Text Analytics
Paper id 26201475
Getting Started with Unstructured Data

Similar to RapidMiner - Don’t Forget to Pack Text Analytics on Your Data Exploration Journey (20)

PDF
When to use the different text analytics tools - Meaning Cloud
PPTX
Text Analytics Past, Present & Future
PPTX
Welcome - 2011 Text Analytics Summit
KEY
Big data 4 webmonday
PPT
Text Analytics Market Trends
PDF
7th Annual Text Analytics Summit
PDF
7th Annual Text Analytics Summit Brochure
PDF
Graduation Thesis Sample
PPTX
Text Analytics Past, Present & Future: An Industry View
PPTX
Applying ocr to extract information : Text mining
PDF
Veda Semantics - introduction document
PPT
Applying Data Mining for News Analytics
PDF
Industry applications of text analysis
PDF
Text Analytics 2014: User Perspectives on Solutions and Providers
PPTX
Text Analytics Today
PPT
Predictive Text Analytics
PPTX
Text Analytics Applied (LIDER roadmapping presentation)
PDF
Image Retrieval and Analysis Using Text and Fuzzy Shape Features Emerging Res...
PPTX
Knowledge Extraction from Social Media
PDF
Video Search And Mining 1st Edition Mattia Broilo Nicola Piotto
When to use the different text analytics tools - Meaning Cloud
Text Analytics Past, Present & Future
Welcome - 2011 Text Analytics Summit
Big data 4 webmonday
Text Analytics Market Trends
7th Annual Text Analytics Summit
7th Annual Text Analytics Summit Brochure
Graduation Thesis Sample
Text Analytics Past, Present & Future: An Industry View
Applying ocr to extract information : Text mining
Veda Semantics - introduction document
Applying Data Mining for News Analytics
Industry applications of text analysis
Text Analytics 2014: User Perspectives on Solutions and Providers
Text Analytics Today
Predictive Text Analytics
Text Analytics Applied (LIDER roadmapping presentation)
Image Retrieval and Analysis Using Text and Fuzzy Shape Features Emerging Res...
Knowledge Extraction from Social Media
Video Search And Mining 1st Edition Mattia Broilo Nicola Piotto
Ad

More from Gil Irizarry (17)

PDF
A Rose By Any Other Name.pdf
PPTX
[Apple-organization] and [oranges-fruit] - How to evaluate NLP tools - Basis ...
PPTX
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entit...
PPTX
Ai for Good: Bad Guys, Messy Data, & NLP
PPTX
DevSecOps Orchestration of Text Analytics with Containers
PDF
Towards Identity Resolution: The Challenge of Name Matching
PPT
Beginning Native Android Apps
PPTX
From Silos to DevOps: Our Story
PPTX
Make Cross-platform Mobile Apps Quickly - SIGGRAPH 2014
PPTX
Graphics on the Go
PPTX
Make Mobile Apps Quickly
PPTX
Building The Agile Enterprise - LSSC '12
PPTX
Agile The Kanban Way - Central MA PMI 2011
PPTX
Transitioning to Kanban: Theory and Practice - Project Summit Boston 2011
PPTX
Transitioning to Kanban - Aug 11
PPTX
Transitioning to Kanban
PPTX
Beyond Scrum of Scrums
A Rose By Any Other Name.pdf
[Apple-organization] and [oranges-fruit] - How to evaluate NLP tools - Basis ...
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entit...
Ai for Good: Bad Guys, Messy Data, & NLP
DevSecOps Orchestration of Text Analytics with Containers
Towards Identity Resolution: The Challenge of Name Matching
Beginning Native Android Apps
From Silos to DevOps: Our Story
Make Cross-platform Mobile Apps Quickly - SIGGRAPH 2014
Graphics on the Go
Make Mobile Apps Quickly
Building The Agile Enterprise - LSSC '12
Agile The Kanban Way - Central MA PMI 2011
Transitioning to Kanban: Theory and Practice - Project Summit Boston 2011
Transitioning to Kanban - Aug 11
Transitioning to Kanban
Beyond Scrum of Scrums
Ad

Recently uploaded (20)

PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Digital Systems & Binary Numbers (comprehensive )
PPTX
CHAPTER 2 - PM Management and IT Context
PPTX
Computer Software and OS of computer science of grade 11.pptx
PDF
Digital Strategies for Manufacturing Companies
PPTX
Transform Your Business with a Software ERP System
PDF
PTS Company Brochure 2025 (1).pdf.......
PDF
Understanding Forklifts - TECH EHS Solution
PPTX
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
PDF
top salesforce developer skills in 2025.pdf
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PPTX
assetexplorer- product-overview - presentation
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
ai tools demonstartion for schools and inter college
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Digital Systems & Binary Numbers (comprehensive )
CHAPTER 2 - PM Management and IT Context
Computer Software and OS of computer science of grade 11.pptx
Digital Strategies for Manufacturing Companies
Transform Your Business with a Software ERP System
PTS Company Brochure 2025 (1).pdf.......
Understanding Forklifts - TECH EHS Solution
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
top salesforce developer skills in 2025.pdf
wealthsignaloriginal-com-DS-text-... (1).pdf
assetexplorer- product-overview - presentation
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Softaken Excel to vCard Converter Software.pdf
Design an Analysis of Algorithms I-SECS-1021-03
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
ai tools demonstartion for schools and inter college
Upgrade and Innovation Strategies for SAP ERP Customers

RapidMiner - Don’t Forget to Pack Text Analytics on Your Data Exploration Journey

Editor's Notes

  • #3: Expertise: find meaning in unstructured text Product: Rosette (text analytics) NLP provider- area of computer science aimed at applying software to tackle problems in text analytics Rosette is available both on-premise and in the ‘cloud’ Gil Irizarry is a Director of Engineering at Basis, managing teams working on name matching and identity resolution
  • #6: Assess product engagement via sentiment analysis Question: How engaged are fans of different TV shows? Assess media coverage of particular topics Question: How is technology covered in the NY Times? Determine effectiveness of posts via entity extraction Question: Does having a location in an Airbnb title affect number of reviews? Translate and Deduplicate name list Question: What are the most recurring names in foreign news?