DATA MINING
TOPIC: TEXT MINING
Text mining can be described as the process of extracting essential
data from natural language text. All the data that we generate via text
messages, documents, emails, and files is written in natural language.
Text mining is primarily used to draw useful insights or patterns from
such data.
AREAS OF TEXT MINING IN DATA MINING:
The areas of text mining are as follows:
INFORMATION EXTRACTION:
The automatic extraction of structured data, such as entities,
relationships between entities, and attributes describing entities,
from an unstructured source is called information extraction.
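
As a rough illustration of information extraction, the Python sketch below pulls entities (an email address, a date) and one relationship (person works at organization) out of a free-text sentence. The sample sentence, the regular expressions, and the `extract` function are assumptions made for this example; practical systems usually rely on trained language models rather than hand-written patterns.

```python
import re

# Hypothetical sample text; any unstructured sentence could be used.
TEXT = "Alice Smith works at Acme Corp. Contact her at alice@example.com before 2024-05-01."

# Hand-written patterns standing in for a real extraction model.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
DATE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")
NAME = r"[A-Z][a-z]+(?: [A-Z][a-z]+)*"
WORKS_AT = re.compile(rf"({NAME}) works at ({NAME})")


def extract(text):
    """Return structured records found in unstructured text."""
    return {
        "emails": EMAIL.findall(text),
        "dates": DATE.findall(text),
        "employment": [
            {"person": person, "organization": org}
            for person, org in WORKS_AT.findall(text)
        ],
    }


record = extract(TEXT)
print(record)
```

The unstructured sentence becomes a dictionary of typed fields, which is the kind of structured output that later data mining steps can work with.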
NATURAL LANGUAGE PROCESSING:
NLP stands for natural language processing. It enables computer
software to understand human language as it is spoken. NLP is primarily
a component of artificial intelligence (AI). Developing NLP
applications is difficult because computers generally expect humans to
“speak” to them in a programming language that is accurate, clear, and
exceptionally structured. Human speech, by contrast, is often imprecise
and depends on many complex variables, including slang, social context,
and regional dialects.
DATA MINING:
Data mining refers to the extraction of useful information and hidden
patterns from large data sets. Data mining tools can predict behaviors
and future trends, allowing businesses to make better data-driven
decisions. They can also be used to resolve many business problems that
have traditionally been too time-consuming.
INFORMATION RETRIEVAL:
Information retrieval deals with retrieving useful data from the data
stored in our systems. As an analogy, the search engines found on
websites such as e-commerce sites can be viewed as a part of
information retrieval.
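
To make the search-engine analogy concrete, here is a minimal sketch of retrieval over stored text using an inverted index. The tiny product catalogue, the function names, and the choice of boolean AND matching are assumptions for illustration; production search engines add ranking, stemming, and much more.

```python
from collections import defaultdict

# Hypothetical product catalogue standing in for data stored in a system.
DOCS = {
    1: "red running shoes for men",
    2: "blue running jacket",
    3: "red leather wallet",
}


def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every query term (boolean AND)."""
    terms = query.lower().split()
    if not terms:
        return []
    postings = [index.get(term, set()) for term in terms]
    return sorted(set.intersection(*postings))


index = build_index(DOCS)
print(search(index, "red shoes"))
print(search(index, "running"))
```

The index is built once and each query then only touches the posting sets of its own terms, which is why this structure is a common starting point for site search.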
TEXT MINING APPROACHES IN DATA MINING:
The following text mining approaches are used in data mining:
1. KEYWORD-BASED ASSOCIATION ANALYSIS:
It collects sets of keywords or terms that often occur together and
then discovers the association relationships among them. First, it
pre-processes the text data by parsing, stemming, removing stop words,
etc. Once the data has been pre-processed, it applies association
mining algorithms. No human effort is required here, so the number of
unwanted results and the execution time are reduced.
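
The two stages described above can be sketched in Python: pre-process each document into a keyword set, then count which keywords occur together and report pairs as association rules with support and confidence. The mini-corpus, the stop-word list, and the 0.5 support threshold are assumptions chosen for the example, and stemming is left out to keep it short.

```python
from collections import Counter
from itertools import combinations

# Hypothetical mini-corpus and stop-word list, for illustration only.
DOCS = [
    "The bank approved the loan",
    "A loan from the bank carries interest",
    "The river bank flooded",
    "Interest on the loan is high",
]
STOP_WORDS = {"the", "a", "from", "on", "is"}


def keywords(text):
    """Lowercase, split on whitespace, and drop stop words."""
    return {w for w in text.lower().split() if w not in STOP_WORDS}


def association_rules(docs, min_support=0.5):
    """Return (antecedent, consequent, support, confidence) for frequent pairs."""
    term_counts = Counter()
    pair_counts = Counter()
    for doc in docs:
        terms = sorted(keywords(doc))
        term_counts.update(terms)
        pair_counts.update(combinations(terms, 2))
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / len(docs)  # share of documents containing both terms
        if support >= min_support:
            rules.append((a, b, support, count / term_counts[a]))
            rules.append((b, a, support, count / term_counts[b]))
    return sorted(rules)


for rule in association_rules(DOCS):
    print(rule)
```

In this toy corpus every document that mentions "interest" also mentions "loan", so the rule interest -> loan has confidence 1.0, while loan -> interest is weaker because "loan" also appears without "interest".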
2. DOCUMENT CLASSIFICATION ANALYSIS / AUTOMATIC
DOCUMENT CLASSIFICATION:
This analysis is used for the automatic classification of the huge
number of online text documents, such as web pages and emails. Text
document classification differs from the classification of relational
data because document databases are not organized according to
attribute-value pairs.
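
The slides do not name a particular classifier, so the sketch below uses multinomial naive Bayes with Laplace smoothing purely as one common choice. The training documents, the labels, and the function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical labelled training documents.
TRAIN = [
    ("win a free prize now", "spam"),
    ("free offer click now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("project report due monday", "ham"),
]


def train(examples):
    """Count words per class and documents per class."""
    word_counts = defaultdict(Counter)
    class_counts = Counter()
    for text, label in examples:
        class_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocabulary = {w for counts in word_counts.values() for w in counts}
    return word_counts, class_counts, vocabulary


def classify(text, model):
    """Return the class with the highest log posterior (Laplace smoothing)."""
    word_counts, class_counts, vocabulary = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        score = math.log(class_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word in vocabulary:  # ignore words never seen in training
                count = word_counts[label][word]
                score += math.log((count + 1) / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TRAIN)
print(classify("free prize offer", model))
print(classify("monday project meeting", model))
```

Note that the documents are never turned into fixed attribute-value rows: each one is treated as a variable-length sequence of words, which is the difference from relational data that the slide points out.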
TEXT MINING PROCESS:
The text mining process incorporates the following steps to extract
data from a document.
TEXT TRANSFORMATION:
Text transformation is a technique used to control the capitalization
of the text. The two major ways of representing a document are:
1. Bag of words
2. Vector space
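
A small sketch of both representations, assuming two made-up documents: the text is lowercased (controlling capitalization), turned into a bag of words, and then mapped to vectors over a shared vocabulary so that the documents can be compared with cosine similarity. The function names are illustrative.

```python
import math
from collections import Counter

# Hypothetical documents used only to illustrate the two representations.
DOCS = ["Text mining finds patterns in text", "Data mining finds patterns in data"]


def bag_of_words(text):
    """Bag of words: term -> number of occurrences, word order discarded."""
    return Counter(text.lower().split())


def to_vector(bag, vocabulary):
    """Vector space: one dimension per vocabulary term, in a fixed order."""
    return [bag[term] for term in vocabulary]


def cosine(u, v):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


bags = [bag_of_words(d) for d in DOCS]
vocabulary = sorted(set().union(*bags))
vectors = [to_vector(b, vocabulary) for b in bags]
print(vocabulary)
print(vectors)
print(round(cosine(vectors[0], vectors[1]), 3))
```

The bag of words keeps only term counts, while the vector space model fixes an order over the vocabulary so that every document becomes a point in the same space.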
TEXT PRE-PROCESSING:
Pre-processing is a significant task and a critical step in text
mining, natural language processing (NLP), and information retrieval
(IR). In text mining, pre-processing prepares unstructured text data so
that useful information and knowledge can be extracted from it.
Information retrieval is a matter of choosing which documents in a
collection should be retrieved to fulfill the user’s need.
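
A typical pre-processing pipeline might look like the sketch below: tokenize, remove stop words, and stem. The stop-word list is deliberately tiny and the suffix-stripping `stem` function is a crude stand-in for a real stemmer such as Porter's; both are assumptions for illustration.

```python
import re

# Small illustrative stop-word list; real lists are much longer.
STOP_WORDS = {"the", "is", "a", "an", "of", "and", "in", "to", "are"}
# Crude suffix list standing in for a real stemming algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def tokenize(text):
    """Lowercase the text and keep runs of letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())


def stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, drop stop words, then stem what remains."""
    return [stem(t) for t in tokenize(text) if t not in STOP_WORDS]


print(preprocess("The miners are extracting patterns in the documents."))
```

After this step, different surface forms of a word collapse onto one token, which shrinks the vocabulary that the later steps have to handle.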
FEATURE SELECTION:
Feature selection is a significant part of data mining. It can be
defined as the process of reducing the input to processing, or of
finding the most essential information sources. Feature selection is
also called variable selection.
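
One simple way to reduce the input is to keep only terms that appear in enough documents. The slides do not prescribe a method, so the document-frequency threshold below, along with the sample documents and the `min_df` parameter name, is an assumption for illustration.

```python
from collections import Counter

# Hypothetical tokenized documents (already pre-processed).
DOCS = [
    ["loan", "bank", "interest"],
    ["bank", "river", "flood"],
    ["loan", "interest", "rate"],
    ["bank", "loan", "credit"],
]


def select_features(docs, min_df=2, max_features=None):
    """Keep terms that appear in at least `min_df` documents."""
    doc_freq = Counter(term for doc in docs for term in set(doc))
    kept = [(term, df) for term, df in doc_freq.items() if df >= min_df]
    # Most frequent first; ties broken alphabetically for a stable result.
    kept.sort(key=lambda item: (-item[1], item[0]))
    return [term for term, _ in kept[:max_features]]


def project(doc, features):
    """Reduce one document to the selected features only."""
    return [term for term in doc if term in features]


features = select_features(DOCS)
print(features)
print(project(DOCS[1], features))
```

Seven distinct terms are reduced to three here; rare terms such as "river" are dropped because they occur in only one document.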
APPLICATIONS:
The following are applications of text mining:
Risk Management:
Risk management is a systematic and logical procedure for identifying, analyzing,
treating, and monitoring the risks involved in any action or process in
organizations. Insufficient risk analysis is often a leading cause of failure.
Customer Care Service:
Text mining methods, particularly NLP, are finding increasing significance in the
field of customer care. Text from different sources, such as customer feedback,
surveys, and customer calls, is analyzed with the primary objective of reducing the
organization's response time and addressing customer complaints rapidly and
productively.
Business Intelligence:
Companies and business firms have started to use text mining strategies as a major
aspect of their business intelligence. Besides providing significant insights into
customer behavior and trends, text mining strategies also help organizations to
analyze the strengths and weaknesses of their competitors, giving them a
competitive advantage in the market.
Social Media Analysis:
Social media analysis helps to track online data, and there are numerous text
mining tools designed particularly for performance analysis of social media sites.
These tools help to monitor and interpret the text generated via the internet from
news, emails, blogs, etc.
DATA MINING:
In this step, the text mining procedure merges with the conventional
data mining process. Classic data mining procedures are applied to the
structured database.
EVALUATE:
Afterward, the results are evaluated. Once the results have been
evaluated, they are discarded.
