SlideShare a Scribd company logo
IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 335
HARMONIZED SCHEME FOR DATA MINING TECHNIQUE TO
PROGRESS DECISION SUPPORT SYSTEM IN AN UNCERTAIN
SITUATION
B.Swathi1
, M.Praveena2
, L.Kavitha3
1
Nri Institute of technology, CSE Dept, 2
Nri Institute of technology, IT department, 3
Nri institute of technology, MCA
Department
Abstract
Decision Support System (DSS) is equivalent synonym as management information systems (MIS). Decision supporting systems
include also decisions made upon individual data from external sources, management feeling, and various other data sources not
included in business intelligence. They serve as an integrated repository for internal and external data-intelligence critical to
understanding and evaluating the business within its environmental context. Data mining have emerged to meet this need. With the
addition of models, analytic tools, and user interfaces, they have the potential to provide actionable information that supports
effective problem and opportunity identification, critical decision-making, and strategy formulation, implementation, and
evaluation. The proposed system will support top level management to make a good decision in any time under any uncertain
environment.
Keywords: Dss, Dm, Mis, Clustering, Classification, Association Rule, K-Mean, Olap, Matlab
-----------------------------------------------------------------------***---------------------------------------------------------------------
1. INTRODUCTION
Decision Support System (DSS) is equivalent synonym as
management information systems (MIS). Decision supporting
systems include also decisions made upon individual data
from external sources, management feeling, and various other
data sources not included in business intelligence. With the
addition of models, analytic tools, and user interfaces, they
have the potential to provide actionable information that
supports effective problem and opportunity identification,
critical decision-making, and strategy formulation,
implementation, and evaluation. The proposed system will
support top level management to make a good decision in any
time under any uncertain environment [4]. This study aim to
investigate the adoption process of decision making under
uncertain situations or highly risk environments effecting in
decision of investing stoke cash of bank. This applied for two
types of usage investment - direct or indirect - or credit and
any sector of investment will be highly or moderate or low
risk. And select which one of this sectors risk „rejected‟ or un-
risk „accepted‟ all that under uncertain environments such as;
political, economical, marketing, operational, internal policies
and natural crises, all that using the contribution of this study
enhancing k-mean algorithm to improve the results and
comparing results between original algorithm and enhanced
algorithm. The paper is divided into four sections; section two
is a background and related work it is divided into two parts,
part one is about DSS, part two is about DM. Section three
presents the proposed Investing Data Mining System „IDMS.
Section four presents conclusion and finally section five
present future works2. Tables, Figures and Equations.
2. BACKGROUND AND RELATED WORK
2.1 Decision Support System (DSS)
DSS includes a body of knowledge that describes some
aspects of the decision maker's world that specifies how to
accomplish various tasks, that indicates what conclusions are
valid in different circumstances [4].The expected benefits of
DSS that discovered are higher decision quality, improved
communication, cost reduction, increased productivity, time
savings, improved customer satisfaction and improved
employee satisfaction. DSS is a computer-based system
consisting of three main interacting components:
• A language system: a mechanism to provide
communication between the user and other
components of the DSS.
• A knowledge system: A repository of problem
domain knowledge embodied in DSS as either data or
procedures.
• A problem processing system: a link between the
other two components, containing one or more of the
general problem manipulation capabilities required
for decision-making.
IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 336
Fig 1: DSS Main Components
After surveying multiple decision support systems, it is
concluded that decision support systems are categorized into
the following [5]:
• File drawer systems: This category of DSS provides
access to data items.
• Data analysis systems: Those support the
manipulation of data by computerized tools tailored
to a specific task or by more general tools and
operators.
• Analytical information systems: Those provide
access to a series of decision-oriented databases.
• Accounting and financial models: those calculate
the consequences of possible actions.
• Representational models: those estimate the
consequences of actions based on simulation models
that include relationships that are causal as well as
accounting definitions.
• Optimization models: those provide guidelines for
actions by generating an optimal solution consistent
with a series of constraints.
• Suggestion models: those perform the logical
processing leading to a specific suggested decision or
a fairly structured or well understood task.
This section describes the approaches and techniques mostly
used when developing data warehousing systems that data
warehousing approaches such as; Online Analytical
Processing „OLAP‟, Data Mining „DM‟ and Artificial
Intelligence „AI‟. Data Warehousing Techniques such as; Star
Schema, Snowflake Schema and A Star Schema or a
Snowflake Schema
2.2 Data Mining Techniques (DM)
Data mining is the process of analyzing data from different
perspectives and summarizing it into useful information [10].
DM techniques are the result of a long process of research and
product development [10]. The evolution of DM [6] is shown
in table 1.
Table 1: The evolution by DM [6]
Steps for applying DM:
1. Definition of the business objective and expected
operational environment.
2. Data selection is required to identify meaningful sample of
data.
3.Data transformation that involves data representation in an
appropriate format for mining algorithm.
4. Selection and implementation of data mining algorithm
depends on the mining objective.
5.Analysis of the discovered outcomes is needed to formulate
business outcomes.
6. Representing valuable business outcomes.
IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 337
Data mining consists of five major elements; to extract, to
transform, and to load transaction data onto the data
warehouse system ,to store and manage the data in a
multidimensional database system, to provide data access to
business analysts and information technology professionals,
Analyze the data by application software, and finally to
present the data in a useful format, such as a graph or table.
DM techniques usually fall into two categories, predictive or
descriptive. Predictive DM uses historical data to infer
something about future events. Predictive mining tasks use
data to build a model to make predictions on unseen future
events. Descriptive DM aims to find patterns in the data that
provide some information about internal hidden relationships.
Descriptive mining tasks characterize the general properties of
the data and represent it in a meaningful way. Figure2 shows
the classification of DM techniques.
Association Rule is used to disc over relationships between
attribute sets for a given input pattern. [6] Define sequence
discovery as "a sequential technique is a given set of
sequences find the complete set of frequent subsequences”.
Clustering is “the process of organizing objects into groups
whose members are alike in some way” [7]. So, it deals with
finding the internal structure in a collection of data, as shown
in figure 3.
Fig 3: Simple graphical for clustering data [7]
[8] Define that “Clustering involves identifying a finite set of
categories or segments „clusters‟ to describe the data
according to a certain metric". [9] Define that “Clustering
enables to find specific discriminative factors or attributes for
the studied data. Each member of a cluster should be very
similar to other member s in its cluster and very dissimilar to
other clusters. When a new data is introduced, it is classified
into the most similar cluster". Several researchers classified
clustering algorithms differently.
Some classifies clusters as mutually exclusive, hierarchical or
overlapping. Others classifies cluster into hierarchal and
partitional. The most common classification is shown in figure
3. Techniques for creating clusters include partitioning
methods as in k-means algorithm, and hierarchical methods as
in decision trees, and density-based methods.
Fig 4: Clustering methods classifications for Moses Charikar
[10]
3. THE PROPOSED 'IDM S'
Investment Data Mining System „IDMS‟ aims to build a data
mining system for investment in the banking sector. IDMS
consists of several components; data gathering, preparing data
to discover knowledge, data preprocessing, using data mining
techniques in sequences steps start with classification data,
clustering data especially using K-mean algorithm and
enhanced K-mean algorithm to set which best result and then
set and run association rules to solve problem, post processing
and finally get result and visualize result to create best
decision to take a good decision for investment under
uncertain situations. IDMS Shown in figure 5.
Hardware for applying the IDMC system is a personal
computer configurations with this Processor 3.2, Hard Disk
160 gaga, Ram 2 G and Monitor 17 Inch. Operating system is
windows XP services pack 3. Several software tools have been
used. The first is Microsoft Excel sheets 2007 and has been
used for analysis and filtering data. Mat-lab version 6.5 has
been used in data preprocessing and data classification. The
IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 338
last software is the WEKA which is a collection of java tools
for DM written by staff at the University Of Waikato, New
Zealand.
Fig 5: Proposed IDMS
CONCLUSIONS
This paper represents a contribution of applying DM and DSS
for banking sector especially in investment which has been
rarely addressed before. IDMS is a new proposed system
which is simple, straightforward with low computation needs.
The proposed preprocessing component is an aggregation of
several known steps. The banking is planning a set of actions
in accordance of IDMS outcomes for decision making in
investment sector. The investment department in the banking
is starting to analyze the approached investment sector, to
introduce a good decision under uncertain situation.
FUTURE WORK
In next step of this study implementing this proposed approach
and creating full steps of preprocessing using enhanced k-
mean algorithm by real data from bank as a case study, to give
us a best result and support high level of management with a
good decision. After implementing this case study and
proposed system will get the main factors which effect in
decision under uncertain
REFERENCES
[1] A. Hunter and S. Parsons, "A review of uncertainty
handling formalisms", Applications of Uncertainty
Formalisms, LNAI 1455, pp.8-37. Springer -Verlag, 1998.
[2] E. Hernandez and J. Recasens, "A general framework for
induction of decision trees under uncertainty", Modelling with
Words, LNAI 2873, pp.26–43, Springer-Verlag, 2003.
[3] M. S. Chen, J. Han, and P. S. Yu. IEEE Trans Knowledge
and Data Engineering Data mining. An overview from a
database perspective, 8:866-883, 1996.
[4]. U. Fayyad, G. Piatetsky-Shapiro and W. J. Frawley.
AAAI/MIT, Press definition of KDD at KDD96. Knowledge
Discovery in Databases, 1991.
[5]. Gartner. Evolution of data mining, Gartner Group
Advanced Technologies and Applications Research Note,
2/1/95.
[6]. International Conferences on Knowledge Discovery in
Databases and Data Mining (KDD‟95-98), 1995-1998.
[7]. R.J. Miller and Y. Yang. Association rules over interval
data. SIGMOD'97, 452-461, Tucson, Arizona, 1997.
[8]. Zaki, M.J., SPADE An Efficient Algorithm for Mining
Frequent Sequences Machine Learning, 42(1) 31-60, 2001.
[9]. Osmar R. Zaïane. “Principles of Knowledge Discovery in
Databases - Chapter 8 Data Clustering”. & Shantanu Godbole
data mining Data mining Workshop 9th November 2003.
[10]. T.Imielinski and H. Mannila. Communications of ACM.
A database perspective on knowledge discovery, 39:58-64,
1996.
[11]. BIRCH Zhang, T., Ramakrishnan, R., and Livny, M.
SIGMOD '96. BIRCH an efficient data clustering method for
very large databases. 1996.
[12] Pascal Poncelet, Florent Masseglia and Maguelonne
Teisseire (Editors). Information Science Reference. Data
Mining Patterns New Methods and Applications, ISBN 978
1599041629, October 2007.
[13]. Thearling K, Exchange Applications White Paper, Inc.
increasing customer value by integrating data mining and
campaign management software, 1998.
[14]. Noah Gans, Spring. Service Operations Management,
Vol. 5, No. 2, 2003.
[15]. Joun Mack. IEEE TRANSACTIONS ON PATTERN
ANALYSIS AND MACHINE INTELLIGENCE. An Efficient
k-Means Clustering Algorithm, Analysis and Implementation,
VOL. 24, NO. 7, JULY 2002.
[16]. Andrew Moore and Brian T. Luke. Tutorial Slides, K-
means and Hierarchical Clustering and K-Means Clustering,
Slide 15, 2003.
[17]. E. Turban, J. E. Aronson, T. Liang, and R. Sharda,
Decision Support and Business Intelligence Systems, eighth
edition. Prentice Hall, 2007.

More Related Content

PDF
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
PDF
Developing Sales Information System Application using Prototyping Model
PDF
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
PDF
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
PDF
International Refereed Journal of Engineering and Science (IRJES)
PDF
Selecting Experts Using Data Quality Concepts
PDF
A simulated decision trees algorithm (sdt)
PDF
6. ijece guideforauthors 2012_2 eidt sat
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Developing Sales Information System Application using Prototyping Model
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
International Refereed Journal of Engineering and Science (IRJES)
Selecting Experts Using Data Quality Concepts
A simulated decision trees algorithm (sdt)
6. ijece guideforauthors 2012_2 eidt sat

What's hot (18)

PPTX
Lecture 02 - The Data Warehouse Environment
PDF
IRJET- Fault Detection and Prediction of Failure using Vibration Analysis
PDF
Technique for Order Preference by Similarity to Ideal Solution as Decision Su...
PDF
6 ijaems sept-2015-6-a review of data security primitives in data mining
PPTX
System Data Modelling Tools
PDF
Data Transformation Technique for Protecting Private Information in Privacy P...
PDF
Data discrimination prevention in customer relationship managment
DOC
Systems Lifecycle workbook
PDF
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREE
PDF
Data Warehouse: A Primer
PDF
50120130406032
PDF
AUTO-CDD: automatic cleaning dirty data using machine learning techniques
PDF
External data preprocessing for efficient sorting
PPT
D01 etl
PDF
Performance Analysis of Selected Classifiers in User Profiling
PDF
Literature review of attribute level and
PDF
Characterizing and Processing of Big Data Using Data Mining Techniques
PDF
ijcatr04081001
Lecture 02 - The Data Warehouse Environment
IRJET- Fault Detection and Prediction of Failure using Vibration Analysis
Technique for Order Preference by Similarity to Ideal Solution as Decision Su...
6 ijaems sept-2015-6-a review of data security primitives in data mining
System Data Modelling Tools
Data Transformation Technique for Protecting Private Information in Privacy P...
Data discrimination prevention in customer relationship managment
Systems Lifecycle workbook
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREE
Data Warehouse: A Primer
50120130406032
AUTO-CDD: automatic cleaning dirty data using machine learning techniques
External data preprocessing for efficient sorting
D01 etl
Performance Analysis of Selected Classifiers in User Profiling
Literature review of attribute level and
Characterizing and Processing of Big Data Using Data Mining Techniques
ijcatr04081001
Ad

Viewers also liked (20)

PDF
Availability analysis of parallel transit fuel system in petrol engine under ...
PDF
Stability and surface free energy analysis of a liquid drop on a horizontal c...
PDF
Analysis of zone routing protocol in manet
PDF
Privacy preservation techniques in data mining
PDF
Capacity and performance analysis of suame
PDF
Comparative analysis of singularities of 3 prs and 3-
PDF
Study of bad block management and wear leveling in
PDF
Comparison of various noise mitigation technique used
PDF
A survey on congestion control mechanisms
PDF
Online social network mining current trends and research issues
PDF
A quantitative risk assessment approach in an integrated cold chain system en...
PDF
Comparison of flow analysis of a sudden and gradual change
PDF
Collectors innovation to increase performance solar
PDF
Grid fabrication of traffic maintenance system clustering at road junctions
PDF
A short review on microemulsion and its application in extraction of vegetabl...
PDF
Biodiesel as a blended fuel in compression ignition
PDF
Assessment of the leachability and mechanical stability of mud from a zinc pl...
PDF
Buckling analysis of line continuum with new matrices of stiffness and geometry
PDF
A hydration study by xrdrietveld analysis of cement regenerated from complete...
PDF
“Remedies over the obstacles in implementing automation in indian infrastruct...
Availability analysis of parallel transit fuel system in petrol engine under ...
Stability and surface free energy analysis of a liquid drop on a horizontal c...
Analysis of zone routing protocol in manet
Privacy preservation techniques in data mining
Capacity and performance analysis of suame
Comparative analysis of singularities of 3 prs and 3-
Study of bad block management and wear leveling in
Comparison of various noise mitigation technique used
A survey on congestion control mechanisms
Online social network mining current trends and research issues
A quantitative risk assessment approach in an integrated cold chain system en...
Comparison of flow analysis of a sudden and gradual change
Collectors innovation to increase performance solar
Grid fabrication of traffic maintenance system clustering at road junctions
A short review on microemulsion and its application in extraction of vegetabl...
Biodiesel as a blended fuel in compression ignition
Assessment of the leachability and mechanical stability of mud from a zinc pl...
Buckling analysis of line continuum with new matrices of stiffness and geometry
A hydration study by xrdrietveld analysis of cement regenerated from complete...
“Remedies over the obstacles in implementing automation in indian infrastruct...
Ad

Similar to Harmonized scheme for data mining technique to progress decision support system in an uncertain situation (20)

PDF
Enhanced K-Mean Algorithm to Improve Decision Support System Under Uncertain ...
PDF
Au2640944101
PDF
Data Mining System and Applications: A Review
PDF
Data mining techniques
PDF
The Survey of Data Mining Applications And Feature Scope
PDF
Study of Data Mining Methods and its Applications
PDF
Variance rover system
PDF
Variance rover system web analytics tool using data
PDF
Data mining techniques a survey paper
PDF
A study and survey on various progressive duplicate detection mechanisms
PDF
New approaches of Data Mining for the Internet of things with systems: Litera...
PDF
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
PDF
Evaluating the efficiency of rule techniques for file classification
PDF
Different Classification Technique for Data mining in Insurance Industry usin...
PDF
Evaluating the efficiency of rule techniques for file
PDF
Applications Of Clustering Techniques In Data Mining A Comparative Study
PDF
IRJET- Medical Data Mining
PDF
Developing Sales Information System Application using Prototyping Model
PDF
Introduction to feature subset selection method
PPT
Data management new
Enhanced K-Mean Algorithm to Improve Decision Support System Under Uncertain ...
Au2640944101
Data Mining System and Applications: A Review
Data mining techniques
The Survey of Data Mining Applications And Feature Scope
Study of Data Mining Methods and its Applications
Variance rover system
Variance rover system web analytics tool using data
Data mining techniques a survey paper
A study and survey on various progressive duplicate detection mechanisms
New approaches of Data Mining for the Internet of things with systems: Litera...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Evaluating the efficiency of rule techniques for file classification
Different Classification Technique for Data mining in Insurance Industry usin...
Evaluating the efficiency of rule techniques for file
Applications Of Clustering Techniques In Data Mining A Comparative Study
IRJET- Medical Data Mining
Developing Sales Information System Application using Prototyping Model
Introduction to feature subset selection method
Data management new

More from eSAT Publishing House (20)

PDF
Likely impacts of hudhud on the environment of visakhapatnam
PDF
Impact of flood disaster in a drought prone area – case study of alampur vill...
PDF
Hudhud cyclone – a severe disaster in visakhapatnam
PDF
Groundwater investigation using geophysical methods a case study of pydibhim...
PDF
Flood related disasters concerned to urban flooding in bangalore, india
PDF
Enhancing post disaster recovery by optimal infrastructure capacity building
PDF
Effect of lintel and lintel band on the global performance of reinforced conc...
PDF
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
PDF
Wind damage to buildings, infrastrucuture and landscape elements along the be...
PDF
Shear strength of rc deep beam panels – a review
PDF
Role of voluntary teams of professional engineers in dissater management – ex...
PDF
Risk analysis and environmental hazard management
PDF
Review study on performance of seismically tested repaired shear walls
PDF
Monitoring and assessment of air quality with reference to dust particles (pm...
PDF
Low cost wireless sensor networks and smartphone applications for disaster ma...
PDF
Coastal zones – seismic vulnerability an analysis from east coast of india
PDF
Can fracture mechanics predict damage due disaster of structures
PDF
Assessment of seismic susceptibility of rc buildings
PDF
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
PDF
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Likely impacts of hudhud on the environment of visakhapatnam
Impact of flood disaster in a drought prone area – case study of alampur vill...
Hudhud cyclone – a severe disaster in visakhapatnam
Groundwater investigation using geophysical methods a case study of pydibhim...
Flood related disasters concerned to urban flooding in bangalore, india
Enhancing post disaster recovery by optimal infrastructure capacity building
Effect of lintel and lintel band on the global performance of reinforced conc...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to buildings, infrastrucuture and landscape elements along the be...
Shear strength of rc deep beam panels – a review
Role of voluntary teams of professional engineers in dissater management – ex...
Risk analysis and environmental hazard management
Review study on performance of seismically tested repaired shear walls
Monitoring and assessment of air quality with reference to dust particles (pm...
Low cost wireless sensor networks and smartphone applications for disaster ma...
Coastal zones – seismic vulnerability an analysis from east coast of india
Can fracture mechanics predict damage due disaster of structures
Assessment of seismic susceptibility of rc buildings
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...

Recently uploaded (20)

PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
composite construction of structures.pdf
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
Construction Project Organization Group 2.pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
DOCX
573137875-Attendance-Management-System-original
PPTX
Geodesy 1.pptx...............................................
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
Internet of Things (IOT) - A guide to understanding
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Strings in CPP - Strings in C++ are sequences of characters used to store and...
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
composite construction of structures.pdf
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Construction Project Organization Group 2.pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
573137875-Attendance-Management-System-original
Geodesy 1.pptx...............................................
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Lecture Notes Electrical Wiring System Components
CH1 Production IntroductoryConcepts.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Internet of Things (IOT) - A guide to understanding

Harmonized scheme for data mining technique to progress decision support system in an uncertain situation

  • 1. IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163 __________________________________________________________________________________________ Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 335 HARMONIZED SCHEME FOR DATA MINING TECHNIQUE TO PROGRESS DECISION SUPPORT SYSTEM IN AN UNCERTAIN SITUATION B.Swathi1 , M.Praveena2 , L.Kavitha3 1 Nri Institute of technology, CSE Dept, 2 Nri Institute of technology, IT department, 3 Nri institute of technology, MCA Department Abstract Decision Support System (DSS) is equivalent synonym as management information systems (MIS). Decision supporting systems include also decisions made upon individual data from external sources, management feeling, and various other data sources not included in business intelligence. They serve as an integrated repository for internal and external data-intelligence critical to understanding and evaluating the business within its environmental context. Data mining have emerged to meet this need. With the addition of models, analytic tools, and user interfaces, they have the potential to provide actionable information that supports effective problem and opportunity identification, critical decision-making, and strategy formulation, implementation, and evaluation. The proposed system will support top level management to make a good decision in any time under any uncertain environment. Keywords: Dss, Dm, Mis, Clustering, Classification, Association Rule, K-Mean, Olap, Matlab -----------------------------------------------------------------------***--------------------------------------------------------------------- 1. INTRODUCTION Decision Support System (DSS) is equivalent synonym as management information systems (MIS). Decision supporting systems include also decisions made upon individual data from external sources, management feeling, and various other data sources not included in business intelligence. With the addition of models, analytic tools, and user interfaces, they have the potential to provide actionable information that supports effective problem and opportunity identification, critical decision-making, and strategy formulation, implementation, and evaluation. The proposed system will support top level management to make a good decision in any time under any uncertain environment [4]. This study aim to investigate the adoption process of decision making under uncertain situations or highly risk environments effecting in decision of investing stoke cash of bank. This applied for two types of usage investment - direct or indirect - or credit and any sector of investment will be highly or moderate or low risk. And select which one of this sectors risk „rejected‟ or un- risk „accepted‟ all that under uncertain environments such as; political, economical, marketing, operational, internal policies and natural crises, all that using the contribution of this study enhancing k-mean algorithm to improve the results and comparing results between original algorithm and enhanced algorithm. The paper is divided into four sections; section two is a background and related work it is divided into two parts, part one is about DSS, part two is about DM. Section three presents the proposed Investing Data Mining System „IDMS. Section four presents conclusion and finally section five present future works2. Tables, Figures and Equations. 2. BACKGROUND AND RELATED WORK 2.1 Decision Support System (DSS) DSS includes a body of knowledge that describes some aspects of the decision maker's world that specifies how to accomplish various tasks, that indicates what conclusions are valid in different circumstances [4].The expected benefits of DSS that discovered are higher decision quality, improved communication, cost reduction, increased productivity, time savings, improved customer satisfaction and improved employee satisfaction. DSS is a computer-based system consisting of three main interacting components: • A language system: a mechanism to provide communication between the user and other components of the DSS. • A knowledge system: A repository of problem domain knowledge embodied in DSS as either data or procedures. • A problem processing system: a link between the other two components, containing one or more of the general problem manipulation capabilities required for decision-making.
  • 2. IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163 __________________________________________________________________________________________ Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 336 Fig 1: DSS Main Components After surveying multiple decision support systems, it is concluded that decision support systems are categorized into the following [5]: • File drawer systems: This category of DSS provides access to data items. • Data analysis systems: Those support the manipulation of data by computerized tools tailored to a specific task or by more general tools and operators. • Analytical information systems: Those provide access to a series of decision-oriented databases. • Accounting and financial models: those calculate the consequences of possible actions. • Representational models: those estimate the consequences of actions based on simulation models that include relationships that are causal as well as accounting definitions. • Optimization models: those provide guidelines for actions by generating an optimal solution consistent with a series of constraints. • Suggestion models: those perform the logical processing leading to a specific suggested decision or a fairly structured or well understood task. This section describes the approaches and techniques mostly used when developing data warehousing systems that data warehousing approaches such as; Online Analytical Processing „OLAP‟, Data Mining „DM‟ and Artificial Intelligence „AI‟. Data Warehousing Techniques such as; Star Schema, Snowflake Schema and A Star Schema or a Snowflake Schema 2.2 Data Mining Techniques (DM) Data mining is the process of analyzing data from different perspectives and summarizing it into useful information [10]. DM techniques are the result of a long process of research and product development [10]. The evolution of DM [6] is shown in table 1. Table 1: The evolution by DM [6] Steps for applying DM: 1. Definition of the business objective and expected operational environment. 2. Data selection is required to identify meaningful sample of data. 3.Data transformation that involves data representation in an appropriate format for mining algorithm. 4. Selection and implementation of data mining algorithm depends on the mining objective. 5.Analysis of the discovered outcomes is needed to formulate business outcomes. 6. Representing valuable business outcomes.
  • 3. IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163 __________________________________________________________________________________________ Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 337 Data mining consists of five major elements; to extract, to transform, and to load transaction data onto the data warehouse system ,to store and manage the data in a multidimensional database system, to provide data access to business analysts and information technology professionals, Analyze the data by application software, and finally to present the data in a useful format, such as a graph or table. DM techniques usually fall into two categories, predictive or descriptive. Predictive DM uses historical data to infer something about future events. Predictive mining tasks use data to build a model to make predictions on unseen future events. Descriptive DM aims to find patterns in the data that provide some information about internal hidden relationships. Descriptive mining tasks characterize the general properties of the data and represent it in a meaningful way. Figure2 shows the classification of DM techniques. Association Rule is used to disc over relationships between attribute sets for a given input pattern. [6] Define sequence discovery as "a sequential technique is a given set of sequences find the complete set of frequent subsequences”. Clustering is “the process of organizing objects into groups whose members are alike in some way” [7]. So, it deals with finding the internal structure in a collection of data, as shown in figure 3. Fig 3: Simple graphical for clustering data [7] [8] Define that “Clustering involves identifying a finite set of categories or segments „clusters‟ to describe the data according to a certain metric". [9] Define that “Clustering enables to find specific discriminative factors or attributes for the studied data. Each member of a cluster should be very similar to other member s in its cluster and very dissimilar to other clusters. When a new data is introduced, it is classified into the most similar cluster". Several researchers classified clustering algorithms differently. Some classifies clusters as mutually exclusive, hierarchical or overlapping. Others classifies cluster into hierarchal and partitional. The most common classification is shown in figure 3. Techniques for creating clusters include partitioning methods as in k-means algorithm, and hierarchical methods as in decision trees, and density-based methods. Fig 4: Clustering methods classifications for Moses Charikar [10] 3. THE PROPOSED 'IDM S' Investment Data Mining System „IDMS‟ aims to build a data mining system for investment in the banking sector. IDMS consists of several components; data gathering, preparing data to discover knowledge, data preprocessing, using data mining techniques in sequences steps start with classification data, clustering data especially using K-mean algorithm and enhanced K-mean algorithm to set which best result and then set and run association rules to solve problem, post processing and finally get result and visualize result to create best decision to take a good decision for investment under uncertain situations. IDMS Shown in figure 5. Hardware for applying the IDMC system is a personal computer configurations with this Processor 3.2, Hard Disk 160 gaga, Ram 2 G and Monitor 17 Inch. Operating system is windows XP services pack 3. Several software tools have been used. The first is Microsoft Excel sheets 2007 and has been used for analysis and filtering data. Mat-lab version 6.5 has been used in data preprocessing and data classification. The
  • 4. IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163 __________________________________________________________________________________________ Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 338 last software is the WEKA which is a collection of java tools for DM written by staff at the University Of Waikato, New Zealand. Fig 5: Proposed IDMS CONCLUSIONS This paper represents a contribution of applying DM and DSS for banking sector especially in investment which has been rarely addressed before. IDMS is a new proposed system which is simple, straightforward with low computation needs. The proposed preprocessing component is an aggregation of several known steps. The banking is planning a set of actions in accordance of IDMS outcomes for decision making in investment sector. The investment department in the banking is starting to analyze the approached investment sector, to introduce a good decision under uncertain situation. FUTURE WORK In next step of this study implementing this proposed approach and creating full steps of preprocessing using enhanced k- mean algorithm by real data from bank as a case study, to give us a best result and support high level of management with a good decision. After implementing this case study and proposed system will get the main factors which effect in decision under uncertain REFERENCES [1] A. Hunter and S. Parsons, "A review of uncertainty handling formalisms", Applications of Uncertainty Formalisms, LNAI 1455, pp.8-37. Springer -Verlag, 1998. [2] E. Hernandez and J. Recasens, "A general framework for induction of decision trees under uncertainty", Modelling with Words, LNAI 2873, pp.26–43, Springer-Verlag, 2003. [3] M. S. Chen, J. Han, and P. S. Yu. IEEE Trans Knowledge and Data Engineering Data mining. An overview from a database perspective, 8:866-883, 1996. [4]. U. Fayyad, G. Piatetsky-Shapiro and W. J. Frawley. AAAI/MIT, Press definition of KDD at KDD96. Knowledge Discovery in Databases, 1991. [5]. Gartner. Evolution of data mining, Gartner Group Advanced Technologies and Applications Research Note, 2/1/95. [6]. International Conferences on Knowledge Discovery in Databases and Data Mining (KDD‟95-98), 1995-1998. [7]. R.J. Miller and Y. Yang. Association rules over interval data. SIGMOD'97, 452-461, Tucson, Arizona, 1997. [8]. Zaki, M.J., SPADE An Efficient Algorithm for Mining Frequent Sequences Machine Learning, 42(1) 31-60, 2001. [9]. Osmar R. Zaïane. “Principles of Knowledge Discovery in Databases - Chapter 8 Data Clustering”. & Shantanu Godbole data mining Data mining Workshop 9th November 2003. [10]. T.Imielinski and H. Mannila. Communications of ACM. A database perspective on knowledge discovery, 39:58-64, 1996. [11]. BIRCH Zhang, T., Ramakrishnan, R., and Livny, M. SIGMOD '96. BIRCH an efficient data clustering method for very large databases. 1996. [12] Pascal Poncelet, Florent Masseglia and Maguelonne Teisseire (Editors). Information Science Reference. Data Mining Patterns New Methods and Applications, ISBN 978 1599041629, October 2007. [13]. Thearling K, Exchange Applications White Paper, Inc. increasing customer value by integrating data mining and campaign management software, 1998. [14]. Noah Gans, Spring. Service Operations Management, Vol. 5, No. 2, 2003. [15]. Joun Mack. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. An Efficient k-Means Clustering Algorithm, Analysis and Implementation, VOL. 24, NO. 7, JULY 2002. [16]. Andrew Moore and Brian T. Luke. Tutorial Slides, K- means and Hierarchical Clustering and K-Means Clustering, Slide 15, 2003. [17]. E. Turban, J. E. Aronson, T. Liang, and R. Sharda, Decision Support and Business Intelligence Systems, eighth edition. Prentice Hall, 2007.