Introduction to XLMiner™ Data Utilities
XLMiner and Microsoft Office are registered trademarks of their respective owners.
Brief description of the features of XLMiner: Data Utilities
XLMiner puts a host of data utilities at the user's disposal:
  Sample from Worksheet/Database (simple random sampling, stratified sampling)
  Missing Data Handling
  Bin Continuous Data
  Transform Categorical Data
http://dataminingtools.net
Sample data from Worksheet
When huge amounts of data are involved, statisticians prefer to work with a sample that represents the entire database. Such a representative sample can be very difficult to obtain. The entire dataset we want information about is called the population; a sample is the part of the population that we actually examine to draw conclusions. A good sample should be a true representation of the data: as far as possible, the cases chosen for the sample should be like the cases that are not chosen. A poor sample design can produce misleading conclusions. Various methods and techniques have been developed to obtain a true sample, and XLMiner provides sampling facilities for this purpose.
Sample data from Worksheet
In XLMiner, sampling can be done in two ways:
Simple Random Sampling: A random sample of x records is chosen from the data such that every record in the data has an equal chance of being chosen.
Stratified Sampling: The data is divided into strata of similar items. Each stratum is then sampled using the simple random approach, and the results are combined to give the final sample.
Sample data from Worksheet - Simple Random Sampling
Select the variables to be present in the sample.
Here "Simple Random Sampling" is selected.
We can specify the seed value (the value used for random selection), or the wizard will supply one by default.
Set the size of the sampled set.
If the with-replacement option is selected, duplicate copies of records may appear in the sample.
Sample data from Worksheet - Simple Random Sampling (output)
Sample data from Worksheet - Simple Random Sampling (output with replacement)
Duplicate copies of records exist in the sample.
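The seed and with-replacement options described above can be illustrated outside the wizard. The following is a minimal Python sketch, not XLMiner's own implementation: the function name `simple_random_sample`, the seed value 12345 and the 100-row stand-in data are all assumptions made for illustration.

```python
import random

def simple_random_sample(records, size, seed=12345, with_replacement=False):
    """Draw `size` records so that every record has an equal chance of selection."""
    rng = random.Random(seed)  # a fixed seed makes the draw repeatable
    if with_replacement:
        # Each draw is independent, so the same record may be picked again.
        return [rng.choice(records) for _ in range(size)]
    # Without replacement, each record can appear at most once.
    return rng.sample(records, size)

records = list(range(1, 101))  # hypothetical stand-in for 100 worksheet rows
without = simple_random_sample(records, 10)
with_dupes = simple_random_sample(records, 10, with_replacement=True)
print(without)
print(with_dupes)
```

Running the function twice with the same seed returns the same sample, which is the practical reason for specifying a seed rather than leaving it to chance.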
Sample data from Worksheet - Stratified Sample (proportionate)
Sample data from Worksheet - Stratified Sample (proportionate, output)
As selected, the percentage of records in each stratum of the sample set is the same as in the input set.
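Proportionate stratified sampling can be sketched as follows. This is an illustrative Python version under stated assumptions, not XLMiner's code: the helper name `proportionate_stratified_sample`, the `fraction` parameter, the rounding rule and the example strata A, B and C are hypothetical.

```python
import random
from collections import Counter, defaultdict

def proportionate_stratified_sample(records, stratum_of, fraction, seed=12345):
    """Sample the same fraction from every stratum, then combine the results."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for rec in records:
        strata[stratum_of(rec)].append(rec)  # group records by stratum
    sample = []
    for key in sorted(strata):
        members = strata[key]
        k = round(len(members) * fraction)  # assumption: round to nearest record
        sample.extend(rng.sample(members, k))  # simple random draw within stratum
    return sample

# Hypothetical input: 60 records in stratum A, 30 in B, 10 in C.
records = ([("A", i) for i in range(60)]
           + [("B", i) for i in range(30)]
           + [("C", i) for i in range(10)])
sample = proportionate_stratified_sample(records, lambda r: r[0], 0.2)
counts = Counter(stratum for stratum, _ in sample)
print(counts)
```

Because the same fraction is taken from each stratum, the 60/30/10 split of the input carries over to the sample, up to rounding.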
Sample data from Worksheet - Stratified Sample (specify number)
All strata have equal sizes, as specified by the user (here, 10 records each).
Sample data from Worksheet - Stratified Sample (size of smallest stratum)
Sample data from Worksheet - Stratified Sample (size of smallest stratum, output)
All strata have a size equal to that of the smallest stratum.
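The two equal-size options (a user-specified number per stratum, or the size of the smallest stratum) differ only in how the per-stratum count is chosen. A minimal Python sketch follows; the function name `equal_size_stratified_sample` and the example data are hypothetical, and raising an error when the requested number exceeds the smallest stratum is an assumption of this sketch, not documented XLMiner behaviour.

```python
import random
from collections import Counter, defaultdict

def equal_size_stratified_sample(records, stratum_of, per_stratum=None, seed=12345):
    """Draw the same number of records from every stratum.

    If `per_stratum` is None, the size of the smallest stratum is used.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for rec in records:
        strata[stratum_of(rec)].append(rec)
    smallest = min(len(members) for members in strata.values())
    if per_stratum is None:
        per_stratum = smallest
    if per_stratum > smallest:
        # Assumption of this sketch: refuse rather than sample with replacement.
        raise ValueError("requested size exceeds the smallest stratum")
    sample = []
    for key in sorted(strata):
        sample.extend(rng.sample(strata[key], per_stratum))
    return sample

# Hypothetical input: 60 records in stratum A, 30 in B, 15 in C.
records = ([("A", i) for i in range(60)]
           + [("B", i) for i in range(30)]
           + [("C", i) for i in range(15)])
fixed = equal_size_stratified_sample(records, lambda r: r[0], per_stratum=10)
smallest = equal_size_stratified_sample(records, lambda r: r[0])
print(Counter(s for s, _ in fixed))
print(Counter(s for s, _ in smallest))
```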
Missing Data Handling
This utility lets the user process the data before any mining method is applied to it: it detects the missing values in the data and handles them the way the user wants. XLMiner considers a cell to be missing data if it is empty or contains an invalid formula. XLMiner can also be told to treat a cell as missing data if it contains a certain value specified by the user.
The user can specify how XLMiner should correct these missing values, and a treatment can be assigned to every variable. Records with missing data can either be deleted entirely or have their missing values replaced. XLMiner provides options for the replacement, e.g. the mean, the median, the mode, or a value specified by the user. The available options depend on the type of the variable.
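The treatments listed above (delete the record, or replace the missing value with the mean, median, mode or a user-specified value) can be sketched in a few lines of Python. This is an illustration only, not XLMiner's implementation: the function `handle_missing`, the treatment names, the use of `None` to mark a missing cell and the sample rows are assumptions.

```python
import statistics

MISSING = None  # assumption: a missing cell is represented as None

def handle_missing(rows, column, treatment, value=None):
    """Return new rows with missing entries in `column` deleted or replaced."""
    if treatment == "delete":
        # Drop every record whose value in this column is missing.
        return [dict(r) for r in rows if r[column] is not MISSING]
    present = [r[column] for r in rows if r[column] is not MISSING]
    if treatment == "mean":
        fill = statistics.mean(present)
    elif treatment == "median":
        fill = statistics.median(present)
    elif treatment == "mode":
        fill = statistics.mode(present)
    elif treatment == "value":
        fill = value  # user-specified replacement
    else:
        raise ValueError(f"unknown treatment: {treatment!r}")
    return [{**r, column: fill} if r[column] is MISSING else dict(r) for r in rows]

rows = [
    {"age": 20, "city": "Pune"},
    {"age": None, "city": "Pune"},
    {"age": 30, "city": None},
    {"age": 40, "city": "Delhi"},
]
by_mean = handle_missing(rows, "age", "mean")    # numeric variable
by_mode = handle_missing(rows, "city", "mode")   # categorical variable
dropped = handle_missing(rows, "age", "delete")
print(by_mean[1], by_mode[2], len(dropped))
```

The mean and median only make sense for numeric variables, while the mode also works for categorical ones, which matches the remark that the available options depend on the type of the variable.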
Missing Data Handling
Data set: select the action to handle the missing data in individual columns and click "Apply this option to selected variable".
Missing Data Handling - Output
Changed records are highlighted.
Transform Categorical Data
Sometimes our data sets contain variables that take non-numeric values, which makes it difficult to apply standard procedures. XLMiner therefore provides a tool that transforms (recodes) non-numeric data into numeric data. There are two ways to transform categorical data:
Create Dummies: Suppose the variable has 4 distinct values: A, B, C and D. Then 3 new columns, VAL1, VAL2 and VAL3, are created, each holding either 1 or 0. If a row contains the value A, VAL1 is 1 and the rest are 0, and similarly for B and C. If all three are 0, the row has the value D.
Create Category Scores: If the non-numeric variable holds 4 distinct values as above, the values (ordered alphabetically) are numbered from 1 to 4, and a new column is created that holds, for each row, the number corresponding to that row's value.
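Both transformations can be sketched in Python for the A, B, C, D example above. This is a minimal illustration, not XLMiner's implementation: the function names are hypothetical, and treating the alphabetically last category as the all-zeros baseline is an assumption taken from the example, where D is the all-zeros case.

```python
def create_dummies(values):
    """One 0/1 column per category except the last, which is the all-zeros case."""
    categories = sorted(set(values))
    kept = categories[:-1]  # e.g. A, B, C; the last category (D) gets no column
    return [
        {f"VAL{i}": int(v == c) for i, c in enumerate(kept, start=1)}
        for v in values
    ]

def create_category_scores(values):
    """Number the distinct values 1..n in alphabetical order."""
    categories = sorted(set(values))
    score = {c: i for i, c in enumerate(categories, start=1)}
    return [score[v] for v in values]

values = ["A", "C", "D", "B", "A"]
dummies = create_dummies(values)
scores = create_category_scores(values)
print(dummies)
print(scores)
```

Dummies need one column fewer than the number of categories because the last category is implied when all the others are 0. Category scores use a single column but impose an ordering on the values.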
Transform Categorical Data - Dummies
Select the variable that contains non-numeric data and needs to be transformed.
Transform Categorical Data - Category Scores
Transform Categorical Data - Category Scores (output)
Thank you
For more, visit: http://dataminingtools.net
