SlideShare a Scribd company logo
Students Academic
Performance
Knowledge Discovery from Data
Introduction..
 Our project aim is to find students academic performance
and find out whether there is any general pattern in their
marks and performance.
 So here ,We are analyzing both internal and external
marks of a student.
 We did the following KDD preprocessing steps to mine
our data.
Learning the application domain
 Learning the application domain is the first step in KDD
process .
 Need to have a clear understanding about the application
domain and our objectives.
 The institution considered for mining is MCA batch of Rajagiri
College of Social Sciences.
 We collected all previous year academic record from the
department of computer science
Create a target data set:
data selection
 We selected 2007-2010 batch marks for analysing the
pattern.
 There were around 45 records(45 students).
 Both the internal and external marks of each student were
selected, in order to find out the performance pattern.
Internal & External Dataset
Data cleaning & preprocessing
 Data cleaning is the step where noise and irrelevant data are
removed from the large data set.
 This is a very important pre-processing step because our
outcome would be dependent on the quality of selected data.
 Remove duplicate records, enter logically correct values for
missing records(absent students), remove unnecessary data
fields and standardize data format.
 There was no much duplicate data or unnecessary data in the
collected record . The dataset was partially cleaned.
 Student internal mark and external mark were stored in
different records.
 By applying data integration these records were integrated
into one record.
 The new dataset consist of internal mark details and external
mark details of each student in one record.
Students academic performance using clustering technique
Data reduction & transformation
 Data is transformed into appropriate form for making it ready for
data mining step.
 The dataset contains marks of 5 theory paper and 2 lab paper of
all 5 semesters.
 These marks are transformed into sum of internal marks and sum
of external marks of each student for the easiness of analysing
the pattern.
Students academic performance using clustering technique
Cluster Analysis
 The data mining technique we used here is clustering.
 A cluster is a collection of data objects that are similar to
one another within same cluster and are dissimilar to
objects in other cluster.
 We first partitioned the set of data into groups based on
data similarity and then assign labels
Choosing functions of data mining
K-MEANS Partitioning
 The K-means algorithm takes input parameter k and
partitions the set of n objects into k clusters.
 Here we selected no: of cluster as 4
 Objects are distributed to a cluster based on cluster
center to which it is nearest.
 For each semester we found out the clusters separately
and labeled them as students Excellent, Good, Fair and
Poor
Choosing mining algorithms
The Tool used for pattern evaluation is ORANGE
Orange Cluster Analysis
No of cluster selected is 4
Semester 1
poor
Fair
Good
Excellent
Semester 2
Semester 3
Semester 4
Semester 5
Centroid Analysis
Semester 1
Semester 2
Semester 3
Semester 4
Semester 5
Combined Centroid Analysis
Data mining search for patterns of
interest
 From the mining process we found that “All the 5 semester
clusters followed the same pattern of performance”.
 A student with high internal mark has higher external
marks and a student with less internal marks has less
external marks.
 There is a direct relation between the internal and the
external marks.
 At some case this evaluation is not valid, cases like
 Being absent for internal exam and scoring high marks for
the externals (vice versa)
CONCLUSION
 A students performance in his university exam can be
predicted with the help of his internal marks. There is
a direct relation between the internal and the external
marks.
 A student with low internals will get low marks for
externals too
Use of discovered knowledge
representation
Thank You

More Related Content

PDF
Predicting student performance using aggregated data sources
PPTX
Data mining to predict academic performance.
PPTX
STUDENT PERFORMANCE ANALYSIS USING DECISION TREE
PPTX
Predicting students performance in final examination
DOCX
machine learning based predictive analytics of student academic performance i...
PDF
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
PPTX
Student Grade Prediction
PDF
Predicting students performance using classification techniques in data mining
Predicting student performance using aggregated data sources
Data mining to predict academic performance.
STUDENT PERFORMANCE ANALYSIS USING DECISION TREE
Predicting students performance in final examination
machine learning based predictive analytics of student academic performance i...
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
Student Grade Prediction
Predicting students performance using classification techniques in data mining

What's hot (20)

PPTX
Education data mining presentation
PPT
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
PPT
Data Mining Overview
PPTX
Enterprise Data Management
PPT
Data Mining Concepts
PPTX
Introduction to Machine Learning
PPTX
Predictive Modelling
PPTX
Text MIning
PPTX
Data Mining
PPTX
Web Mining Presentation Final
PPTX
What Is Unstructured Data And Why Is It So Important To Businesses?
PDF
Data Analytics
PPTX
Knowledge discovery process
PPT
Data mining
PPTX
Classification and Clustering
PPTX
Introduction of Data Science and Data Analytics
PPTX
The 8 Step Data Mining Process
PPTX
Kdd process
PDF
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
PPTX
Network Intrusion Detection System Using Machine Learning and Deep Learning F...
Education data mining presentation
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data Mining Overview
Enterprise Data Management
Data Mining Concepts
Introduction to Machine Learning
Predictive Modelling
Text MIning
Data Mining
Web Mining Presentation Final
What Is Unstructured Data And Why Is It So Important To Businesses?
Data Analytics
Knowledge discovery process
Data mining
Classification and Clustering
Introduction of Data Science and Data Analytics
The 8 Step Data Mining Process
Kdd process
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Network Intrusion Detection System Using Machine Learning and Deep Learning F...
Ad

Viewers also liked (20)

PPTX
Factors affecting the academic performance of college students (1)
PDF
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
PPTX
LinkedIn Summer Sales Guide - B2B Sales Influencers #LISummerGuide
PPT
Sania rtp
PPT
Smartcards and Authentication Tokens
DOCX
Data Mining _ Weka
PPTX
Attendance and student performance arp (1)
PPTX
Some Thoughts on Learning Analytics and Educational Data Mining
PPTX
Data Mining Project for student academic specialization and performance
PPTX
Mining Student Data LIVE_EUR_v2
PPTX
Grand challenges for the Educational Data Mining and Learning Sciences Commun...
PDF
Provision and management of school plant as a correlate of science students a...
PPTX
Predicting Student Performance in Solving Parameterized Exercises
PDF
Ethical Hacking
PDF
Solar and wind power forecasting
PPTX
USING LEARNING ANALYTICS TO PREDICT STUDENTS’ PERFORMANCE IN MOODLE LMS
PDF
My First Data Science Project (using Rapid Miner)
PDF
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
PDF
The effects of skipping breakfast on the academic performance
PDF
Big Data in Education
Factors affecting the academic performance of college students (1)
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
LinkedIn Summer Sales Guide - B2B Sales Influencers #LISummerGuide
Sania rtp
Smartcards and Authentication Tokens
Data Mining _ Weka
Attendance and student performance arp (1)
Some Thoughts on Learning Analytics and Educational Data Mining
Data Mining Project for student academic specialization and performance
Mining Student Data LIVE_EUR_v2
Grand challenges for the Educational Data Mining and Learning Sciences Commun...
Provision and management of school plant as a correlate of science students a...
Predicting Student Performance in Solving Parameterized Exercises
Ethical Hacking
Solar and wind power forecasting
USING LEARNING ANALYTICS TO PREDICT STUDENTS’ PERFORMANCE IN MOODLE LMS
My First Data Science Project (using Rapid Miner)
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
The effects of skipping breakfast on the academic performance
Big Data in Education
Ad

Similar to Students academic performance using clustering technique (20)

PDF
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
PDF
Vol2no2 7 copy
PDF
Study of Clustering of Data Base in Education Sector Using Data Mining
PDF
Study of Clustering of Data Base in Education Sector Using Data Mining
PDF
Study of Clustering of Data Base in Education Sector Using Data Mining
PDF
Educational Data Mining to Analyze Students Performance – Concept Plan
PDF
Data Mining Techniques in Higher Education an Empirical Study for the Univer...
PDF
Data Clustering in Education for Students
PDF
IRJET- Academic Performance Analysis System
PDF
EDM_IJTIR_Article_201504020
PDF
Fuzzy Association Rule Mining based Model to Predict Students’ Performance
PDF
A study model on the impact of various indicators in the performance of stude...
PDF
Application of Higher Education System for Predicting Student Using Data mini...
PDF
Performance Assessment of Faculties of Management Discipline From Student Per...
PDF
Extending the Student’s Performance via K-Means and Blended Learning
PDF
Educational Data Mining & Students Performance Prediction using SVM Techniques
PDF
A Survey on the Classification Techniques In Educational Data Mining
PDF
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
PDF
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...
PPTX
seminxxxxxxxxxxxxxxxxxxxxxxxxxar ppt(1).pptx
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
Vol2no2 7 copy
Study of Clustering of Data Base in Education Sector Using Data Mining
Study of Clustering of Data Base in Education Sector Using Data Mining
Study of Clustering of Data Base in Education Sector Using Data Mining
Educational Data Mining to Analyze Students Performance – Concept Plan
Data Mining Techniques in Higher Education an Empirical Study for the Univer...
Data Clustering in Education for Students
IRJET- Academic Performance Analysis System
EDM_IJTIR_Article_201504020
Fuzzy Association Rule Mining based Model to Predict Students’ Performance
A study model on the impact of various indicators in the performance of stude...
Application of Higher Education System for Predicting Student Using Data mini...
Performance Assessment of Faculties of Management Discipline From Student Per...
Extending the Student’s Performance via K-Means and Blended Learning
Educational Data Mining & Students Performance Prediction using SVM Techniques
A Survey on the Classification Techniques In Educational Data Mining
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...
seminxxxxxxxxxxxxxxxxxxxxxxxxxar ppt(1).pptx

More from saniacorreya (6)

PPTX
PROJECT REPORT ON CRYPTOGRAPHIC ALGORITHM
PPTX
Object recognition
PPTX
Color and human vision
PPTX
Manipulator robot for crack detection and welding
PPTX
Windows 10 ppt
PPTX
PROJECT REPORT ON CRYPTOGRAPHIC ALGORITHM
Object recognition
Color and human vision
Manipulator robot for crack detection and welding
Windows 10 ppt

Recently uploaded (20)

PDF
Basic Mud Logging Guide for educational purpose
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
master seminar digital applications in india
PPTX
Cell Types and Its function , kingdom of life
PDF
RMMM.pdf make it easy to upload and study
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
Computing-Curriculum for Schools in Ghana
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Basic Mud Logging Guide for educational purpose
PPH.pptx obstetrics and gynecology in nursing
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
TR - Agricultural Crops Production NC III.pdf
Anesthesia in Laparoscopic Surgery in India
Supply Chain Operations Speaking Notes -ICLT Program
Complications of Minimal Access Surgery at WLH
Final Presentation General Medicine 03-08-2024.pptx
STATICS OF THE RIGID BODIES Hibbelers.pdf
GDM (1) (1).pptx small presentation for students
Pharmacology of Heart Failure /Pharmacotherapy of CHF
master seminar digital applications in india
Cell Types and Its function , kingdom of life
RMMM.pdf make it easy to upload and study
Microbial diseases, their pathogenesis and prophylaxis
2.FourierTransform-ShortQuestionswithAnswers.pdf
Computing-Curriculum for Schools in Ghana
Renaissance Architecture: A Journey from Faith to Humanism
human mycosis Human fungal infections are called human mycosis..pptx
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx

Students academic performance using clustering technique

  • 2. Introduction..  Our project aim is to find students academic performance and find out whether there is any general pattern in their marks and performance.  So here ,We are analyzing both internal and external marks of a student.  We did the following KDD preprocessing steps to mine our data.
  • 3. Learning the application domain  Learning the application domain is the first step in KDD process .  Need to have a clear understanding about the application domain and our objectives.  The institution considered for mining is MCA batch of Rajagiri College of Social Sciences.  We collected all previous year academic record from the department of computer science
  • 4. Create a target data set: data selection  We selected 2007-2010 batch marks for analysing the pattern.  There were around 45 records(45 students).  Both the internal and external marks of each student were selected, in order to find out the performance pattern.
  • 6. Data cleaning & preprocessing  Data cleaning is the step where noise and irrelevant data are removed from the large data set.  This is a very important pre-processing step because our outcome would be dependent on the quality of selected data.  Remove duplicate records, enter logically correct values for missing records(absent students), remove unnecessary data fields and standardize data format.
  • 7.  There was no much duplicate data or unnecessary data in the collected record . The dataset was partially cleaned.  Student internal mark and external mark were stored in different records.  By applying data integration these records were integrated into one record.  The new dataset consist of internal mark details and external mark details of each student in one record.
  • 9. Data reduction & transformation  Data is transformed into appropriate form for making it ready for data mining step.  The dataset contains marks of 5 theory paper and 2 lab paper of all 5 semesters.  These marks are transformed into sum of internal marks and sum of external marks of each student for the easiness of analysing the pattern.
  • 11. Cluster Analysis  The data mining technique we used here is clustering.  A cluster is a collection of data objects that are similar to one another within same cluster and are dissimilar to objects in other cluster.  We first partitioned the set of data into groups based on data similarity and then assign labels Choosing functions of data mining
  • 12. K-MEANS Partitioning  The K-means algorithm takes input parameter k and partitions the set of n objects into k clusters.  Here we selected no: of cluster as 4  Objects are distributed to a cluster based on cluster center to which it is nearest.  For each semester we found out the clusters separately and labeled them as students Excellent, Good, Fair and Poor Choosing mining algorithms
  • 13. The Tool used for pattern evaluation is ORANGE
  • 15. No of cluster selected is 4
  • 28. Data mining search for patterns of interest  From the mining process we found that “All the 5 semester clusters followed the same pattern of performance”.  A student with high internal mark has higher external marks and a student with less internal marks has less external marks.  There is a direct relation between the internal and the external marks.  At some case this evaluation is not valid, cases like  Being absent for internal exam and scoring high marks for the externals (vice versa)
  • 29. CONCLUSION  A students performance in his university exam can be predicted with the help of his internal marks. There is a direct relation between the internal and the external marks.  A student with low internals will get low marks for externals too
  • 30. Use of discovered knowledge representation