Crime Data Analysis Using Data Mining for Crime
Prediction
Anavadya Shibu; Anish S. A.; Gokul Krishnan S.; Vijayalekshmi V. R.
S8 Computer Science and Engineering
Mohandas College of Engineering and Technology
Anad, Nedumangad, Thiruvananthapuram -695544
Mrs. Prathibha S. Nair,
Assistant Professor, Department of Computer Science and Engineering,
Mohandas College of Engineering and Technology
Anad, Nedumangad, Thiruvananthapuram -695544
Abstract— This project presents a general idea about the
model of data mining techniques and diverse crimes. It also
provides a broad survey of effective and valuable
techniques in data mining for crime data analysis. The
objective of the data mining is to recognize patterns in
criminal behavior in order to predict crime, anticipate
criminal activity, and prevent it. This project implements
novel data mining techniques such as KNN, text clustering,
and the IR tree to investigate crime data sets and resolve
the problems found in them. The combined knowledge of
various data mining algorithms tends to give an
enhanced, integrated, and precise result for crime
prediction in the banking sector. Our law enforcement
organizations need to be adequately equipped to defeat
and prevent crime. This project is developed using Java
as the front end and MySQL as the back end. Supporting
applications such as Sunset and NetBeans are used to make
the portal more interactive.
I. INTRODUCTION
Data Mining, or Knowledge Discovery in Databases (KDD),
is in simple terms the nontrivial extraction of implicit,
previously unknown, and potentially useful information
from data [1],[2],[3]. It deals with the discovery of hidden
knowledge, unexpected patterns, and new rules from large
databases. KDD is the process of identifying a valid,
potentially useful, and ultimately understandable structure
in data. Data mining is an emerging field that can be used
in a wide range of applications, including marketing,
banking, airlines, and many other fields that strongly
affect communities. Crime analysis is one of these
important applications of data mining. Data mining
comprises many tasks and techniques, including
classification, association, clustering, and prediction;
each of them has its own importance and applications
[1],[2],[3].
Advances in technology, which allow the analysis of large
quantities of data, are the foundation for the relatively
new field known as crime analysis. Crime analysis is an
emerging field in law enforcement without standard
definitions. This makes it difficult to determine the focus
of crime analysis for agencies that are new to the field. In
some police departments, what is called “crime analysis”
consists of mapping crimes for command staff and
producing crime statistics. In other agencies, crime analysis
might mean analyzing various police reports and suspect
information to help investigators in major crime units.
Crime analysis is the process of analyzing crime.
More specifically, crime analysis is the breaking up of acts
committed in violation of laws into their parts to find out
their nature and to report a statement of these findings [4].
The role of the crime analyst varies from agency to agency.
The objective of most crime analysis is to find
meaningful information in vast amounts of data and
disseminate this information to officers and investigators in
the field to assist in their efforts to apprehend criminals and
suppress criminal activity. Assessing crime through
analysis also helps in crime prevention efforts [4],[7],[10].
II. PROBLEM STATEMENT
We propose to implement a model that could help us to extract
crime patterns. Data mining algorithms such as association
rule mining and clustering will be applied to classify crime
records on the basis of the values of crime attributes.
Applying both algorithms together is expected to give better
overall results than association rule mining or clustering
alone. The scenarios generated by the system could easily be
presented in human language, which Law Enforcement officers
might use to help them decide on a crime prevention strategy.
III. PROBLEM SOLUTION
Law enforcement agencies today are faced with a large volume
of data that must be preprocessed and transformed into useful
information. Data mining can improve crime analysis and aid
in reducing and preventing crime. The purpose of this study is
to explore the applicability of data mining techniques in the
efforts of crime analysis and prevention. The data was
collected manually. Our proposed model will be able to extract
crime patterns by using clustering to classify crime records on
the basis of the values of crime attributes.
Cluster analysis: grouping a set of data objects into
clusters. Clustering is unsupervised classification: there
are no predefined classes. The simple K-Means clustering
algorithm is used in this paper. The K-Means algorithm
clusters the data into m groups, where m is predefined.
Input: crime type, number of clusters, number of iterations.
The initial seeds can play an important role in the final
results.
Step 1: Randomly choose cluster centers.
Step 2: Assign each instance to a cluster based on
its distance to the cluster centers.
Step 3: Adjust the centers of the clusters.
Step 4: Go to Step 2 until convergence.
Step 5: Output X0, X1, X2, X3.
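The steps above can be sketched in a few lines of Python. This is a minimal illustration rather than the project's Java implementation: the function name `kmeans`, its parameters, the fixed random `seed`, and the choice to leave an empty cluster's center unchanged are all assumptions made for the example. Points are assumed to be equal-length numeric tuples, and the loop repeats the assignment and update steps until no assignment changes.

```python
import math
import random


def kmeans(points, k, max_iterations=100, seed=0):
    """Cluster `points` (equal-length numeric tuples) into k groups."""
    rng = random.Random(seed)
    # Step 1: randomly choose k of the points as the initial cluster centers.
    centers = rng.sample(points, k)
    assignments = [None] * len(points)

    for _ in range(max_iterations):
        # Step 2: assign each instance to its nearest center (Euclidean distance).
        new_assignments = [
            min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points
        ]
        # Step 4: stop once no assignment changes (convergence).
        if new_assignments == assignments:
            break
        assignments = new_assignments

        # Step 3: move each center to the mean of the points assigned to it.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # assumption: an empty cluster keeps its old center
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))

    # Step 5: output the final centers and the cluster index of every point.
    return centers, assignments


if __name__ == "__main__":
    # Hypothetical 2-D data: two visibly separate groups.
    data = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
    print(kmeans(data, k=2))
```

Because the initial centers are drawn at random, different seeds can lead to different final clusters on less clearly separated data.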
Output
Table 2: Confusion Matrix
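All of these values are derived from the information in the
truth table, also known as a confusion matrix, which provides
the actual and predicted classifications from the predictor.
Here a, b, c, and d are taken to be the counts of true
positives, false positives, false negatives, and true
negatives, which is the labeling implied by equations (3) to (5).
$$\mathrm{TPR} = \frac{a}{a+c} \tag{2}$$
$$\mathrm{FPR} = \frac{b}{b+d} \tag{3}$$
$$\mathrm{Accuracy} = \frac{a+d}{a+b+c+d} \tag{4}$$
$$\mathrm{Precision} = \frac{a}{a+b} \tag{5}$$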
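As a worked illustration, the four measures can be computed directly from the confusion-matrix counts. The sketch below assumes the cells are labeled a = true positives, b = false positives, c = false negatives, and d = true negatives, and it uses the standard definition of the true positive rate, TP / (TP + FN). The function name `classification_metrics` and the sample counts are hypothetical.

```python
def classification_metrics(a, b, c, d):
    """Return TPR, FPR, accuracy, and precision from confusion-matrix counts.

    Assumed cell labels: a = true positives, b = false positives,
    c = false negatives, d = true negatives.
    """
    def ratio(numerator, denominator):
        # Return 0.0 instead of dividing by zero when a row or column is empty.
        return numerator / denominator if denominator else 0.0

    return {
        "tpr": ratio(a, a + c),                   # true positive rate (recall)
        "fpr": ratio(b, b + d),                   # false positive rate
        "accuracy": ratio(a + d, a + b + c + d),  # share of correct predictions
        "precision": ratio(a, a + b),             # share of positive calls that are right
    }


if __name__ == "__main__":
    # Hypothetical counts, not taken from the paper's data.
    print(classification_metrics(a=40, b=10, c=5, d=45))
```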
The main idea is to define k centers, one for each cluster.
These centers should be placed carefully, because different
locations cause different results. So, the better choice
is to place them as far away from each other as possible.
The next step is to take each point belonging to a given data
set and associate it with the nearest center. When every point
has been assigned, the first step is completed and an early
grouping is done. At this point we need to re-calculate k new
centroids as the barycenters of the clusters resulting from the
previous step. After we have these k new centroids, a new
binding has to be done between the same data set points and
the nearest new center. A loop has been generated. As a result
of this loop we may notice that the k centers change their
location step by step until no more changes are made; in other
words, the centers do not move any more. Finally, this
algorithm aims at minimizing an objective function known as
the squared error function, given by:
$$J(v) = \sum_{i=1}^{c} \sum_{j=1}^{c_i} \left( \lVert x_i - v_j \rVert \right)^2 \tag{6}$$
where
‘||xi - vj||’ is the Euclidean distance between xi and vj,
‘ci’ is the number of data points in the ith cluster, and ‘c’ is
the number of cluster centers.
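To make the objective concrete, the squared error can be evaluated for any given grouping by summing, over every cluster, the squared Euclidean distance from each member point to that cluster's center. The helper below is a small sketch under that reading; the name `squared_error` and the data layout (one list of points per cluster, with centers in matching order) are assumptions for illustration.

```python
import math


def squared_error(clusters, centers):
    """Sum of squared Euclidean distances from each point to its cluster center.

    clusters[i] is the list of points assigned to cluster i and
    centers[i] is the center of that cluster (assumed layout).
    """
    return sum(
        math.dist(point, center) ** 2
        for members, center in zip(clusters, centers)
        for point in members
    )


if __name__ == "__main__":
    # Hypothetical grouping: two points around (1, 0) and one point at (5, 5).
    clusters = [[(0.0, 0.0), (2.0, 0.0)], [(5.0, 5.0)]]
    centers = [(1.0, 0.0), (5.0, 5.0)]
    print(squared_error(clusters, centers))
```

A lower value means the points lie closer to their assigned centers, which is what each pass of the loop tries to achieve.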
The K-Means algorithm is fast, robust, and easy to
understand, and it gives the best results when the clusters in
the data set are distinct or well separated from each other.
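The earlier advice to place the initial centers as far from each other as possible can be approximated with a farthest-first heuristic. The paper does not say which seeding rule was used, so this is a substitute technique shown only for illustration: the function name `farthest_first_centers` and the choice to start from the first point are assumptions, and k is assumed not to exceed the number of distinct points.

```python
import math


def farthest_first_centers(points, k):
    """Pick k initial centers, each as far as possible from those already chosen."""
    centers = [points[0]]  # assumption: start from the first point in the data
    while len(centers) < k:
        # Choose the point whose nearest already-chosen center is farthest away.
        next_center = max(
            points, key=lambda p: min(math.dist(p, c) for c in centers)
        )
        centers.append(next_center)
    return centers


if __name__ == "__main__":
    # Hypothetical 2-D points.
    data = [(0.0, 0.0), (1.0, 0.0), (10.0, 0.0), (5.0, 8.0)]
    print(farthest_first_centers(data, k=3))
```

Seeds chosen this way are deterministic for a given data order, which removes one source of run-to-run variation, although outliers can attract a center.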
IV. CONCLUSION
An acceptable data mining model that produces excellent
results when analyzing a crime data set requires a large
amount of historical data that can be used for creating and
testing the model.

