International Journal of Information Sciences and Techniques (IJIST) Vol.6, No.1/2, March 2016
DOI: 10.5121/ijist.2016.6219
A STUDY OF TRADITIONAL DATA ANALYSIS
AND SENSOR DATA ANALYTICS
Kavita Ahuja¹ and N. N. Jani²
¹ Shree Madhav Institute of Computer & Information Technology, Surat, Gujarat, India
² Ex-Director, SKPIMCS - MCA, Gandhinagar, Gujarat, India
ABSTRACT
The growth of smart, intelligent devices known as sensors generates a large amount of data. Accumulated
over time, these data reach a volume large enough to be designated big data, and the repositories that
hold them contain largely unstructured data. Traditional data analytics methods are well developed and
widely used to analyze structured data and, to a limited extent, semi-structured data, which involves
additional processing overheads. The methods used to analyze unstructured data differ because they rely
on a distributed computing approach, whereas structured and semi-structured data can be processed
centrally. The work undertaken here is confined to an analysis of both varieties of methods. The study
aims to introduce the methods available for analyzing big data.
KEYWORDS
Sensor data, big data, data analysis, data analytics
1. INTRODUCTION
Technological growth has rapidly driven the automation of devices, and these automated systems
are now being made intelligent, not only for operation but also for effective control.
This initiative has opened the way to smart devices, smart homes, smart hospitals, smart
agriculture, smart citizen services, smart medical services and more, with the aim of improving
quality of life and of services. At its core, the initiative requires instant data acquisition,
immediate processing, and storage for analysis, both dynamic and futuristic. This in turn requires
the deployment of wireless sensors of diverse types for data capture, together with
high-performance processing, communication and storage on cloud infrastructure. Such a setup helps
to integrate all related areas and provides analytics-based decision support as well as intelligent
control. Sensor data pose challenges with respect to data acquisition, storage, and efficient
real-time processing of massive volumes [1] of possibly unstructured data, resulting in big data.
The analytics of sensor data therefore raises a set of issues for traditional methods of data analysis.
According to Steve Lohr [2], data are growing by more than 50% a year, as estimated by IDC, a
technology research firm. There are now countless digital sensors worldwide in devices used in
areas such as agriculture, industry, automobiles, defense, and shipping and transport. These
devices can capture sensor data on geographical location, movement, vibration, temperature,
pressure, environmental changes, chemical changes and more.
Although advances in computer systems and internet technologies have seen computing hardware
follow Moore’s law for several decades, the problems of handling large-scale data persist as we
enter the age of wireless sensor data, which results in big data. For this reason Fisher et al. [3]
pointed out that big data are data that cannot be handled and processed by most current
information systems or methods: data in the big data era are not only too big to be loaded into a
single machine, but most traditional data mining or data analytics methods developed for a
centralized analysis process may also not be directly applicable to them.
In response to the problems of analyzing large-scale data, quite a few efficient methods [4] have
been presented, such as sampling, data condensation, density-based approaches, grid-based
approaches, divide and conquer, incremental learning, and distributed computing. These methods
are routinely used to improve the performance of the operators of the data analytics process.
Their results illustrate that, with efficient methods at hand, large-scale data may be analyzed
in a reasonable time.
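As an illustration of the first method in this list, sampling, the following minimal sketch keeps a fixed-size uniform random sample of a sensor stream whose length is not known in advance. It uses reservoir sampling (Algorithm R), a standard technique chosen here for illustration and not one taken from the cited studies. The function name `reservoir_sample` and the synthetic temperature stream are assumptions made for this example.

```python
import random
from typing import Iterable, List, Optional


def reservoir_sample(stream: Iterable[float], k: int,
                     seed: Optional[int] = None) -> List[float]:
    """Return a uniform random sample of at most k items from a stream.

    The stream is read once and only k readings are held in memory,
    so the full data set never has to be loaded at the same time.
    """
    rng = random.Random(seed)
    reservoir: List[float] = []
    for i, reading in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k readings.
            reservoir.append(reading)
        else:
            # Keep the new reading with probability k / (i + 1).
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                reservoir[j] = reading
    return reservoir


# Hypothetical stream of 100,000 temperature readings, generated lazily.
readings = (20.0 + 0.001 * n for n in range(100_000))
sample = reservoir_sample(readings, k=100, seed=42)
print(len(sample))
```

Any analysis that tolerates approximation can then run on `sample` instead of the full stream, at the cost of some sampling error.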
To make the whole process of knowledge discovery in databases (KDD) clearer, Fayyad et al. [5]
summarized it as a few operations: selection, preprocessing, transformation, data mining, and
interpretation/evaluation. With these operators, a complete data analytics system can be built
that first gathers data (data input), then analyzes them (data analytics), and finally delivers
the result (output) to the user.
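The five KDD operations can be sketched as a chain of small functions. The example below is only an illustration under stated assumptions: the readings, the function names (`select`, `preprocess`, `transform`, `mine`, `interpret`) and the 30-degree limit are hypothetical, and the "data mining" step is reduced to a per-sensor mean so that the flow from input to output stays visible.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, List, Optional, Tuple

# (sensor id, measured quantity, value); None marks a missing reading.
Reading = Tuple[str, str, Optional[float]]

RAW: List[Reading] = [
    ("t1", "temperature", 21.5),
    ("t1", "temperature", None),
    ("t2", "temperature", 35.0),
    ("t1", "temperature", 22.5),
    ("p1", "pressure", 101.3),
    ("t2", "temperature", 37.0),
]


def select(raw: List[Reading], quantity: str) -> List[Reading]:
    """Selection: keep only the readings relevant to the question."""
    return [r for r in raw if r[1] == quantity]


def preprocess(readings: List[Reading]) -> List[Reading]:
    """Preprocessing: drop readings with missing values."""
    return [r for r in readings if r[2] is not None]


def transform(readings: List[Reading]) -> Dict[str, List[float]]:
    """Transformation: group the values by sensor id."""
    grouped: Dict[str, List[float]] = defaultdict(list)
    for sensor, _quantity, value in readings:
        grouped[sensor].append(value)
    return dict(grouped)


def mine(grouped: Dict[str, List[float]]) -> Dict[str, float]:
    """Data mining (simplified): compute the mean value per sensor."""
    return {sensor: mean(values) for sensor, values in grouped.items()}


def interpret(means: Dict[str, float], limit: float) -> List[str]:
    """Interpretation/evaluation: report sensors whose mean exceeds a limit."""
    return sorted(s for s, m in means.items() if m > limit)


means = mine(transform(preprocess(select(RAW, "temperature"))))
hot = interpret(means, limit=30.0)
print(means, hot)
```

In terms of the three-part view, `RAW` and `select` correspond to data input, `preprocess` through `mine` to data analytics, and `interpret` to the output handed to the user.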
Because traditional data analysis methods were not designed for large-scale and complex data,
they are almost incapable of analyzing big data. Redesigning the data analysis methods and
changing the way they are designed are two critical trends for big data analysis. Several
important concepts in the design of big data analysis methods are given in the following
sections.
Data streams are generated at multiple distributed sensor nodes, which requires data mining
approaches to optimize the communication cost across nodes as well as the computation cost and
storage requirements at each node. Charu Aggarwal [6] draws attention to distributed computing
networks that contain a very large number of sensor nodes: aggregating the data streams of every
node becomes quite a challenging task, and it also poses challenges for mining, because the
result of one node’s mining algorithm must be integrated across the different nodes of the
network.
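One common way to limit the communication cost described above is to let each node reduce its own stream to a small, mergeable summary and send only that summary onward. The sketch below illustrates the idea with count, sum, minimum and maximum. It is not the method of [6]; the `Summary` class, the `summarize` helper and the node names are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass
class Summary:
    """Fixed-size summary of a stream that can be merged with other summaries."""
    count: int = 0
    total: float = 0.0
    minimum: float = float("inf")
    maximum: float = float("-inf")

    def add(self, value: float) -> None:
        """Update the summary with one new reading."""
        self.count += 1
        self.total += value
        self.minimum = min(self.minimum, value)
        self.maximum = max(self.maximum, value)

    def merge(self, other: "Summary") -> "Summary":
        """Combine two summaries without access to the raw readings."""
        return Summary(
            count=self.count + other.count,
            total=self.total + other.total,
            minimum=min(self.minimum, other.minimum),
            maximum=max(self.maximum, other.maximum),
        )

    @property
    def mean(self) -> float:
        return self.total / self.count if self.count else float("nan")


def summarize(stream: Iterable[float]) -> Summary:
    """Run locally on a node: one pass over its own stream."""
    summary = Summary()
    for value in stream:
        summary.add(value)
    return summary


# Hypothetical readings held at three sensor nodes.
node_streams: Dict[str, List[float]] = {
    "node_a": [20.0, 21.0, 22.0],
    "node_b": [30.0],
    "node_c": [10.0, 14.0],
}

# Each node sends four numbers instead of its whole stream.
local = [summarize(stream) for stream in node_streams.values()]

network = Summary()
for summary in local:
    network = network.merge(summary)

print(network.count, network.minimum, network.maximum, network.mean)
```

Because `merge` is associative and commutative, the summaries can be combined in any order, for example along a routing tree. Statistics that are not mergeable in this simple way, such as an exact median, need more elaborate techniques.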
Further, according to Russom [7], the data that need to be analyzed are not just large but are
also composed of various data types, even including streaming data. Big data have the unique
features of being “massive, high dimensional, heterogeneous, complex, unstructured, incomplete,
noisy, and erroneous,” which may change the statistical and data analysis approaches [8].
Although big data seem to make it possible to collect more data and so find more useful
information, more data do not necessarily mean more useful information; they may contain more
ambiguous or abnormal data. For instance, a user may have multiple accounts, or an account may
be used by multiple users, which may degrade the accuracy of the mining results [9]. Sensor data
bring numerous challenges in the context of data collection, storage and processing. This
is because sensor data processing often requires
efficient, real-time processing of massive volumes of possibly uncertain data. Charu
Aggarwal [10] therefore introduced several new issues in sensor data processing, such as data
collection, sensor mining, application-specific issues, security, storage, and quality of data.
In contrast to traditional data analytics, Baraniuk [11] pointed out that the bottleneck of big
data analytics will shift from the sensor to the processing, communication and storage of sensing
data for their better management and maintenance in one single machine, as shown in Figure 1.
Figure 1. Difference between traditional data analysis and big data analysis on a wireless sensor network
2. COMPARATIVE ANALYSIS
Many efficient mining methods exist for analyzing data in structured and semi-structured
formats; they are listed in Table 1.
Table 1: Efficient data analytics methods for data mining
Most data analysis methods have limitations for big data analysis: they are unscalable and
centralized, they are non-dynamic, and they assume a uniform data structure. Because traditional
data analysis methods were not designed for large-scale and complex data, they are almost
incapable of analyzing big data.
Several studies have attempted to present an efficient or effective solution at the system
level (e.g., framework and platform) or at the algorithm level. A simple comparison of these big
data analysis technologies from different perspectives is given in Table 2. The “Perspective”
column of the table indicates whether a study focuses on the framework or the algorithm level;
the “Description” column gives the further goal of the study.
Table 2: Big data analysis frameworks and methods
3. CONCLUSION AND FUTURE RESEARCH
In this article, we reviewed studies on data analytics, from traditional data analysis to
recent big data analysis. From the system perspective, the KDD process is used as the framework
for these studies and is summarized in three parts: input, analysis, and output.
From the analysis framework perspective, this study shows that big data frameworks, platforms,
and machine learning are the current research trends in big data analytics systems. From the
mining algorithm perspective, clustering, classification, and frequent pattern mining play a
vital role in this research, because several data analysis problems can be mapped to these
essential issues.
REFERENCES
[1] Charu C. Aggarwal, (2012), “Real-Time Data Analytics in Sensor Networks”, Managing and
Mining Sensor Data Journal, Springer publisher, ISBN 978-1-4614-6309-2, pp173-201.
[2] Steve Lohr, (2012), The age of big data. Rep.
http://wolfweb.unr.edu/homepage/ania/NYTFeb12.pdf
[3] Fisher D, DeLine R, Czerwinski M, Drucker S. (2012), “Interactions with big data analytics”.
Interactions Journal, ACM, Vol. 19, pp50–9.
[4] R, Wunsch D. (2009), Clustering. Hoboken: Wiley-IEEE Press.
[5] Fayyad UM, Piatetsky-Shapiro G, Smyth P. (1996), “From data mining to knowledge discovery in
databases”. AI Mag, Vol. 17, pp37–54.
[6] Charu C. Aggarwal, (2012), “Mining Sensor Data Streams”, Managing and Mining Sensor Data
Journal, Springer publisher, ISBN 978-1-4614-6309-2, pp143-166.
[7] Russom P. (2011), Big data analytics. TDWI: Tech. Rep.
[8] Ma C, Zhang HH, Wang X. (2014), “Machine learning for big data analytics in plants”. Trends
Plant Sci., Vol. 19, pp798–808.
[9] Boyd D, Crawford K. (2012), “Critical questions for big data”. Inform Commun Soc., Vol. 15,
pp662–79.
[10] Charu C. Aggarwal, (2012), “An Introduction to Sensor Data Analytics”, Managing and Mining
Sensor Data Journal, Springer publisher, ISBN 978-1-4614-6309-2, pp1-8.
[11] Baraniuk RG. (2011), “More is less: signal processing and the data deluge”. Science, Vol. 331,
pp717–9.
Authors
Kavita Ahuja
(Assistant Professor) received the B.Sc. degree in mathematics and the M.C.A. degree in 2005
and 2008, respectively, from Veer Narmad South Gujarat University, Surat, India.
Since 2014 she has been pursuing research as a Ph.D. scholar in the area of sensor data
analytics at Hemchandracharya North Gujarat University, Patan, India, under the
guidance of Dr. N. N. Jani (Director, KSV). She has been working as an Assistant
Professor in a B.C.A. college since 2008.
N. N. Jani
He currently contributes as Mentor to the Institute Industry Interaction Cell of KSV
University at Gandhinagar. He served Saurashtra University, Rajkot, for more than
two decades as Professor and Head, Department of Computer Science, until
July 2008, and thereafter as Director at SK Patel Institute of Management and
Computer Studies, Gandhinagar, up to August 2015, with a rich teaching and research
experience of 41 years. He successfully guided 36 scholars who completed their
PhD in Computer Science and published more than 82 research papers during this
tenure. His research areas are High Performance Computing, Big Data Analytics,
Smart Embedded Systems, and Nano Materials Characterization.
