Dear students get fully solved assignments
Send your semester & Specialization name to our mail id :
help.mbaassignments@gmail.com
or
call us at : 08263069601
ASSIGNMENT
DRIVE WINTER 2016
PROGRAM MCA(REVISED FALL 2007)
SEMESTER 6
SUBJECT CODE & NAME MC0088- DATA MINING
BK ID B1009
CREDITS 4
MARKS 60
Note: Answer all questions. Answers to 10-mark questions should be approximately 400 words each.
Each question is followed by an evaluation scheme.
Question. 1. Differentiate between Data Mining and Data Warehousing.
Answer: Data warehousing is about storing analytical data in a structure suitable for data
mining. This analytical data is extracted from the operational systems, usually on a daily basis.
Data mining is a set of techniques used to search, retrieve and analyze data from a data warehouse
(or other data storage mechanism) to answer
Question. 2. What are the key features of a Data Warehouse? Explain.
Answer: A data warehouse can be defined as a ‘Structural Repository’ of historic data. It is developed in
an evolutionary process by integrating data from non-integrated systems such as text files, Excel sheets
and databases. (The same is shown in the diagram below.)
Question. 3. Differentiate between Data Integration and Transformation.
Answer: ETL is a type of data integration and involves an architecture that extracts, transforms and
then loads data into a target database or file. Other forms of data integration include ELT (Extract,
Load and Transform), ELTL and EII. There is also manual data integration, where a user exports a
database table and imports it into another database. ETL is most commonly used as the name for a
specific type of data integration tool.
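The extract, transform and load stages described above can be illustrated with a minimal sketch. It is not taken from the assignment: the sample CSV data, the `orders` table and all function names are assumptions made for illustration, and an in-memory SQLite database stands in for the target database.

```python
import csv
import io
import sqlite3

# Hypothetical export from an operational system (illustrative data only).
SOURCE_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,4.25
3,alice,7.00
"""


def extract(text):
    """Extract: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim and lower-case names, convert ids and amounts to numbers."""
    return [
        (int(r["order_id"]), r["customer"].strip().lower(), float(r["amount"]))
        for r in rows
    ]


def load(records, conn):
    """Load: write the cleaned records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", records)
    conn.commit()


def run_etl(text=SOURCE_CSV):
    """Run the three stages in order and return the target connection."""
    conn = sqlite3.connect(":memory:")  # assumption: in-memory target database
    load(transform(extract(text)), conn)
    return conn
```

In an ELT variant the raw rows would be loaded first and the cleaning step would run inside the target database instead of in `transform`.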
Question. 4. Differentiate between database management systems (DBMS) and data mining.
Answer: A DBMS (Database Management System) is a complete system used for managing digital
databases that allows storage of database content, creation/maintenance of data, search and other
functionalities. On the other hand, Data Mining is a field in computer science which deals with the
extraction of previously unknown and interesting information from raw data. Usually, the data used
as the input for the Data Mining process is stored in databases. Users who are inclined toward
statistics use Data Mining. They utilize
Question. 5. Differentiate between K-means and Hierarchical clustering.
Answer: There are a number of important differences between k-means and hierarchical clustering,
ranging from how the algorithms are implemented to how you can interpret the results.
The k-means algorithm is parameterized by the value k, which is the number of clusters that you
want to create. As the animation below illustrates, the algorithm begins by creating k centroids. It
then iterates between an assign step (where
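The answer above is cut off in the source, so the following is only a sketch of the standard k-means loop, not the author's own explanation. It assumes the usual formulation in which the assign step attaches each point to its nearest centroid and a second, update step moves each centroid to the mean of its assigned points. Using the first k points as initial centroids is an assumption made to keep the example deterministic.

```python
def assign(points, centroids):
    """Assign step: label each point with the index of its nearest centroid."""
    labels = []
    for p in points:
        dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
        labels.append(dists.index(min(dists)))
    return labels


def update(points, labels, centroids):
    """Update step: move each centroid to the mean of the points assigned to it."""
    new_centroids = []
    for j, old in enumerate(centroids):
        members = [p for p, label in zip(points, labels) if label == j]
        if not members:
            new_centroids.append(old)  # keep an empty cluster's centroid in place
        else:
            new_centroids.append(
                tuple(sum(col) / len(members) for col in zip(*members))
            )
    return new_centroids


def kmeans(points, k, max_iter=100):
    """Alternate assign and update until the centroids stop moving."""
    centroids = [tuple(p) for p in points[:k]]  # assumption: first k points as seeds
    for _ in range(max_iter):
        labels = assign(points, centroids)
        new_centroids = update(points, labels, centroids)
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, labels
```

Calling `kmeans([(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)], k=2)` groups the two points near the origin together and the two distant points together. Hierarchical clustering, by contrast, does not take k up front: it builds a tree of nested clusters that can be cut at any level afterwards.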
Question. 6. Differentiate between Web content mining and Web usage mining.
Answer: Web mining is a rapidly growing research area. It consists of Web usage mining, Web
structure mining, and Web content mining. Web usage mining refers to the discovery of user access
patterns from Web usage logs. Web structure mining tries to discover useful knowledge from the
structure of hyperlinks. Web content mining aims to extract/mine useful information or knowledge
from web page contents. This tutorial focuses on Web Content Mining.
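As a rough illustration of Web usage mining, the sketch below counts page-to-page transitions per user from a usage log. The log format (`<user> <timestamp> <url>`), the sample entries and the function names are all hypothetical; real server logs are more complex and would need a proper parser and session splitting.

```python
from collections import Counter, defaultdict

# Hypothetical, simplified usage log: "<user> <timestamp> <url>" per line.
LOG = """\
u1 1 /home
u2 2 /home
u1 3 /products
u2 4 /about
u1 5 /cart
u3 6 /home
u3 7 /products
"""


def parse(log_text):
    """Yield (user, timestamp, url) tuples from the simplified log format."""
    for line in log_text.splitlines():
        if not line.strip():
            continue
        user, ts, url = line.split()
        yield user, int(ts), url


def transitions(log_text):
    """Count how often users move directly from one page to another."""
    visits = defaultdict(list)
    # Order entries by timestamp so each user's page sequence is chronological.
    for user, _, url in sorted(parse(log_text), key=lambda r: r[1]):
        visits[user].append(url)
    counts = Counter()
    for pages in visits.values():
        counts.update(zip(pages, pages[1:]))  # consecutive page pairs
    return counts
```

Here `transitions(LOG).most_common(1)` reports the most frequent access pattern in the sample log. Web content mining would instead analyze the text or markup of the pages themselves.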