SlideShare a Scribd company logo
GROUP 5 Master Jenniferson  Napallatan Neri  Openiano Ostil
TOPICS Index Indexed Sequential File Properties of Indexed Sequential  File Datawarehousing
Index Indexes provide fast searching of a  table based on one or more key  columns. Indexes on foreign keys can also greatly improve the performance of join.
Indexed Sequential File A file combining properties of  random-access files and sequential  files
Records in indexed sequential  files are stored in the order that they  are written to the disk. Records may be retrieved in sequential order or in random order using a numeric index to represent the record number in file
Properties Primary Storage Area : Records in indexed sequential files are stored in the order that they are written to the disk. Records may be retrieved in sequential order or in random order using a numeric index to represent the record number in the file.
Properties Records are stored sequentially,  originally to speed access on a tape system. In contrast, a relational database uses a query optimizer which automatically Selects indexes. The record size, specified when the file is created, may range from 1 to 8000 bytes.
Properties 2. Separate Indexes : The Indexed Access method of reading  or writing data only provides the desired outcome if in fact the file is organized as an ISAM file with the appropriate, previously defined keys. Access to data via the previously defined key(s) is extremely fast.
Properties Multiple keys, overlapping keys and  key compression within the hash  tables are supported. A utility to define/redefine keys in existing files is provided. Records can be deleted, although "garbage collection" is done via a separate utility.
Properties 3. Overflow Area : When an ISAM file is created, index  nodes are fixed, and their pointers do not change during inserts and deletes that occur later (only content of leaf nodes change afterwards).
Properties node exceed the node's capacity,  new records are stored in overflow chains. If there are more inserts than deletions from a table, these overflow chains can gradually become very large, and this affects the time required for retrieval of a record.
Properties Indexed sequential files:  commonly used for transaction files  because they take less disk space than keyed files, and are faster to read from beginning to end than a keyed file.
Data Warehousing What is a Data Warehouse? DW is a  subject-oriented ,  integrated ,  time-variant , and  nonvolatile collection  of data intended to support management decision making
Data Warehousing What is a Data Warehouse? DW is a  subject-oriented ,  integrated , time-variant , and  nonvolatile collection  of  data intended to support management  decision making
Data Warehousing DATABASE vs DATA WAREHOUSE Database:  transactional  (relational, object-oriented, network, heierarchical) Data Warehouse: mainly INTENDED for decision support applications **optimized for retrieval not routine transactional processing**
Data Warehousing What is a Data Warehousing? combining  multiple   and usually  varied sources  into one  comprehensive  and  easily manipulated  database.  (wiseGEEK.com)
Data Warehousing Properties: 1. Organized around major subject areas of an org.  (i.e. sales ,suppliers,products, etc.) 2. Integrated from multiple operational  OLTP data sources ** OLTP = OnLine Transaction Processing db
Data Warehousing Properties: 3. Periodic updates  (based on schedules) There is a trend wherein updates are gearing towards near real-time reporting  of business analytics.
Data Warehousing Advantages: Competitive advantage Increased productivity of corporate decision makers 3. Potential high return on investment as the org. Finds the best way to impove efficiency and/or profitability
Data Warehousing Encountered Problems: Underestimation of resources required to load the data 2. Hidden data integrity problems in source  data 3. Omitting data later found to be required
Data Warehousing Encountered Problems: 4. Ever increasing end user demands 5. Consolidating data from diparate data sources 6. High resource demands  (huge amount of  storage; queries that process millions of rows) 7. Ownership of data
Data Warehousing Encountered Problems: 8. Difficulty in determining what the business really wants or needs to  analyze 9. “Big Bang” projects that seem never-ending

More Related Content

PPTX
Hadoop distributed file system
PDF
Cs437 lecture 14_15
PDF
Benefits of data_archiving_in_data _warehouses
PPTX
Chapter 7(documnet databse termininology) no sql for mere mortals
PPTX
Hadoop Distributed File System
PPTX
Apache Hive
PDF
Ceph Days 2014 Paul Evans Slide Deck
PPTX
Design of Hadoop Distributed File System
Hadoop distributed file system
Cs437 lecture 14_15
Benefits of data_archiving_in_data _warehouses
Chapter 7(documnet databse termininology) no sql for mere mortals
Hadoop Distributed File System
Apache Hive
Ceph Days 2014 Paul Evans Slide Deck
Design of Hadoop Distributed File System

What's hot (20)

PPTX
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
PPTX
Big Data and Hadoop
PDF
The Object Storage Chart - A to Z
PPTX
Apache storm
PDF
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
PPTX
Chapter 5 design of keyvalue databses from nosql for mere mortals
PPTX
Big Data and Hadoop
PPTX
Data Warehouse
PDF
Hadoop & Data Warehouse
PPTX
Chapter 8(designing of documnt databases)no sql for mere mortals
PDF
Hdfs Dhruba
PPTX
Hadoop
PDF
Data cloud lab version v.001.2020
PPT
PPT
Data Archiving -Ramesh sap bw
PDF
Hadoop
PPT
Apache hadoop and hive
PDF
Pillars of Heterogeneous HDFS Storage
PPTX
Data warehouse
PPTX
FileTable and Semantic Search in SQL Server 2012
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
Big Data and Hadoop
The Object Storage Chart - A to Z
Apache storm
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Chapter 5 design of keyvalue databses from nosql for mere mortals
Big Data and Hadoop
Data Warehouse
Hadoop & Data Warehouse
Chapter 8(designing of documnt databases)no sql for mere mortals
Hdfs Dhruba
Hadoop
Data cloud lab version v.001.2020
Data Archiving -Ramesh sap bw
Hadoop
Apache hadoop and hive
Pillars of Heterogeneous HDFS Storage
Data warehouse
FileTable and Semantic Search in SQL Server 2012
Ad

Viewers also liked (10)

PPT
Fileorganization AbnMagdy
PDF
AN INVERTED LIST BASED APPROACH TO GENERATE OPTIMISED PATH IN DSR IN MANETS –...
PPT
external sorting
PDF
Files and data storage
PPT
File organization
PPT
File organisation
PPT
B trees in Data Structure
PPTX
File Organization
PPT
File Management
Fileorganization AbnMagdy
AN INVERTED LIST BASED APPROACH TO GENERATE OPTIMISED PATH IN DSR IN MANETS –...
external sorting
Files and data storage
File organization
File organisation
B trees in Data Structure
File Organization
File Management
Ad

Similar to Csci12 report aug18 (20)

PDF
data warehousing
DOC
Data warehouse concepts
PPTX
Data warehouse - Nivetha Durganathan
PPTX
DBMS (UNIT 5)
DOC
Informatica and datawarehouse Material
PPT
PPTX
Data warehouse
PDF
Introduction to Data Warehouse
DOC
Data mining notes
PPTX
datamining techniques and various tools.pptx
PPT
Teradata a z
PPT
A Comparsion of Databases and DataWarehouses.ppt
PDF
File organisation and B tree easy presentation
PPTX
1. Data warehouse Fundamentals for MCA SPPU.pptx
PPTX
WEEK 1 - Data mining and Warehouse.pptx
PPTX
DATA WAREHOUSING.2.pptx
PPTX
Introduction to Data Warehousing
PPTX
Data Management
data warehousing
Data warehouse concepts
Data warehouse - Nivetha Durganathan
DBMS (UNIT 5)
Informatica and datawarehouse Material
Data warehouse
Introduction to Data Warehouse
Data mining notes
datamining techniques and various tools.pptx
Teradata a z
A Comparsion of Databases and DataWarehouses.ppt
File organisation and B tree easy presentation
1. Data warehouse Fundamentals for MCA SPPU.pptx
WEEK 1 - Data mining and Warehouse.pptx
DATA WAREHOUSING.2.pptx
Introduction to Data Warehousing
Data Management

Recently uploaded (20)

PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
KodekX | Application Modernization Development
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Spectroscopy.pptx food analysis technology
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPT
Teaching material agriculture food technology
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Encapsulation theory and applications.pdf
PDF
Machine learning based COVID-19 study performance prediction
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PDF
cuic standard and advanced reporting.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Encapsulation_ Review paper, used for researhc scholars
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
KodekX | Application Modernization Development
The Rise and Fall of 3GPP – Time for a Sabbatical?
Advanced methodologies resolving dimensionality complications for autism neur...
Review of recent advances in non-invasive hemoglobin estimation
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Spectroscopy.pptx food analysis technology
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Teaching material agriculture food technology
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Encapsulation theory and applications.pdf
Machine learning based COVID-19 study performance prediction
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
cuic standard and advanced reporting.pdf

Csci12 report aug18

  • 1. GROUP 5 Master Jenniferson Napallatan Neri Openiano Ostil
  • 2. TOPICS Index Indexed Sequential File Properties of Indexed Sequential File Datawarehousing
  • 3. Index Indexes provide fast searching of a table based on one or more key columns. Indexes on foreign keys can also greatly improve the performance of join.
  • 4. Indexed Sequential File A file combining properties of random-access files and sequential files
  • 5. Records in indexed sequential files are stored in the order that they are written to the disk. Records may be retrieved in sequential order or in random order using a numeric index to represent the record number in file
  • 6. Properties Primary Storage Area : Records in indexed sequential files are stored in the order that they are written to the disk. Records may be retrieved in sequential order or in random order using a numeric index to represent the record number in the file.
  • 7. Properties Records are stored sequentially, originally to speed access on a tape system. In contrast, a relational database uses a query optimizer which automatically Selects indexes. The record size, specified when the file is created, may range from 1 to 8000 bytes.
  • 8. Properties 2. Separate Indexes : The Indexed Access method of reading or writing data only provides the desired outcome if in fact the file is organized as an ISAM file with the appropriate, previously defined keys. Access to data via the previously defined key(s) is extremely fast.
  • 9. Properties Multiple keys, overlapping keys and key compression within the hash tables are supported. A utility to define/redefine keys in existing files is provided. Records can be deleted, although "garbage collection" is done via a separate utility.
  • 10. Properties 3. Overflow Area : When an ISAM file is created, index nodes are fixed, and their pointers do not change during inserts and deletes that occur later (only content of leaf nodes change afterwards).
  • 11. Properties node exceed the node's capacity, new records are stored in overflow chains. If there are more inserts than deletions from a table, these overflow chains can gradually become very large, and this affects the time required for retrieval of a record.
  • 12. Properties Indexed sequential files: commonly used for transaction files because they take less disk space than keyed files, and are faster to read from beginning to end than a keyed file.
  • 13. Data Warehousing What is a Data Warehouse? DW is a subject-oriented , integrated , time-variant , and nonvolatile collection of data intended to support management decision making
  • 14. Data Warehousing What is a Data Warehouse? DW is a subject-oriented , integrated , time-variant , and nonvolatile collection of data intended to support management decision making
  • 15. Data Warehousing DATABASE vs DATA WAREHOUSE Database: transactional (relational, object-oriented, network, heierarchical) Data Warehouse: mainly INTENDED for decision support applications **optimized for retrieval not routine transactional processing**
  • 16. Data Warehousing What is a Data Warehousing? combining multiple and usually varied sources into one comprehensive and easily manipulated database. (wiseGEEK.com)
  • 17. Data Warehousing Properties: 1. Organized around major subject areas of an org. (i.e. sales ,suppliers,products, etc.) 2. Integrated from multiple operational OLTP data sources ** OLTP = OnLine Transaction Processing db
  • 18. Data Warehousing Properties: 3. Periodic updates (based on schedules) There is a trend wherein updates are gearing towards near real-time reporting of business analytics.
  • 19. Data Warehousing Advantages: Competitive advantage Increased productivity of corporate decision makers 3. Potential high return on investment as the org. Finds the best way to impove efficiency and/or profitability
  • 20. Data Warehousing Encountered Problems: Underestimation of resources required to load the data 2. Hidden data integrity problems in source data 3. Omitting data later found to be required
  • 21. Data Warehousing Encountered Problems: 4. Ever increasing end user demands 5. Consolidating data from diparate data sources 6. High resource demands (huge amount of storage; queries that process millions of rows) 7. Ownership of data
  • 22. Data Warehousing Encountered Problems: 8. Difficulty in determining what the business really wants or needs to analyze 9. “Big Bang” projects that seem never-ending