SlideShare a Scribd company logo
LOGO


  FILE STRUCTURE AND FILE
       ORGANIZATION




                         BY-
                   KHUSHBU
              RISHIKA ARORA
wps.cn/moban



                   Topics covered

What are files?
File related keywords.
File organization.
            1. Introduction
File organizing methods.
             2. Strategy
  *heap file organization.
  *sequential files organization.
             3. Challenges Forward
  *indexed file organization.
  *inverted4. Conclusion
              file organization.
  *direct file organization.
Comparison
Quiz
wps.cn/moban



                        FILES

A file is a collection of data that is treated as
  a single unit on a peripheral device.
TYPES OF FILES-
MASTER FILE
 It contains record of permanent data
  types.

 They are created when you install your
  business.
Work files
a program can work efficiently if a work file
  is used.-a program can work efficiently if a
  work file is used.
wps.cn/moban



                   FILES

TRANSACTION FILE

Contains data which is used to update
the records of master file. Ex-address
of a customer.

Transaction files also serves as audit
trails and history of the organization.
BASIC FILE RELATED KEYWORKS
•Byte:- It is the smallest addressable unit in computer. A byte is
a set of 8 bits and
represents a character.

•Element:- It is a combination of one or more bytes. It is
referred to as a field. A field is actually a physical space on tape
or disk. A roll number, age, name of employee etc. are examples
of it.

File: - It is a collection of similar records. The records will have
the same fields but
different values in each record. The size of a file is limited by
the size of memory
available.
BASIC FILE RELATED KEYWORDS
•Database: - It is a set of interrelated files. The files in combination
  tend to link to a common solution. For example, a student
attendance file, a student result file, a student
admission file, etc. are related to academic software pertaining to
students.

•Record: - The elements related to are combined into a record. An
employee has a record with his name, designation, basic pay,
allowances, deductions etc. as its fields. A record may have a
unique key to identify a record e.g. employee number. Records are
represented as logical & physical records. A logical record
maintains a logical
relationship among all the data items in the record. It is the way
the program or user sees the data. In contrast a physical record is
the way data are recorded on a storage medium.
FILE ORGANIZATION

It is the methodology which is applied to structured computer
files. Files contain computer records which can be documents
or information which is stored in a certain way for later
retrieval.

File organization refers primarily to the logical arrangement
of data in a file system. It should not be confused with the
physical storage of the file in some types of storage media.
There are certain basic types of computer file, which can
include files stored as blocks of data and streams of data,
where the information streams out of the file while it is being
read until the end of the file is encountered.
Methods of organizing files


Different methods of organizing files-

1.Heap
2.Sequential
3.Indexed-sequential
4.Inverted list
5.Direct access
Choosing a file organization is a design decision, hence it
must be done having in mind the achievement of good
performance with respect to the most likely usage of the
file. The criteria usually considered important are:

  1. Fast access to single record or collection of related
records.
  2. Easy record adding/update/removal, without
disrupting .
  3. Storage efficiency.
  4. Redundancy as a warranty against data corruption.
Heap files(unordered)

Basically these files are unordered files. It is the simplest and
most basic type. These files consist of randomly ordered
records. The records will have no particular order.
The operations we can perform on the records are insert,
retrieve and delete. The features of the heap file or the pile file
Organisation are:

1.New records can be inserted in any empty space that can
accommodate them.
2.When old records are deleted, the occupied space becomes
empty and available for any new insertion.
3.If updated records grow; they may need to be relocated
(moved) to a new empty space. This needs to keep a list of
empty space.
Advantages and disadvantages
Advantages
1.This is a simple file Organisation method.
2. Insertion is somehow efficient.
3. Good for bulk-loading data into a table.
4. Best if file scans are common or insertions are
frequent.

Disadvantages

1.Retrieval requires a linear search and is
inefficient.
2. Deletion can result in unused space/need for
reorganisation.
Heap file organization

In the below figure, we can see a sample of heap file
organization for EMPLOYEE relation which consists of 8
records stored in 3 contiguous blocks, each blocks can
contains at most 3 records.
Sequential file organization
            Sequential file organization
•Stored in key sequence.
•Adding/deleting requires making new file.
•Used as master file.
•Records in these files can only be read or written
sequentially.
Sequential file organization


•Records are also in sequence
within each block. To access a
record, previous records within
the block are scanned. Thus
sequential record design is best
suited for “get next” activities,
reading one record after
another without a search delay.

•records can be added only at
the end of the file.
Advantages and disadvantages
ADVANTAGES
Simple file design
Very efficient when most of the records must be processed
e.g. Payroll
Very efficient if the data has a natural order
Can be stored on inexpensive devices like magnetic tape.

DISADVANTAGES

Entire file must be processed even if a single record is to be
searched.
Transactions have to be sorted before processing
Overall processing is slow.
Indexed-sequential organization
Each record of a file has a key field which uniquely
identifies that record.
An index consists of keys and addresses.
An indexed sequential file is a sequential file (i.e. sorted
into order of a key field) which has an index.
A full index to a file is one in which there is an entry for
every record.

When a record is inserted or deleted in a file the data
can be added at any location in the data file. Each index
must also be updated to reflect the change. For a simple
sequential index this may mean rewriting the
index for each insertion.
Indexed-sequential organization
Indexed-sequential organization
File organisation
File organisation
Indexed-sequential organization

Indexed sequential files are important for
applications where data needs to be accessed.....
Sequentially
randomly using the index.

An indexed sequential file can only be stored on a
random access device
e.g. magnetic disc, CD.
ADVANTAGES AND DISADVANTAGES


Advantages
Provides flexibility for users who need both type of
accesses with the same file.
Faster than sequential.


Disadvantages

Extra storage space for the index is required
Inverted list organization

Like the indexed-sequential storage method, the
inverted list organization maintains an index. The
two methods differ, however, in the index level and
record storage. The indexed- sequential method
has a multiple index for a given key, whereas
the inverted list method has a single index for each
key type.
The records are not necessarily stored in a
sequence. They are placed in the are data storage
area, but indexes are updated for the record keys
and location.
ADVANTAGES AND DISADVANTAGES



Advantages
The benefits are apparent immediately because
searching is fast

disadvantages
inverted list files use more media space and the
storage devices get full quickly with this type of
organization.
updating is much slower.
Direct/random file organization

Records are read directly from or written on to the
file.
The records are stored at known address.
Address is calculated by applying a mathematical
function to the key field.
A random file would have to be stored on a direct
  access backing storage medium e.g. magnetic disc,
  CD, DVD

Example : Any information retrieval system. Eg Train
timetable system.
Advantages and disadvantages
Advantages
Any record can be directly accessed.
Speed of record processing is very fast.
Up-to-date file because of online updating.
Concurrent processing is possible.
 Transactions need not be sorted.
Disadvantages
More complex than sequential.
Does not fully use memory locations.
More security and backup problems.
 Expensive hardware and software are required.
 System design is complex and costly.
 File updation is more difficult as compared to sequential
files.
wps.cn/moban



Comparison
Quiz
1.Different types of files are
a)Master
Transaction
Backup

b)Archive
Table
Report

c)Dump
Library

2. Major criteria for selecting a File organization are
1. Method of processing of file
2. Size of data
3. File inquiry capability
4. File volatility
5. Response time
6. Activity ratio
Quiz
3.What is file organization?

4.What are advantages of sequential file
organization?

5. True or false (indexed sequential file)
 The data can be added at any location in the file.

6.Give an example of direct file organization?

7.Give one advantage and disadvantage of direct
file organization?
Thank You

More Related Content

PDF
Complete dbms notes
PPTX
Hashing
PPTX
Normalization in DBMS
PDF
Supply and Demand
PPT
Document Management System(DMS)
PPTX
social, legal and ethical issues of e-commerce..
PPT
Software estimation
PPTX
Efficient Market Hypothesis (EMH)
Complete dbms notes
Hashing
Normalization in DBMS
Supply and Demand
Document Management System(DMS)
social, legal and ethical issues of e-commerce..
Software estimation
Efficient Market Hypothesis (EMH)

What's hot (20)

PPT
File organization 1
PPTX
File system structure
PPTX
File organization
PDF
File organization
PPT
File structures
PPTX
Rdbms
PPT
Chapter 10 - File System Interface
PPTX
FIle Organization.pptx
PPT
File organization
DOC
rdbms-notes
PPTX
Distributed file system
PPTX
File Organization
PPTX
directory structure and file system mounting
PPTX
FUNCTION DEPENDENCY AND TYPES & EXAMPLE
PPT
OPERATING SYSTEMS DESIGN AND IMPLEMENTATION
PPT
File organization and indexing
PPT
Databases: Normalisation
PDF
File organisation
PPT
Database security
 
File organization 1
File system structure
File organization
File organization
File structures
Rdbms
Chapter 10 - File System Interface
FIle Organization.pptx
File organization
rdbms-notes
Distributed file system
File Organization
directory structure and file system mounting
FUNCTION DEPENDENCY AND TYPES & EXAMPLE
OPERATING SYSTEMS DESIGN AND IMPLEMENTATION
File organization and indexing
Databases: Normalisation
File organisation
Database security
 
Ad

Similar to File organisation (20)

PPT
File organisation in system analysis and design
PPT
File organisation
PPTX
File organization and introduction of DBMS
PDF
fileorganizationandintroductionofdbms-210313163900.pdf
PPTX
Wk 1 - File organization.pptx
PPTX
Unit-1-Lecture-9.pptx file structure semester
PPTX
normalization process in relational data base management
PPTX
File Structure.pptx
PPTX
Unit 6 OSY.pptx aaaaaaaaaaaaaaaaaaaaaaaa
PPTX
File Management
PPTX
Chapter 12.pptx
PPTX
Chapter 3
PPTX
file organization ppt on dbms types of f
PPTX
(file systems)12312321321321312312312.pptx
PPTX
Operating System Unit 4(RTU Syllabus).pptx
PPTX
File Concept.pptx fa s fasfasfasfsfsfasfasfas
PPTX
Application portfolio development.advadisadvan.pptx
PPTX
overview of storage and indexing BY-Pratik kadam
PDF
File Systems
DOCX
File organization in database
File organisation in system analysis and design
File organisation
File organization and introduction of DBMS
fileorganizationandintroductionofdbms-210313163900.pdf
Wk 1 - File organization.pptx
Unit-1-Lecture-9.pptx file structure semester
normalization process in relational data base management
File Structure.pptx
Unit 6 OSY.pptx aaaaaaaaaaaaaaaaaaaaaaaa
File Management
Chapter 12.pptx
Chapter 3
file organization ppt on dbms types of f
(file systems)12312321321321312312312.pptx
Operating System Unit 4(RTU Syllabus).pptx
File Concept.pptx fa s fasfasfasfsfsfasfasfas
Application portfolio development.advadisadvan.pptx
overview of storage and indexing BY-Pratik kadam
File Systems
File organization in database
Ad

More from Mukund Trivedi (20)

PPTX
System development life cycle (sdlc)
PPTX
Process of design
PPT
New file and form 2
PPTX
Evaluation
PPTX
Database
PPTX
Case tools
PPTX
Evaluation
PPTX
Dfd final
DOCX
C++ file
PPT
Ff40fnatural resources (1)
PPT
Ff40fnatural resources
PPT
F58fbnatural resources 2 (1)
PPT
F58fbnatural resources 2
PPT
F6dc1 session6 c++
DOC
Ee2fbunit 7
PPT
E212d9a797dbms chapter3 b.sc2 (2)
PPT
E212d9a797dbms chapter3 b.sc2 (1)
PPT
E212d9a797dbms chapter3 b.sc2
PPT
C96e1 session3 c++
System development life cycle (sdlc)
Process of design
New file and form 2
Evaluation
Database
Case tools
Evaluation
Dfd final
C++ file
Ff40fnatural resources (1)
Ff40fnatural resources
F58fbnatural resources 2 (1)
F58fbnatural resources 2
F6dc1 session6 c++
Ee2fbunit 7
E212d9a797dbms chapter3 b.sc2 (2)
E212d9a797dbms chapter3 b.sc2 (1)
E212d9a797dbms chapter3 b.sc2
C96e1 session3 c++

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Approach and Philosophy of On baking technology
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Tartificialntelligence_presentation.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
August Patch Tuesday
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Machine learning based COVID-19 study performance prediction
PDF
Mushroom cultivation and it's methods.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
A Presentation on Artificial Intelligence
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Getting Started with Data Integration: FME Form 101
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
Spectral efficient network and resource selection model in 5G networks
Approach and Philosophy of On baking technology
Per capita expenditure prediction using model stacking based on satellite ima...
gpt5_lecture_notes_comprehensive_20250812015547.pdf
cloud_computing_Infrastucture_as_cloud_p
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Tartificialntelligence_presentation.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
August Patch Tuesday
Unlocking AI with Model Context Protocol (MCP)
NewMind AI Weekly Chronicles - August'25-Week II
Machine learning based COVID-19 study performance prediction
Mushroom cultivation and it's methods.pdf
Encapsulation theory and applications.pdf
Assigned Numbers - 2025 - Bluetooth® Document
A Presentation on Artificial Intelligence
Univ-Connecticut-ChatGPT-Presentaion.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Getting Started with Data Integration: FME Form 101
Mobile App Security Testing_ A Comprehensive Guide.pdf

File organisation

  • 1. LOGO FILE STRUCTURE AND FILE ORGANIZATION BY- KHUSHBU RISHIKA ARORA
  • 2. wps.cn/moban Topics covered What are files? File related keywords. File organization. 1. Introduction File organizing methods. 2. Strategy *heap file organization. *sequential files organization. 3. Challenges Forward *indexed file organization. *inverted4. Conclusion file organization. *direct file organization. Comparison Quiz
  • 3. wps.cn/moban FILES A file is a collection of data that is treated as a single unit on a peripheral device. TYPES OF FILES- MASTER FILE  It contains record of permanent data types.  They are created when you install your business. Work files a program can work efficiently if a work file is used.-a program can work efficiently if a work file is used.
  • 4. wps.cn/moban FILES TRANSACTION FILE Contains data which is used to update the records of master file. Ex-address of a customer. Transaction files also serves as audit trails and history of the organization.
  • 5. BASIC FILE RELATED KEYWORKS •Byte:- It is the smallest addressable unit in computer. A byte is a set of 8 bits and represents a character. •Element:- It is a combination of one or more bytes. It is referred to as a field. A field is actually a physical space on tape or disk. A roll number, age, name of employee etc. are examples of it. File: - It is a collection of similar records. The records will have the same fields but different values in each record. The size of a file is limited by the size of memory available.
  • 6. BASIC FILE RELATED KEYWORDS •Database: - It is a set of interrelated files. The files in combination tend to link to a common solution. For example, a student attendance file, a student result file, a student admission file, etc. are related to academic software pertaining to students. •Record: - The elements related to are combined into a record. An employee has a record with his name, designation, basic pay, allowances, deductions etc. as its fields. A record may have a unique key to identify a record e.g. employee number. Records are represented as logical & physical records. A logical record maintains a logical relationship among all the data items in the record. It is the way the program or user sees the data. In contrast a physical record is the way data are recorded on a storage medium.
  • 7. FILE ORGANIZATION It is the methodology which is applied to structured computer files. Files contain computer records which can be documents or information which is stored in a certain way for later retrieval. File organization refers primarily to the logical arrangement of data in a file system. It should not be confused with the physical storage of the file in some types of storage media. There are certain basic types of computer file, which can include files stored as blocks of data and streams of data, where the information streams out of the file while it is being read until the end of the file is encountered.
  • 8. Methods of organizing files Different methods of organizing files- 1.Heap 2.Sequential 3.Indexed-sequential 4.Inverted list 5.Direct access
  • 9. Choosing a file organization is a design decision, hence it must be done having in mind the achievement of good performance with respect to the most likely usage of the file. The criteria usually considered important are: 1. Fast access to single record or collection of related records. 2. Easy record adding/update/removal, without disrupting . 3. Storage efficiency. 4. Redundancy as a warranty against data corruption.
  • 10. Heap files(unordered) Basically these files are unordered files. It is the simplest and most basic type. These files consist of randomly ordered records. The records will have no particular order. The operations we can perform on the records are insert, retrieve and delete. The features of the heap file or the pile file Organisation are: 1.New records can be inserted in any empty space that can accommodate them. 2.When old records are deleted, the occupied space becomes empty and available for any new insertion. 3.If updated records grow; they may need to be relocated (moved) to a new empty space. This needs to keep a list of empty space.
  • 11. Advantages and disadvantages Advantages 1.This is a simple file Organisation method. 2. Insertion is somehow efficient. 3. Good for bulk-loading data into a table. 4. Best if file scans are common or insertions are frequent. Disadvantages 1.Retrieval requires a linear search and is inefficient. 2. Deletion can result in unused space/need for reorganisation.
  • 12. Heap file organization In the below figure, we can see a sample of heap file organization for EMPLOYEE relation which consists of 8 records stored in 3 contiguous blocks, each blocks can contains at most 3 records.
  • 13. Sequential file organization Sequential file organization •Stored in key sequence. •Adding/deleting requires making new file. •Used as master file. •Records in these files can only be read or written sequentially.
  • 14. Sequential file organization •Records are also in sequence within each block. To access a record, previous records within the block are scanned. Thus sequential record design is best suited for “get next” activities, reading one record after another without a search delay. •records can be added only at the end of the file.
  • 15. Advantages and disadvantages ADVANTAGES Simple file design Very efficient when most of the records must be processed e.g. Payroll Very efficient if the data has a natural order Can be stored on inexpensive devices like magnetic tape. DISADVANTAGES Entire file must be processed even if a single record is to be searched. Transactions have to be sorted before processing Overall processing is slow.
  • 16. Indexed-sequential organization Each record of a file has a key field which uniquely identifies that record. An index consists of keys and addresses. An indexed sequential file is a sequential file (i.e. sorted into order of a key field) which has an index. A full index to a file is one in which there is an entry for every record. When a record is inserted or deleted in a file the data can be added at any location in the data file. Each index must also be updated to reflect the change. For a simple sequential index this may mean rewriting the index for each insertion.
  • 21. Indexed-sequential organization Indexed sequential files are important for applications where data needs to be accessed..... Sequentially randomly using the index. An indexed sequential file can only be stored on a random access device e.g. magnetic disc, CD.
  • 22. ADVANTAGES AND DISADVANTAGES Advantages Provides flexibility for users who need both type of accesses with the same file. Faster than sequential. Disadvantages Extra storage space for the index is required
  • 23. Inverted list organization Like the indexed-sequential storage method, the inverted list organization maintains an index. The two methods differ, however, in the index level and record storage. The indexed- sequential method has a multiple index for a given key, whereas the inverted list method has a single index for each key type. The records are not necessarily stored in a sequence. They are placed in the are data storage area, but indexes are updated for the record keys and location.
  • 24. ADVANTAGES AND DISADVANTAGES Advantages The benefits are apparent immediately because searching is fast disadvantages inverted list files use more media space and the storage devices get full quickly with this type of organization. updating is much slower.
  • 25. Direct/random file organization Records are read directly from or written on to the file. The records are stored at known address. Address is calculated by applying a mathematical function to the key field. A random file would have to be stored on a direct access backing storage medium e.g. magnetic disc, CD, DVD Example : Any information retrieval system. Eg Train timetable system.
  • 26. Advantages and disadvantages Advantages Any record can be directly accessed. Speed of record processing is very fast. Up-to-date file because of online updating. Concurrent processing is possible.  Transactions need not be sorted. Disadvantages More complex than sequential. Does not fully use memory locations. More security and backup problems.  Expensive hardware and software are required.  System design is complex and costly.  File updation is more difficult as compared to sequential files.
  • 28. Quiz 1.Different types of files are a)Master Transaction Backup b)Archive Table Report c)Dump Library 2. Major criteria for selecting a File organization are 1. Method of processing of file 2. Size of data 3. File inquiry capability 4. File volatility 5. Response time 6. Activity ratio
  • 29. Quiz 3.What is file organization? 4.What are advantages of sequential file organization? 5. True or false (indexed sequential file) The data can be added at any location in the file. 6.Give an example of direct file organization? 7.Give one advantage and disadvantage of direct file organization?