PERANCANGAN SISTEM INFORMASI
Session 8 Data Storage Design
Based on System Analysis & Design 2
nd
Edition
Authors : Alan Dennis & Barbara Haley Wixom
Publisher : John Wiley & Sons
Session 8 Data Storage Design
Session 8 Data Storage Design
Based on System Analysis & Design 2
nd
Edition
Authors : Alan Dennis & Barbara Haley Wixom
2
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Objectives
Objectives
Become familiar with several file and database
formats
Understand several goals of data storage
Be able to optimize a relational database for
data storage and data access
Become familiar with indexes
Key Definitions
Key Definitions
The
data storage function
manages how data is
stored and handled by programs that run the
system
Goals of data storage design
4
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Types of Data Storage Formats
Types of Data Storage Formats
Files:
electronic lists of data optimized to
perform a particular transaction
Database:
a collection of groupings of
information the relate to each other in some
way.
A
Database Management System (DBMS)
is
6
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Appointment File
Appointment Database
8
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
File Attributes
File Attributes
Files contain information formatted for a
particular transaction
Typically organized sequentially
Pointers used to associate records with other
records
File Types
File Types
Master files
store core, important information
Look-up files
store static values
Transaction files
store information that updates a master file
Audit files
record before and after versions of data
History (archive) files
1 0
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Database Types
Database Types
Legacy database
Hierarchical (depict parent-child relationships using
inverted trees)
Network (depict nonhierarchical associations using
pointers)
Relational database
Object database
Hierarchical Database Example
1 2
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Network Database Example
Relational Database Concepts
Relational Database Concepts
Popular; easy for developers to use
Primary
and
foreign keys
used to identify and
link tables
Referential integrity
ensures correct and valid
table synchronization
Structured Query Language (
SQL
)- standard
1 4
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Relational Database Example
Object Database Concepts
Object Database Concepts
Built around
objects
consisting of both data
and processes
Objects are
encapsulated
(self-contained)
Object
classes
– major object categories
OODBMS
– used primarily for applications with
multimedia or complex data
Hybrid OODBMS
– both object and relational
1 6
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Object Database Example
Multidimensional Database Concepts
Multidimensional Database Concepts
Stores data for easy aggregation and
manipulation across many dimensions
Used for data warehouses and data marts
1 8
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Multidimensional Database Example
2 0
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Dimensions of Data Storage Optimization
Dimensions of Data Storage Optimization
Conflicting goals:
Storage efficiency
(minimizing storage space)
Speed of access
(minimizing time to retrieve
Storage Efficiency
Storage Efficiency
Minimize null values and redundancy
Reduce update anomalies
2 2
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Optimizing Data Storage Efficiency
Normalization Steps
2 4
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Optimizing Access Speed
Optimizing Access Speed
Techniques available to increase access speed
after optimizing for efficiency
Denormalization
Clustering
•
Intrafile
•
Interfile
Denormalization
Denormalization
Add redundancy back to data storage design to
reduce the number of joins performed in a
query
Ideal for frequently queried but rarely updated
data
Look-up tables
1:1 relationships
2 6
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Clustering
Clustering
Reduce the number of times storage must be
accessed by physically placing like records
close together.
Intrafile clustering
– similar records in a table are
stored together
Indexing
Indexing
A minitable that contains values from one or
more fields in a table and the location of the
values within the table
2 8
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Payment Type Index
Guidelines for Creating Indexes
Guidelines for Creating Indexes
Use indexes sparingly for transaction systems
Use many indexes to increase response times in
decision support systems
For each table
Create a unique index based on the primary key
Create an index based on the foreign key
3 0
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia
Volumetrics – Estimating Data Storage Size
Volumetrics – Estimating Data Storage Size
Raw data – sum of the average widths of all
fields in a table.
Calculate overhead requirements based on
DBMS vendor recommendations
Data Storage Size Estimator
3 2
Pow erPoint Course Mat er ial for SCELE Graduat e Program I nform at ion Technology
Fa cu lt y of Com put e r Scie n ce – Unive r sit y of I n don e sia