Describe Characteristics of high-performing database
Analyze Logical Database Design using Real-life example
To illustrate the implications that logical design has for performance, let's consider a third normal form database design for placing orders for products. Management wants to know the counts of "State flower" for all orders that have been placed for "sunflower seeds." The output would look something like this:
How "Logical Database Design" impacts Performance
The logical design of a database can have a significant impact on its performance, as it determines how data is organized and accessed within the system. Here are some ways in which the logical design of a database can impact performance:
Data normalization: Normalization is the process of organizing data in a way that minimizes redundancy and dependency. While normalization can improve data integrity and consistency, it can also result in more complex queries and slower performance, particularly when working with large datasets.
Indexing: Indexing is the process of creating data structures that allow for faster retrieval of data. Properly indexing tables and columns can significantly improve query performance, while improperly indexed tables or columns can result in slow performance and increased storage requirements.
Joins: Joins are used to combine data from multiple tables in a query. However, complex joins involving many tables can be resource-intensive and slow down query performance, particularly when working with large datasets.
Data types: The choice of data types for fields can impact performance, particularly when working with large datasets. For example, using a larger data type than necessary can result in slower query performance and increased storage requirements.
Query complexity: Complex queries involving multiple joins, subqueries, or aggregation functions can be resource-intensive and slow down query performance. Optimizing queries and breaking them down into smaller, simpler queries can improve performance.
Table Structure: The structure of tables, such as the number of columns, the size of the rows, and the use of NULL values, can impact performance. Keeping table structures lean and optimized can improve query performance and reduce storage requirements.
The following series of images below will allow you to compare how different database designs will result in different SQL statements.
1) First, here is a very well done, non-redundant database design that will make efficient use of Oracle storage.
CUSTOMER Table
cust_nbr
cust_name
cust_street_address
cust_city
cust_zip_code
CITY Table
cust_city
cost_of_living
city_mascot
state_name
STATE Table
state_name
state_bird
state_flower
region_name
ORDER Table
order_nbr
order_date
cust_nbr
salesperson_name
SALESPERSON Table
salesperson_name
salesperson_address
job_title
city_assigned
ITEM Table
item_nbr
item_name
inventory_amount
list_price
QUANTITY Table
order_nbr
item_nbr
quantity_sold
Data Warehouse Table Design
"Extreme redundant database design" is commonly used with data warehouse tables.
In data warehouses, "wide" tables are created that contain a huge amount of redundant information. Because of the huge amount of data redundancy, these wide tables are very cumbersome to update, but they can be queried with remarkable speed. In our example, all of the other tables could be combined into the QUANTITY table, creating a single, very wide table that contains all of the order, customer, salesperson, and item information. From an Oracle perspective, a single-row fetch would access all of the line item information for each order, and complex queries would no longer require table joins.
Physical Design Next
Now that we have reviewed logical design, let's look at how "physical design" will impact performance.
The logical design of a database plays a critical role in determining its performance. By carefully considering
1) normalization, 2) indexing, 3) joins, 4) data types, 5) query complexity and 6) table structure, designers can create a database that is optimized for performance and able to handle large datasets with efficiency and speed.