Performance Tuning   «Prev  Next»
Lesson 5Implementing logical database performance design
ObjectiveDescribe Characteristics of high-performing database

Analyze Logical Database Design using Real-life example

To illustrate the implications that logical design has for performance, let's consider a third normal form database design for placing orders for products. Management wants to know the counts of "State flower" for all orders that have been placed for "sunflower seeds." The output would look something like this:

How "Logical Database Design" impacts Performance

The logical design of a database can have a significant impact on its performance, as it determines how data is organized and accessed within the system. Here are some ways in which the logical design of a database can impact performance:
  1. Data normalization: Normalization is the process of organizing data in a way that minimizes redundancy and dependency. While normalization can improve data integrity and consistency, it can also result in more complex queries and slower performance, particularly when working with large datasets.
  2. Indexing: Indexing is the process of creating data structures that allow for faster retrieval of data. Properly indexing tables and columns can significantly improve query performance, while improperly indexed tables or columns can result in slow performance and increased storage requirements.
  3. Joins: Joins are used to combine data from multiple tables in a query. However, complex joins involving many tables can be resource-intensive and slow down query performance, particularly when working with large datasets.
  4. Data types: The choice of data types for fields can impact performance, particularly when working with large datasets. For example, using a larger data type than necessary can result in slower query performance and increased storage requirements.
  5. Query complexity: Complex queries involving multiple joins, subqueries, or aggregation functions can be resource-intensive and slow down query performance. Optimizing queries and breaking them down into smaller, simpler queries can improve performance.
  6. Table Structure: The structure of tables, such as the number of columns, the size of the rows, and the use of NULL values, can impact performance. Keeping table structures lean and optimized can improve query performance and reduce storage requirements.

State_flower   count(*)
------------   --------
Begonia        433,233
Pansy          121,455
Tulip           44,233
Zenia            3,245

The following series of images below will allow you to compare how different database designs will result in different SQL statements.

First, here is a very well done, non-redundant database design that will make efficient use of Oracle storage.
1) First, here is a very well done, non-redundant database design that will make efficient use of Oracle storage.
CUSTOMER Table
  • cust_nbr
  • cust_name
  • cust_street_address
  • cust_city
  • cust_zip_code
CITY Table
  • cust_city
  • cost_of_living
  • city_mascot
  • state_name
STATE Table
  • state_name
  • state_bird
  • state_flower
  • region_name
ORDER Table
  • order_nbr
  • order_date
  • cust_nbr
  • salesperson_name
SALESPERSON Table
  • salesperson_name
  • salesperson_address
  • job_title
  • city_assigned
ITEM Table
  • item_nbr
  • item_name
  • inventory_amount
  • list_price
QUANTITY Table
  • order_nbr
  • item_nbr
  • quantity_sold
 


SQL Code:
select state_flower, count(*)
from
    state,
    city,
    customer,
    order,
    quantity,
    item
where
    item_name = 'Flower Seed'
    and item.item_nbr = quantity.item_nbr
    and quantity.order_nbr = order.order_nbr
    and order.cust_nbr = customer.cust_nbr
    and customer.cust_city = city.cust_city
    and city.state_name = state.state_name
group by state_flower;
"Here is the SQL statement that would be used to display the data. Notice how this design will cause excessive table joins. This simple query requires Oracle to join five tables together. This query is not only hard to write, but it will execute very slowly because of all of the tables that must be joined together."

Here we see that the customer table now has data that has once resided in the CITY and STATE and tables. Now let’s take a look at the identical SQL for displaying the counts for state flowers.
3) Here we see that the customer table now has data that has once resided in the CITY and STATE and tables. Now let’s take a look at the identical SQL for displaying the counts for state flowers.

Here we have eliminated two table joins and greatly simplified both the complexity of the query as well as the number of SQL join operations.
4) Here we have eliminated two table joins and greatly simplified both the complexity of the query as well as the number of SQL join operations.


Data Warehouse Table Design

"Extreme redundant database design" is commonly used with data warehouse tables. In data warehouses, "wide" tables are created that contain a huge amount of redundant information. Because of the huge amount of data redundancy, these wide tables are very cumbersome to update, but they can be queried with remarkable speed. In our example, all of the other tables could be combined into the QUANTITY table, creating a single, very wide table that contains all of the order, customer, salesperson, and item information. From an Oracle perspective, a single-row fetch would access all of the line item information for each order, and complex queries would no longer require table joins.

Physical Design Next

Now that we have reviewed logical design, let's look at how "physical design" will impact performance. The logical design of a database plays a critical role in determining its performance. By carefully considering 1) normalization, 2) indexing, 3) joins, 4) data types, 5) query complexity and 6) table structure, designers can create a database that is optimized for performance and able to handle large datasets with efficiency and speed.

Denormalizing Database - Exercise

Click the exercise link below to try your hand at redesigning a database to optimize performance.
Denormalizing Database - Exercise

SEMrush Software Target 5SEMrush Software Banner 5