Hot, Warm and Cold Data Find a Home With Storage Groups
Your data has a temperature. In fact, it might have several. Knowing its temperature will help you know how to manage it. Multitemperature data management refers to the frequency of accessing data in storage. The classifications are often referred to as hot, warm and cold (see Table 1). Hot data is frequently accessed on faster storage, warm data is accessed less frequently and stored on slightly slower storage, and cold data is rarely accessed and stored on even slower storage.
Each type of storage has an associated cost, which is dependent on several factors, such as the type of storage or the size of the storage devices. Costs are also influenced by environmental factors, such as rack space, floor space, the amount of power required, the number of power supplies, and redundancy and recovery capabilities. Costs can further be impacted by additional features incorporated in your storage, such as the amount of memory cache and the use of certain algorithms to assist with performance, error checking or error correction.
Business requirements can help you decide what type of storage to use for different types of data where the temperature of the data is part of the decision-making process. One such possibility is using storage groups.
What Are Storage Groups?
A storage group is a set of storage paths that manage storage allocation for table spaces. Different storage technologies can be defined in different storage groups, thus allowing table spaces to be created using the most effective storage type based on business requirements, such as service-level objectives, recovery requirements (e.g., RAID definition, table recovery requirements, etc.) and cost. Some examples of storage group configurations that can be defined are:
- 144 1 TB drives, 7.5k rpm, RAID 1
- 128 500 GB drives, 10k rpm, RAID 5
- 5 500 GB of solid state drives, RAID 6
- 256 500 GB drives, 10k rpm, RAID 3 with disk replication
Storage Groups and Multitemperature Storage
Storage groups are an excellent fit for the implementation and management of a multitemperature storage allocation scheme. Figure 1 shows data that’s range partitioned in table spaces defined using the storage groups “hot,” “warm” and “cold”—each defined with a different type of storage.
You can have as few or as many storage groups with the same or different characteristics as you wish. Based on requirements, different types of storage—and therefore storage groups—will be best suited for different applications, workloads and business. For example:
- Privacy laws for storing medical data may require all medical data be stored in a storage group that supports disk encryption
- In an attempt to use the least expensive storage with acceptable write performance during Materialized Query Tables (MQTs) maintenance, all MQTs will be placed in table spaces that reside in that storage group, working on the assumption that read performance will be sufficient due to the intrinsic benefits of using MQTs because they’re usually aggregates of base table data and thus contain significantly less rows.
- The deep analytics team has a no-change data capture process and performs a full refresh of its tables nightly. As such, it has no business requirement for anything but RAID 1 as the recovery plan is simply to reload the data. However, it does require performance. One option being discussed for sustained performance is to use numerous, inexpensive 500 GB disks, therefore providing a sufficient number of spindles for disk performance.
comments powered by