Separating configuration from data lowers total cost of ownership

All software systems require configuration. Some require much more configuration that others. Configuration can be an overloaded term, but I’m speaking about the information necessary for the software to function correctly.

Without this information, the software is unstable and will not work. For most business systems, a common configuration item is the database connection string. Without this configuration item, the system cannot function. Other common configurations are the list of valid states or provinces for addresses. Often the local state is defaulted to the top as a shortcut for the users. Another example is a list of statuses, such as Pending, Active, Complete. As usual, requirements should drive the approach taken.

Let’s talk about the Status list first. Where should this configuration be managed? The simplest approach is to maintain the status list in the software itself. The tests around the software build will verify this configuration, and if the software has logic built around these values, the compiler verifies the important values are present. If the Status list changes less than once per week, this is a fine approach since a system under enhancement might have weekly deployments. (Note: Using agile development, systems are easy to deploy, and our current projects get deployed many times per day to multiple environments. Without an automated deployment, deployments are much more costly).

Let’s consider a list of states. For a sales system, we might have 5 states if process orders for these 5 regions. Our state list would have those 5 states with the local state defaulted to the top since it is the most common. Again, we start with storing this configuration with the codebase, especially since the system is unstable without the local state. If the list of states changes more frequently than weekly, we might consider wanting to change the list without requiring a deployment. This would lead us to store the configuration separate from the codebase. Options include putting the configuration in a file or the database. The benefit of the configuration residing in a file allows the configuration to change while the system is running. The system will have to be able to detect the change and reload the configuration, however.

Let’s take the requirements a bit further. Let’s say we want a screen in the system to let authorized users add a state to the state list at any time. They can also remove a state. We have to analyze if there is a gold list (i.e. a list of states that cannot be removed because there are business rules around those specific values). We might be tempted to put the state list in the database, but mixing this configuration with data is risky. First, we need a mechanism to validate the configuration. We need to have the application check the database table and ensure the “required” values are there. Values added in one installation might somehow need to be synced up if a new value is considered to be part of the “gold” list. Putting the list in data confuses data and configuration, and we have shyed away from it since the system is easier to implement and maintain if a new installation starts out with an empty database and functions properly. Then implementation tasks can make the system usable for the target environment.

Ultimately, the requirements drive the approach to configuration. Depending on the frequency of change, the number of unremoveable items, and other factors, you will decide where to store the configuration in a manner that minimizes total cost of ownership.

My recommended progressions of configuration locations are as follows. Specific requirements should be the driving factor that cause you to move configuration from one medium to the next costly alternative.

In the code
In a file
In a database
Some other system-specific external configuration store

There is no cut-and-dried answer for EVERY project, but the requirements should be what drives the decision. Along those lines, no one location is appropriate for EVERY piece of configuration. Depending on the nature of the configuration, it will be appropriate to choose a different storage medium.

Programming with Palermo

Jeffrey Palermo, Microsoft MVP, Author, Speaker, Clear Measure Chief Architect, Azure DevOps Expert

Separating configuration from data lowers total cost of ownership

Related

Related

Discover more from Programming with Palermo