"All Good to Great companies began the process of finding a path to greatness by confronting the brutal facts about the reality of their business. When you start with an honest and diligent effort to determine the truth of your situation, the right decisions often become self-evident.”— Jim Collins
In our last blog, we discussed the first step in Casne’s 7 Step Solution: The Alarm Philosophy. Consider the Alarm Philosophy document as the destination point on a roadmap. The benchmarking activity is the time to plan the journey. First, we need to know the starting point. While some discussions during the Alarm Philosophy point out the areas of need, benchmarking dials in on the specific pains of the current way of operations. Benchmarking takes a closer look at the current system, identifies the alarm performance metrics needed to achieve the objectives set forth in the Alarm Philosophy Document. When we discuss benchmarking, we will want to know the brutal reality of your business operations.
First, we will consider the current operator handling capacity and the alarm flux. Operator Handling Capacity is the total number of events your operations team can handle per unit of time, usually hour. It’s important to know how many alarm events your team can successfully handle. If you are having more alarms than your team can handle, then we will make a note to address this in our plan. An Alarm Flux is the average number of alarms issued per unit of time. While operator handling capacity looks at how many alarms your team can handle, alarm flux benchmarks the average number of alarms coming in during a set amount of time. Ultimately, we want the two benchmarks to be similar.
Next, we will look at the number of alarm floods and resulting actions missed during the floods. An alarm flood is a specific event that causes the system to be flooded with an unusual amount of alarms. An alarm flood is often due to a cascade effect from a single point of failure. When these occur, operators become detectives trying to find out what is the ultimate event that began the flood. When alarm flooding happens, operators begin responding to alarms with little to no severity information. As a result, not all alarms will be responded to in a timely manner. Those alarms that were not responded to are called actions missed.