| Threat Ensemble Vulnerability Assessment: CANARY, Event Detection Software |
Anomalous water quality results can indicate the onset of a contamination incident. For this reason, water utilities must be able to rapidly and accurately identify such results in water distribution systems. Unfortunately, this is a difficult task because water quality varies significantly from location to location and over time. Software that can interpret water quality data in real time can greatly enhance detection. Hence, the EPA has partnered with Sandia National Laboratories to develop CANARY data analysis software to assist water utilities with detecting contamination.
CANARY software has been piloted using data from two drinking water systems. In 2007, it was deployed at a Water Security Initiative pilot utility, where it is operating in real time. Based on data analysis performed by CANARY, the pilot utility has incorporated new guidelines into its emergency response and consequence management plans.
 The figure shows water quality data in a distribution system over a period of 12 hours. Near hour 8, a contamination incident has occurred, and the changes in the water quality signals are evident. |
CANARY Software Information and Availability
CANARY software evaluates standard water quality data (e.g., free chlorine, pH, total organic carbon) over time, and uses mathematical and statistical techniques to identify the onset of anomalous water quality incidents. Before using CANARY for the first time, historical utility data must be used to determine the natural variation of these water quality parameters. This allows the water utility to adapt CANARY to work accurately at multiple locations within the water distribution system and helps utility operators to understand the expected false alarm rates associated with CANARY and contamination incident detection.
The CANARY software allows for:
- The use of a standard data format for input and output of water quality and operations data
- The ability to select different detection algorithms (CANARY contains three different mathematical approaches for analyzing the data)
- The ability to select various water utility and location-specific configuration options
- An online operations mode and an offline evaluation mode
- The ability to generate data needed to establish performance metrics (e.g., false alarm rates).
CANARY assists water utilities with understanding the significance of large volumes of water quality data. It can automatically review incoming data regarding anomalous conditions and alert the water utility if further action is required. Not only can CANARY detect anomalous conditions resulting from contamination incidents, but it can detect unexpected "normal" events, such as a sensor malfunction or a pipe break.
CANARY is a Windows program that runs on nearly any computer. It is written in the MATLAB® m-code language, but also employs some Java programming language. The typical user will not need to know MATLAB®; however, modifications to the source code require knowledge of MATLAB® and/or Java.
The CANARY software is expected to be available to a wider audience in 2008. It is not intended to replace commercially available software, but rather to supplement existing software and motivate commercial development of similar products. It is hoped that the release of CANARY will also motivate additional research and development in this field.
Future CANARY Updates
Several research directions are currently being pursued to improve CANARY.
- An automated configuring approach within a water distribution system is being developed. This approach will use a set of "normal" historical data from a water distribution system and automatically find a set of "good" parameters to be used in the configuration process. Because event detection software has an inherent tradeoff between detection sensitivity and the false alarm rate, each user must eventually decide what the best parameters will be for a specific use of CANARY.
- Researchers are investigating different ways to interface with water utilities' Supervisory Control and Data Acquisition (SCADA) systems to improve the performance of CANARY. SCADA refers to a large-scale, distributed measurement and control system that controls components such as pumps and valves, and records data from sensors.
- Researchers are pursuing methods to use information collected simultaneously from multiple locations (rather than each location separately) to increase the sensitivity of detecting contamination incidents. By incorporating information from several sensor stations, along with information about hydraulic connectivity, CANARY may be able to more accurately and rapidly detect contamination incidents.
For more information, visit the National Home Security Research Center website at http://www.epa.gov/nhsrc and the TEVA Research Program site at http://www.epa.gov/nhsrc/water/dw/teva.html.
|
Return to the
Spring 2008
Midwest Region Newsletter
Table of Contents |
|
|