Development of gradient boosting-assisted machine learning data-driven model for free chlorine residual prediction

  • Wiley Helm
  • , Shifa Zhong
  • , Elliot Reid
  • , Thomas Igou
  • , Yongsheng Chen*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

Chlorine-based disinfection is ubiquitous in conventional drinking water treatment (DWT) and serves to mitigate threats of acute microbial disease caused by pathogens that may be present in source water. An important index of disinfection efficiency is the free chlorine residual (FCR), a regulated disinfection parameter in the US that indirectly measures disinfectant power for prevention of microbial recontamination during DWT and distribution. This work demonstrates how machine learning (ML) can be implemented to improve FCR forecasting when supplied with water quality data from a real, full-scale chlorine disinfection system in Georgia, USA. More precisely, a gradient-boosting ML method (CatBoost) was developed from a full year of DWT plant-generated chlorine disinfection data, including water quality parameters (e.g., temperature, turbidity, pH) and operational process data (e.g., flowrates), to predict FCR. Four gradient-boosting models were implemented, with the highest performance achieving a coefficient of determination, R 2, of 0.937. Values that provide explanations using Shapley’s additive method were used to interpret the model’s results, uncovering that standard DWT operating parameters, although non-intuitive and theoretically non-causal, vastly improved prediction performance. These results provide a base case for data-driven DWT disinfection supervision and suggest process monitoring methods to provide better information to plant operators for implementation of safe chlorine dosing to maintain optimum FCR. [Figure not available: see fulltext.]

Original languageEnglish
Article number17
JournalFrontiers of Environmental Science and Engineering
Volume18
Issue number2
DOIs
StatePublished - Feb 2024

Keywords

  • Chlorination
  • Data-driven modeling
  • Disinfection
  • Drinking water treatment
  • Machine learning

Fingerprint

Dive into the research topics of 'Development of gradient boosting-assisted machine learning data-driven model for free chlorine residual prediction'. Together they form a unique fingerprint.

Cite this