Manchester Metropolitan University's Research Repository

    Automated Model Construction for Combined Sewer Overflow Prediction Based on Efficient LASSO Algorithm

    Zhao, Wanqing ORCID logoORCID: https://orcid.org/0000-0001-6160-9547, Beach, Thomas H and Rezgui, Yacine (2019) Automated Model Construction for Combined Sewer Overflow Prediction Based on Efficient LASSO Algorithm. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 49 (6). pp. 1254-1269. ISSN 2168-2216

    Published Version
    Available under License Creative Commons Attribution.

    Download (1MB) | Preview


    The prediction of combined sewer overflow (CSO) operation in urban environments presents a challenging task for water utilities. The operation of CSOs (most often in heavy rainfall conditions) prevents houses and businesses from flooding. However, sometimes, CSOs do not operate as they should, potentially bringing environmental pollution risks. Therefore, CSOs should be appropriately managed by water utilities, highlighting the need for adapted decision support systems. This paper proposes an automated CSO predictive model construction methodology using field monitoring data, as a substitute for the commonly established hydrological-hydraulic modeling approach for time-series prediction of CSO statuses. It is a systematic methodology factoring in all monitored field variables to construct time-series dependencies for CSO statuses. The model construction process is largely automated with little human intervention, and the pertinent variables together with their associated time lags for every CSO are holistically and automatically generated. A fast least absolute shrinkage and selection operator solution generating scheme is proposed to expedite the model construction process, where matrix inversions are effectively eliminated. The whole algorithm works in a stepwise manner, invoking either an incremental or decremental movement for including or excluding one model regressor into, or from, the predictive model at every step. The computational complexity is thereby analyzed with the pseudo code provided. Actual experimental results from both single-step ahead (i.e., 15 min) and multistep ahead predictions are finally produced and analyzed on a U.K. pilot area with various types of monitoring data made available, demonstrating the efficiency and effectiveness of the proposed approach.

    Impact and Reach


    Activity Overview
    6 month trend
    6 month trend

    Additional statistics for this dataset are available via IRStats2.


    Actions (login required)

    View Item View Item