Next Article in Journal
Climate Change Impact Assessment on Freshwater Inflow into the Small Aral Sea
Previous Article in Journal
Assessment of Changes in Annual Maximum Precipitations in the Iberian Peninsula under Climate Change
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Principal Component Weighted Index for Wastewater Quality Monitoring

Institute of Environmental Technology, VŠB-Technical University of Ostrava, 17. listopadu 2172/15, 70800 Ostrava-Poruba, Czech Republic
Water 2019, 11(11), 2376; https://doi.org/10.3390/w11112376
Submission received: 28 October 2019 / Revised: 7 November 2019 / Accepted: 8 November 2019 / Published: 13 November 2019
(This article belongs to the Section Wastewater Treatment and Reuse)

Abstract

:
The quality of raw and treated wastewater was evaluated using the principal component weighted index (PCWI) which was defined as a sum of principal component scores weighted according to their eigenvalues. For this purpose, five principal components (PCs) explaining 88% and 83% of the total variability of raw and treated wastewater samples, respectively, were extracted from 11 original physico-chemical parameters by robust principal component analysis (PCA). The PCWIs of raw and treated wastewater were analyzed in terms of their statistical distributions, temporal changes, mutual correlations, correlations with original parameters, and common water quality indexes (WQI). The PCWI allowed us to monitor temporal wastewater quality by one parameter instead of several. Unlike other weighted indexes, the PCWI is composed of independent variables with minimal information noise and objectively determined weights.

Graphical Abstract

1. Introduction

George E.P. Box has already expressed his skepticism of describing reality by mathematical models by his statement “All models are wrong, some are useful” [1]. In spite of this, various simple and easy-to-use composite indexes have been utilized to describe complex technical and scientific problems. For instance, there are social composite indexes [2,3], economic and well-being indexes [4,5], medical indexes [6,7], soil quality indexes [8,9], environmental quality [10,11,12], air quality indexes [13,14] and agriculture indexes [15], as well as water quality indexes [16,17,18,19,20,21,22,23,24,25,26,27,28,29,30] and so forth.
The composite indexes are based on a principle of the simple additive weighting (SAW) method combining independent criteria of which importance are expressed by their statistical weights [31]. These appropriate weights can be determined by subjective and objective methods. Subjective methods estimate the weights based on expert opinions and judgments of decision makers [32,33] or recommended standards [34]. A typical representation of this approach is the analytical hierarchy process (AHP) developed by Saaty [35]. Other methods are described in the literature [36,37]. The determination of objective weights is based on the application of various statistical measures, such as variably [38,39], correlation [40,41], and information content [42].
The above given requirements on the SAW model are in line with the basic properties of principal components created by principal component analysis (PCA): (i) the components are orthogonal and thus independent, and (ii) the components’ weights correspond to their eigenvalues. Therefore, principal components were already used to construct various composite indexes characterizing the socioeconomic situation [43], soil quality [11], environment assessment [12], and surface water quality [44].
The aim of this paper is to demonstrate the utilization of principal component weighted index (PCWI) for the monitoring of raw and treated wastewater quality, which has never been described in the literature. Wastewater quality is of high importance nowadays because it is associated with the release of a high quantity of contaminants such as dyes [45], pesticides and surfactants [46], heavy metals [47], etc. Such an index is assumed to be mathematically correct due to the use of independent variables (principal components) with minimal data noise and objective weights.

2. Materials and Methods

2.1. Data Collection

The wastewater samples were collected at an inlet and outlet of a mechanico-biological wastewater treatment plant (BWWTP) for the sake of regular monitoring of the treatment process. The BWWTP of type HYDROVIT 650-S (Vítkovice, Ostrava, Czech Republic) was designed for the treatment of common municipal wastewaters of 2100 population equivalents. The working area of the BWWTP was made up of a single overhead glass-fused to steel tank with a discontinuous cleaning process. A designed cleaning efficiency relating to biochemical oxygen demand (BOD) of 10–15 mg/L was 93%–95%, and an average flow of incoming waters was 650 m3/day. The incoming and outgoing water was collected manually once a month (during one hour on the same day). The basic statistics of the samples’ compositions are summarized in Table 1 and Table 2.
The 67 raw and treated wastewater samples were characterized by 11 physico-chemical parameters, such as BOD after 5 days, chemical oxygen demand by dichromate (COD), total phosphorus (TP), total nitrogen (TN), total suspended solids (TSS), total dissolved salts (TDS), pH, ammonium, nitrate, nitrite, and phosphate. Water analyses including sampling and preservation were performed according to ISO and EN standard procedures: EN 1899-1: 1998 (BOD), ISO 6060: 1989 (COD), EN ISO 6878: 2004 (TP and phosphate), EN 25663:1993 (TN), EN 872:1996 (TSS and TDS), ISO 10523:2008 (pH), ISO 7150-1:1984 (ammonium), ISO 7890-3:1988 (nitrate), and ISO 6777:1984 (nitrite). The spectrophotometric determination of ammonium, nitrate, nitrite and phosphate were performed using a UV–VIS spectrometer DR 4200 (HACH Company, Loveland, CO, USA). TDS and TSS were determined gravimetrically after samples had been filtered through 0.85 μm membrane filters. pH was determined using a device pH 197 (WTW, Weilheim, Germany).

2.2. Robust Principal Component Analysis

Principal component analysis looks for new latent variables of n samples, which are orthogonal (not correlated) to each other [48]. Each latent variable principal component is a linear combination of p variables xi and describes a different source of total variation
X = TWT + E = Data structure (model) + Noise
where X (n × m) is the data matrix, T (n × p) and W (m × p) are the matrixes of principal components scores and loadings, respectively, and E (n × m) is the residual matrix. Classical PCA can be performed by the eigenvalue decomposition of a correlation matrix. Robust PCA was performed by the eigenvalue decomposition of a correlation matrix converted from an estimated covariance matrix with the lowest possible determinant computed using the minimum covariance determinant (MCD) algorithm [49,50]. The covariance matrix was computed using a subroutine (mcdcov) in MATLAB (see below). The MCD estimator is considered to be a highly robust estimator of multivariate location and scatter.

2.3. Principal Component Weighted Index

The principal component weighted index was defined in consistency with the SAW model as
P C W I = k = 1 q u k t k = k = 1 q u k j = 1 m w j k x j
where uk stands for the weight of k-th PC computed as
u k = λ k k = 1 q λ k
and where λk is the eigenvalue of k-th PC and q is the number of selected principal components.

2.4. Statistical Calculations

The original data matrixes of 67 wastewater samples were set up and processed in MS Excel. Statistical calculations were performed using the software packages QC.Expert (Trilobyte, Pardubice, Czech Republic) and XLSTAT 2018 (Addinsoft, Boston, MA, USA). The data smoothing was performed by the fast Fourier transform (FFT) algorithm in the program OriginPro 9.0.0. (Origin Corporation, Northampton, MA, USA). The data were also standardized in order for us to avoid misclassifications arising from different orders of magnitude of variables. For this purpose, the data was mean (μ) centered and scaled by standard deviations (σ) as (x − μ)/σ. The statistical calculations were performed at the α = 0.05 significance level.

3. Results and Discussion

3.1. Principal Component Analysis

Robust PCA was performed due to non-normal distributions of the physico-chemical parameters characterizing the wastewaters composition (see Table 1 and Table 2). The eigenvalue decomposition of covariance matrixes with the lowest possible determinant was computed using the MCD algorithm. Based on the PCA results, the wastewater samples were characterized by a few first PCs and relationships between original parameters were discussed.
There is no universal rule for the estimation of a number of PCs. The first five principal components for both raw and treated wastewater were selected according to the magnitudes of corresponding eigenvalues, which should be equal to or higher than 1 [51], and according to their scree plots [52]. The eigenvalue scree and cumulative variability plots are demonstrated in Figure S1. In both cases, the selected PCs explained 88% and 83% of the total variability of raw and treated wastewater, respectively. It agrees with another traditional common rule that the cumulative proportion of variance could be explained by at least 80% [53].

3.2. Interpretation of Selected Principal Components

PCA often includes the interpretation of PCs which is necessary to understand the data structure. The component loadings summarized in Table 3 and Table 4 can explain relationships among the original variables (parameters). In the case of the raw wastewater, the 1st principal component (PC1) was saturated mainly by ammonium, TN, BOD, COD, phosphate, and TP. All these parameters characterize organic and inorganic compounds occurring in municipal wastewater. The 2nd principal component (PC2) was affected by nitrate and nitrite resulting from nitrification processes in raw wastewater. The 3rd principal component (PC3) was affected mostly by TSS and TDS, the 4th principal component (PC4) by BOD and TDS, and 5th component (PC5) by pH.
In the case of the treated wastewater, PC1 was mainly affected by nitrate, phosphate, and also by TN and TP, that is, by nutrients which went through the aerobic part of an activation tank. PC2 was affected by BOD and COD, characterizing the content of organic compounds which were persistent to the treatment process. PC3 was mainly saturated by ammonium and pH, indicating the presence of un-oxidized ammonium under acidic conditions during nitrification process as follows
2NH4+ + 3O2 → 2NO2 + 4H+ + 2H2O
2NO2 + O2 → 2NO3
PC4 and PC5 were saturated by TDS (PC4), nitrate, and TSS (PC5) which were of low concentrations and thus contributed to the less important PCs.

3.3. Principal Component Weighted Index

Considering the fact that scores of individual PCs have different variability depending on their eigenvalues, both PCWIs were composed of the five weighted PCs and plotted in Figure 1. Since the samples were taken approximately monthly, the vertical axis (Sample) also presents the time axis. The plots were also smoothed by the FFT procedure (red and blue curves) for us to clearly see the temporal wastewater quality changes. Approximately six-month cycles were observed and also confirmed by the time series analysis performed using the seasonal autoregressive integrated moving average model (SARIMA). The PCWI values corresponding to the raw wastewater were slowly elevated in time. During the first 48 months they were lower than those of treated wastewater and then became higher. The reason is that new buildings were connected to a local sewage system and wastewater pollution increased. The PCWI of treated wastewater continually elevated up to the 45th month and then changed, similar to the case of the raw wastewater. The similarities between both PCWI values are discussed below.

3.4. Validation of PCWI

The normal distributions of the raw and treated wastewater PCWIs were confirmed by several common statistical tests, such as the Kolmogorov–Smirnov test (p = 0.523 and p = 0.741, respectively), the Shapiro–Wilk test (p = 0.506 and p = 0.915), Anderson–Darling test (p = 0.415 and p = 0.863), and Jarque–Bera test (p = 0.730 and p = 0.811). Their normality was also documented by their skewness and kurtosis of 0.153 and 0.107, resp. 2.64 and 2.64 for the raw and treated wastewater. Their standard deviations of 2.992 and 1.437 for raw and treated wastewaters, respectively, were consistent with the summary statistics given in Table 1 and Table 2.
Although the normality was confirmed, the PCWIs were further analyzed by means of the Gaussian mixture modelling using the iterative EM algorithm [54]. The number of mixtures (groups) was determined according to the Bayesian information criterion, the Akaike information criterion, the Integrate complete likelihood, and the Normalized entropy criterion. Figure 2 displays the PCWI density functions corresponding to the raw (Figure 2a) and treated wastewater samples (Figure 2b) separated into two smaller groups. One group of the raw wastewater samples consisted of the samples 1–34 with the lower PCWIs and the second group consisted of the samples 35–67 with the higher PCWIs in agreement with their temporal changes mentioned above. A similar temporal effect was found for the PCWI values of treated wastewaters. One group was mostly composed of the samples 1–45 and the second one of the samples 46–67. Such temporal behavior of the PCWIs was already demonstrated in Figure 1.
The similarity between the PCWI values of raw and treated wastewater was also confirmed by Pearson’s and Spearman’s correlation coefficients of 0.761 and 0.743, respectively, indicating their significant correlation demonstrated in Figure 3. The monitored BWWTP treated municipal wastewater was coming from households and non-industrial institutions which is why no unexpected pollution was supposed.
However, five samples 17, 22, 35, 51, and 62 lying out of the confidence ellipse indicated some deviations from the steady-state treatment process. For example, the sample 35 was characterized by the high PCWIs corresponding to both raw and treated wastewater. This demonstrates a situation when the raw wastewater was treated very effectively in terms of BOD and COD but not in terms of TN and TDS. The concentrations of nitrite and TDS were too high in the raw wastewater and the concentrations of ammonium, TN, BOD, and COD were too high in the treated wastewater, likely due to some problems in the treatment technology. The physico-chemical parameters of these outlying samples are summarized in Table S1. Individual parameters were assessed by means of the Box and Whisker plots.
There is no “gold” standard composite index which could serve for the PCWI verification. Therefore, the PCWIs were verified based on their relationships with the individual physico-chemical parameters. One example concerning COD is shown in Figure 4. COD is the common parameter used for characterization of the total content of organic and inorganic compounds which can be oxidized by potassium dichromate. Since the COD values in raw and treated wastewaters are very different, their standardized ones were plotted in one graph. The temporal changes of the standardized CODs were similar to those of the PCWI values. The six-month periods, as well as their elevation concerning the raw wastewaters, were observed. The probable reason was already mentioned in case of the PCWI.
The similarities between the PCWIs and the original parameters were also documented by their correlation coefficients summarized in Tables S2 and S3. In the case of the raw wastewater, the PCWI significantly correlated with the parameters except nitrite, pH, and TDS. Nitrite was of low concentration, the pH changed very little, and the TDS changed independently on all the parameters except BOD. In case of the treated wastewater, the correlations were also insignificant for the parameters occurring in low concentrations, such as ammonium, BOD, nitrite, and TSS. The low concentrations of BOD corresponded to the high treatment efficiency of 95% declared by the BWWTP designer.

Comparison of PCWI with WQI

The validation of PCWI was also performed by its comparison with commonly used water quality index (WQI) which is defined as follows
W Q I = i = 1 n C i P i i = 1 n P i
where Ci and Pi are the normalized values and relative weights assigned to each parameter i. The normalization was performed by dividing the values of each parameter by its maximal one. The relative weights ranged from 1 to 4 according to their importance for an aquatic system, which means that they are subjective: Pi = 4 for TSS; Pi = 3 for NH4+, BOD, and COD; Pi = 2 for TDS, TN, NO3, and NO2; Pi = 1 for TP, PO43−, and pH. They were adopted from several papers dealing with assessment of surface waters [23,27,30]. Figure 5 shows the correlation of PCWI and WQI, indicating a good agreement between both indexes. It is possible to emphasize that, unlike WQI, PCWI works with objective weights computed for particular water composition.
These relative weights were also used for computing WQI concerning the treated wastewater but the correlation was very weak. The weights were likely not appropriate for this type of water, but the more suitable ones were not found in the literature.
The normal distribution of WQI was also confirmed by the Kolmogorov–Smirnov test (p = 0.607), Shapiro–Wilk test (p = 0.266), Anderson–Darling test (p = 0.372), and Jarque–Bera test (p = 0.694). The normality was also documented by a skewness of 0.212 and a kurtosis of 2.71. Sensitivity of PCWI and WQI to detect outlying observations was compared using the Grubbs test. The Z-scores of PCWI and WQI were plotted for all samples and those above and/or below +2 and −2 were detected as outliers. Figure 6 shows that using the PCWI Z-scores five samples (47, 51, 61, 62, and 64) were detected, and using the WQI ones only two samples (35 and 51) were identified. These results demonstrate that PCWI is more sensitive for the identification of anomalies in water composition.

3.5. Examples of Possible PCWI Applications

As already mentioned, the PCWI has the potential to describe wastewater quality depending on time. The first example of a possible application concerns the evaluation of seasonal raw wastewater quality in various years as displayed in Figure 7.
A non-parametric Kruskal–Wallis test was performed to compare the PCWIs during 1997–2001. The computed p-values summarized in Table 5 indicate that the wastewater quality in 1997 and 2001 (p = 0.031) was different. It was confirmed by the median concentrations of ammonium, TN, and COD (see Table S4).
The second example concerns the evaluation of the PCWI values according to warning and control limits in analogy to the Shewhart control charts [55,56]. Upper (UWL) and lower warning limits (LWL), and upper (UCL) and lower control limits (UCL) were computed as μ ± 2σ and μ ± 3σ, respectively (see Table 6). A majority of applications of the weighted indexes uses linear scales such as 0–25 excellent, 26–50 good, 51–75 poor, 76–100 very poor, >100 unsuitable [23,24,57]. The suggested ranks were based on the limits specific for this study.
The number N of samples in the individual ranks are listed in Table 6. The PCWIs between UCL and UWL (rank I) as well as between LWI and LCL (rank IV) signify significant deviations from the steady-state treatment process: five samples (7.5%) of raw wastewater and four samples (6.0%) of treated wastewater. Such PCWIs ranking could be useful for operators to simply check raw and treated wastewater quality and to control working conditions on BWWTPs.
In general, the composite indexes are not supposed to be of universal validity and ability to describe reality in detail because one parameter cannot substitute a variety of variables [17,21]. Despite this, the PCWI can be employed as a useful indicator, providing overall information about water quality depending on the type of wastewater and temporal (seasonal) effects.

4. Conclusions

The wastewater quality before and after treatment was characterized by the principal component weighted index constructed as the sum of weighted PCs scores. The robust PCA of the 67 raw and treated wastewater samples extracted five principal PCs explaining 88%, resp. 83% of the total data variability. Based on the PCs loadings, the relationships among the original parameters were discussed. The PCWIs plots were constructed to show the temporal water quality changes. The six-month PCWIs cycles were identified. Using the Gaussian mixture modelling the PCWI values were separated into two groups of samples in agreement with the PCWI temporal plots. The PCWIs scatter plot identified the samples that deviated from the steady-state treatment. PCWI and WQI computed for the raw wastewater were compared and found to be in good agreement.
The possible application of PCWI for the raw wastewater quality monitoring was demonstrated by the evaluation of wastewater quality during 1997–2001 using a non-parametric Kruskal–Wallis test. The years 1997 and 2001 were found to be different which was explained comparing the median concentration of ammonium, BOD, and COD. The PCWI application in analogy with the Shewhart warning and control limits was also demonstrated. The PCWI was found to be used for the overall characterization of wastewater quality, especially from the temporal point of view.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4441/11/11/2376/s1, Figure S1: Eigenvalue scree and cumulative variability plots for raw and treated wastewaters, Table S1: Composition of outlaying samples, Table S2: Spearman’s correlation matrix of original parameters and PCWI of raw wastewater, Table S3: Spearman’s correlation matrix of original parameters and PCWI of treated wastewater, Table S4: Medians of raw wastewater parameters in 1997–2001.

Funding

This work was financially supported by the projects “Institute of Environmental Technology—Excellent Research” (CZ.02.1.01/0.0/0.0/16_019/0000853) and Large Research Infrastructure ENREGAT (project No. LM 2018098) provided by the Ministry of Education, Youth and Sports of the Czech Republic. The author has no conflict of interest to declare.

Conflicts of Interest

The author declares no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Box, G.E.P. Robustness in the Strategy of Scientific Model Building. In Robustness in Statistics; Launer, R.L., Wilkinson, G.N., Eds.; Academic Press: Cambridge, MA, USA, 1979; pp. 201–236. [Google Scholar] [CrossRef]
  2. Schmidtlein, M.C.; Deutsch, R.C.; Piegorsch, W.W.; Cutter, S.L. A sensitivity analysis of the social vulnerability index. Risk Anal. 2008, 28, 1099–1114. [Google Scholar] [CrossRef] [PubMed]
  3. Yoon, D.K. Assessment of social vulnerability to natural disasters: A comparative study. Nat. Hazards 2012, 63, 823–843. [Google Scholar] [CrossRef]
  4. Jefmański, B.; Błoński, K. Composite Index of Local Government Employees Satisfaction. Folia Oeconomica Stetin. 2014, 13, 77–95. [Google Scholar] [CrossRef]
  5. McGillivray, M.; Noorbakhsh, F. Composite Indexes of Human Well-being: Past, Present and Future. In Human Well-Being: Concept and Measurement; McGillivray, M., Ed.; Palgrave Macmillan UK: London, UK, 2007; pp. 113–134. [Google Scholar] [CrossRef]
  6. Berni, A.; Giuliani, A.; Tartaglia, F.; Tromba, L.; Sgueglia, M.; Blasi, S.; Russo, G. Effect of vascular risk factors on increase in carotid and femoral intima-media thickness. Identification of a risk scale. Atherosclerosis 2011, 216, 109–114. [Google Scholar] [CrossRef]
  7. Jones, R.C.; Donaldson, G.C.; Chavannes, N.H.; Kida, K.; Dickson-Spillmann, M.; Harding, S.; Wedzicha, J.A.; Price, D.; Hyland, M.E. Derivation and validation of a composite index of severity in chronic obstructive pulmonary disease: The DOSE Index. Am. J. Respir. Crit. Care Med. 2009, 180, 1189–1195. [Google Scholar] [CrossRef]
  8. Li, P.; Zhang, T.; Wang, X.; Yu, D. Development of biological soil quality indicator system for subtropical China. Soil Tillage Res. 2013, 126, 112–118. [Google Scholar] [CrossRef]
  9. Yu, P.; Liu, S.; Zhang, L.; Li, Q.; Zhou, D. Selecting the minimum data set and quantitative soil quality indexing of alkaline soils under different land uses in northeastern China. Sci. Total Environ. 2018, 616–617, 564–571. [Google Scholar] [CrossRef]
  10. Sands, G.R.; Podmore, T.H. A generalized environmental sustainability index for agricultural systems. Agric. Ecosyst. Environ. 2000, 79, 29–41. [Google Scholar] [CrossRef]
  11. Shi, Z.-H.; Chen, L.-D.; Hao, J.-P.; Wang, T.-W.; Cai, C.-F. The effects of land use change on environmental quality in the red soil hilly region, China: A case study in Xianning County. Environ. Monit. Assess. 2008, 150, 295. [Google Scholar] [CrossRef]
  12. Zou, T.; Yoshino, K. Environmental vulnerability evaluation using a spatial principal components approach in the Daxing’anling region, China. Ecol. Indic. 2017, 78, 405–415. [Google Scholar] [CrossRef]
  13. Kyrkilis, G.; Chaloulakou, A.; Kassomenos, P.A. Development of an aggregate Air Quality Index for an urban Mediterranean agglomeration: Relation to potential health effects. Environ. Int. 2007, 33, 670–676. [Google Scholar] [CrossRef] [PubMed]
  14. Thach, T.-Q.; Tsang, H.; Cao, P.; Ho, L.-M. A novel method to construct an air quality index based on air pollution profiles. Int. J. Hyg. Environ. Health 2018, 221, 17–26. [Google Scholar] [CrossRef] [PubMed]
  15. Dantsis, T.; Douma, C.; Giourga, C.; Loumou, A.; Polychronaki, E.A. A methodological approach to assess and compare the sustainability level of agricultural plant production systems. Ecol. Indic. 2010, 10, 256–263. [Google Scholar] [CrossRef]
  16. Al-Mutairi, N.; AbaHussain, A.; El-Battay, A. Spatial assessment of monitoring network in coastal waters: A case study of Kuwait Bay. Environ. Monit. Assess. 2015, 187, 621. [Google Scholar] [CrossRef]
  17. Gitau, M.W.; Chen, J.; Ma, Z. Water Quality Indices as Tools for Decision Making and Management. Water Resour. Manag. 2016, 30, 2591–2610. [Google Scholar] [CrossRef]
  18. Hoseinzadeh, E.; Khorsandi, H.; Wei, C.; Alipour, M. Evaluation of Aydughmush River water quality using the National Sanitation Foundation Water Quality Index (NSFWQI), River Pollution Index (RPI), and Forestry Water Quality Index (FWQI). Desalin. Water Treat. 2015, 54, 2994–3002. [Google Scholar] [CrossRef]
  19. Lobato, T.C.; Hauser-Davis, R.A.; Oliveira, T.F.; Silveira, A.M.; Silva, H.A.N.; Tavares, M.R.M.; Saraiva, A.C.F. Construction of a novel water quality index and quality indicator for reservoir water quality evaluation: A case study in the Amazon region. J. Hydrol. 2015, 522, 674–683. [Google Scholar] [CrossRef]
  20. Medeiros, A.C.; Faial, K.R.F.; do Carmo Freitas Faial, K.; da Silva Lopes, I.D.; de Oliveira Lima, M.; Guimaraes, R.M.; Mendonca, N.M. Quality index of the surface water of Amazonian rivers in industrial areas in Para, Brazil. Mar. Pollut. Bull. 2017, 123, 156–164. [Google Scholar] [CrossRef]
  21. Rickwood, C.J.; Carr, G.M. Development and sensitivity analysis of a global drinking water quality index. Environ. Monit. Assess. 2008, 156, 73. [Google Scholar] [CrossRef]
  22. Sun, W.; Xia, C.; Xu, M.; Guo, J.; Sun, G. Application of modified water quality indices as indicators to assess the spatial and temporal trends of water quality in the Dongjiang River. Ecol. Indic. 2016, 66, 306–312. [Google Scholar] [CrossRef]
  23. Tomas, D.; Čurlin, M.; Marić, A.S. Assessing the surface water status in Pannonian ecoregion by the water quality index model. Ecol. Indic. 2017, 79, 182–190. [Google Scholar] [CrossRef]
  24. Tyagi, S.; Sharma, B.; Singh, P.; Dobhal, R. Water Quality Assessment in Terms of Water Quality Index. Am. J. Water Resour. 2013, 1, 5. [Google Scholar] [CrossRef]
  25. Yan, F.; Liu, L.; Li, Y.; Zhang, Y.; Chen, M.; Xing, X. A dynamic water quality index model based on functional data analysis. Ecol. Indic. 2015, 57, 249–258. [Google Scholar] [CrossRef]
  26. Debels, P.; Figueroa, R.; Urrutia, R.; Barra, R.; Niell, X. Evaluation of Water Quality in the Chillán River (Central Chile) Using Physicochemical Parameters and a Modified Water Quality Index. Environ. Monit. Assess. 2005, 110, 301–322. [Google Scholar] [CrossRef] [PubMed]
  27. Koçer, M.A.T.; Sevgili, H. Parameters selection for water quality index in the assessment of the environmental impacts of land-based trout farms. Ecol. Indic. 2014, 36, 672–681. [Google Scholar] [CrossRef]
  28. Zhao, P.; Tang, X.; Tang, J.; Wang, C. Assessing Water Quality of Three Gorges Reservoir, China, Over a Five-Year Period From 2006 to 2011. Water Resour. Manag. 2013, 27, 4545–4558. [Google Scholar] [CrossRef]
  29. Pesce, S.F.; Wunderlin, D.A. Use of water quality indices to verify the impact of Córdoba City (Argentina) on Suquía River. Water Res. 2000, 34, 2915–2926. [Google Scholar] [CrossRef]
  30. Kannel, P.R.; Lee, S.; Lee, Y.-S.; Kanel, S.R.; Khan, S.P. Application of Water Quality Indices and Dissolved Oxygen as Indicators for River Water Classification and Urban Impact Assessment. Environ. Monit. Assess. 2007, 132, 93–110. [Google Scholar] [CrossRef]
  31. Hwang, C.L.; Yoon, K. Multiple Attribute Decision Making; Springer: Berlin/Heidelberg, Germany, 1981; p. 269. [Google Scholar] [CrossRef]
  32. de Grosbois, D.; Plummer, R. Problematizing Water Vulnerability Indices at a Local Level: A Critical Review and Proposed Solution. Water Resour. Manag. 2015, 29, 5015–5035. [Google Scholar] [CrossRef]
  33. Hurley, T.; Sadiq, R.; Mazumder, A. Adaptation and evaluation of the Canadian Council of Ministers of the Environment Water Quality Index (CCME WQI) for use as an effective tool to characterize drinking source water quality. Water Res. 2012, 46, 3544–3552. [Google Scholar] [CrossRef]
  34. Wanda, E.M.M.; Gulula, L.C.; Phiri, G. Determination of characteristics and drinking water quality index in Mzuzu City, Northern Malawi. Phys. Chem. Earth Parts A/B/C 2012, 50–52, 92–97. [Google Scholar] [CrossRef]
  35. Saaty, T.L. Eigenvector and logarithmic least squares. Eur. J. Oper. Res. 1990, 48, 156–160. [Google Scholar] [CrossRef]
  36. Alemi-Ardakani, M.; Milani, A.S.; Yannacopoulos, S.; Shokouhi, G. On the effect of subjective, objective and combinative weighting in multiple criteria decision making: A case study on impact optimization of composites. Expert Syst. Appl. 2016, 46, 426–438. [Google Scholar] [CrossRef]
  37. Sierra, L.A.; Yepes, V.; Pellicer, E. A review of multi-criteria assessment of the social sustainability of infrastructures. J. Clean. Prod. 2018, 187, 496–513. [Google Scholar] [CrossRef]
  38. Gao, S.; Zhang, Z.; Cao, C. Calculating Weights Methods in Complete Matrices and Incomplete Matrices. J. Softw. 2010, 5. [Google Scholar] [CrossRef] [Green Version]
  39. Paule, R.C.; Mandel, J. Consensus values and weighting factors. J. Res. Natl. Bur. Stand. 1982, 87, 377–385. [Google Scholar] [CrossRef]
  40. Becker, W.; Saisana, M.; Paruolo, P.; Vandecasteele, I. Weights and importance in composite indicators: Closing the gap. Ecol. Indic. 2017, 80, 12–22. [Google Scholar] [CrossRef]
  41. Diakoulaki, D.; Mavrotas, G.; Papayannakis, L. Determining objective weights in multiple criteria problems: The critic method. Comput. Oper. Res. 1995, 22, 763–770. [Google Scholar] [CrossRef]
  42. Shakouri, H.; Nabaee, M.; Aliakbarisani, S. A quantitative discussion on the assessment of power supply technologies: DEA (data envelopment analysis) and SAW (simple additive weighting) as complementary methods for the “Grammar”. Energy 2014, 64, 640–647. [Google Scholar] [CrossRef]
  43. Al Omar, R.S.; Parslow, R.C.; Law, G.R. Development of two socioeconomic indices for Saudi Arabia. BMC Public Health 2018, 18, 791. [Google Scholar] [CrossRef]
  44. Qian, Y.; Migliaccio, K.W.; Wan, Y.; Li, Y. Surface water quality evaluation using multivariate methods and a new water quality index in the Indian River Lagoon, Florida. Water Resour. Res. 2007, 43. [Google Scholar] [CrossRef] [Green Version]
  45. Collivignarelli, M.C.; Abbà, A.; Carnevale Miino, M.; Damiani, S. Treatments for color removal from wastewater: State of the art. J. Environ. Manag. 2019, 236, 727–745. [Google Scholar] [CrossRef] [PubMed]
  46. Collivignarelli, M.C.; Carnevale Miino, M.; Baldi, M.; Manzi, S.; Abbà, A.; Bertanza, G. Removal of non-ionic and anionic surfactants from real laundry wastewater by means of a full-scale treatment system. Process Saf. Environ. Prot. 2019, 132, 105–115. [Google Scholar] [CrossRef]
  47. Collivignarelli, M.C.; Abbà, A.; Bestetti, M.; Crotti, B.M.; Carnevale Miino, M. Electrolytic Recovery of Nickel and Copper from Acid Pickling Solutions Used to Treat Metal Surfaces. Water Air Soil Pollut. 2019, 230, 101. [Google Scholar] [CrossRef]
  48. Wold, S.; Esbensen, K.; Geladi, P. Principal component analysis. Chemom. Intell. Lab. Syst. 1987, 2, 37–52. [Google Scholar] [CrossRef]
  49. Hubert, M.; Debruyne, M. Minimum covariance determinant. Comput. Stat. 2009, 2, 36–43. [Google Scholar] [CrossRef]
  50. Rousseeuw, P.J.; Driessen, K.V. A Fast Algorithm for the Minimum Covariance Determinant Estimator. Technometrics 1999, 41, 212–223. [Google Scholar] [CrossRef]
  51. Kaiser, H.F. The Application of Electronic Computers to Factor Analysis. Educ. Psychol. Meas. 1960, 20, 141–151. [Google Scholar] [CrossRef]
  52. Cattell, R.B. The Scree Test For The Number Of Factors. Multivar. Behav. Res. 1966, 1, 245–276. [Google Scholar] [CrossRef]
  53. Kumar, K. Principal component analysis: Most favourite tool in chemometrics. Resonance 2017, 22, 747–759. [Google Scholar] [CrossRef]
  54. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data via the EM Algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 1977, 39, 1–38. [Google Scholar]
  55. MacGregor, J.F.; Kourti, T. Statistical process control of multivariate processes. Control Eng. Pract. 1995, 3, 403–414. [Google Scholar] [CrossRef]
  56. Shewhart, W.A. Statistical Method from the Viewpoint of Quality Control; Dover Publications: New York, NY, USA, 1939; p. 105. [Google Scholar]
  57. Ewaid, S.H.; Abed, S.A. Water quality index for Al-Gharraf River, southern Iraq. Egypt. J. Aquat. Res. 2017, 43, 117–122. [Google Scholar] [CrossRef]
Figure 1. Principal component weighted index (PCWI) plots of raw and treated wastewaters.
Figure 1. Principal component weighted index (PCWI) plots of raw and treated wastewaters.
Water 11 02376 g001
Figure 2. Gaussian mixture models of PCWI of raw (a) and treated (b) wastewater.
Figure 2. Gaussian mixture models of PCWI of raw (a) and treated (b) wastewater.
Water 11 02376 g002
Figure 3. Scatter plot of PCWI scores with confidence ellipse. The labeled samples 17, 22, 35, 51, and 62 were out of confidence interval (97.5%, Chi-square).
Figure 3. Scatter plot of PCWI scores with confidence ellipse. The labeled samples 17, 22, 35, 51, and 62 were out of confidence interval (97.5%, Chi-square).
Water 11 02376 g003
Figure 4. Plots of standardized COD values of raw and treated wastewater.
Figure 4. Plots of standardized COD values of raw and treated wastewater.
Water 11 02376 g004
Figure 5. Comparison of PCWI and water quality index (WQI) computed for data of raw wastewater.
Figure 5. Comparison of PCWI and water quality index (WQI) computed for data of raw wastewater.
Water 11 02376 g005
Figure 6. Comparison of PCWI and WQI computed for data of raw wastewater.
Figure 6. Comparison of PCWI and WQI computed for data of raw wastewater.
Water 11 02376 g006
Figure 7. PCWI Box and Whisker plots of raw wastewater over several years. Central lines indicate medians and small rectangle symbols indicate means.
Figure 7. PCWI Box and Whisker plots of raw wastewater over several years. Central lines indicate medians and small rectangle symbols indicate means.
Water 11 02376 g007
Table 1. Summary statistics of raw wastewaters (n = 67).
Table 1. Summary statistics of raw wastewaters (n = 67).
NH4+
mg/L
BOD
mg/L
COD
mg/L
NO3
mg/L
NO2
mg/L
PO43−
mg/L
TN
mg/L
TSS
mg/L
TP
mg/L
pH
mg/L
TDS
mg/L
Aver.45.596.22053.401.1919.942.9927.867.64569
St. dev.15.039.164.96.431.926.7011.0352.380.23200
Min.12.618.080.80.250.0344.0020.2352.247.07231
Max.81.720937537.68.4430.965.818713.08.101532
Median42.099.22070.960.22519.941.2927.717.66527
Skew.0.4350.0630.2123.342.12−0.1440.2550.510−0.171−0.3012.48
Kurt.−0.217−0.185−0.47712.23.71−0.638−0.477−0.091−0.228−0.3128.15
Table 2. Summary statistics of treated wastewaters (n = 67).
Table 2. Summary statistics of treated wastewaters (n = 67).
NH4+
mg/L
BOD
mg/L
COD
mg/L
NO3
mg/L
NO2
mg/L
PO43−
mg/L
TN
mg/L
TSS
mg/L
TP
mg/L
pH
mg/L
TDS
mg/L
Aver.5.243.128.174.80.59517.324.856.127.18587
St. dev.8.582.318.437.01.776.976.7542.350.40197
Min.0.0311.010.70.670.0122.6711.000.946.21302
Max.48.412.516115914.030.742.01910.98.001580
Median1.372.625.175.40.24216.8024.346.107.24551
Skew.2.762.405.74−0.0456.78−0.0710.5671.47−0.155−0.3342.30
Kurt.9.176.4338.9−0.37348.0−0.803−0.1051.68−0.580−0.5818.36
Table 3. Loadings of selected principal components of raw wastewater.
Table 3. Loadings of selected principal components of raw wastewater.
ParametersPC1PC2PC3PC4PC5
NH4+0.9120.111−0.166−0.068−0.007
BOD0.6780.1330.1580.5250.287
COD0.8970.1640.1300.1870.061
NO3−0.3620.8090.153−0.1240.094
NO2−0.3930.7410.137−0.2730.252
PO43−0.8670.040−0.042−0.314−0.296
TN0.9130.164−0.033−0.1030.057
TSS0.553−0.1290.6800.0760.165
TP0.8430.1430.117−0.304−0.270
pH0.434−0.293−0.448−0.3380.622
TDS0.2850.487−0.6210.397−0.135
Table 4. Loadings of selected principal components of treated wastewater.
Table 4. Loadings of selected principal components of treated wastewater.
ParametersPC1PC2PC3PC4PC5
NH4+−0.4080.2790.7600.0410.123
BOD−0.4050.7100.151−0.016−0.246
COD0.3900.561−0.0220.0180.150
NO30.8120.210−0.281−0.117−0.369
NO2−0.2820.4100.562−0.329−0.267
PO43−0.809−0.1280.438−0.1790.263
TN0.8280.2730.114−0.020−0.334
TSS−0.3990.568−0.309−0.3580.390
TP0.823−0.1160.406−0.1670.286
pH−0.335−0.5720.5320.103−0.188
TDS0.2650.5090.1450.7370.141
Table 5. Probabilities of the Kruskal–Wallis test applied on PCWI of raw wastewater.
Table 5. Probabilities of the Kruskal–Wallis test applied on PCWI of raw wastewater.
19971998199920002001
199710.0500.6570.1150.031
19980.05010.8880.9470.347
19990.6570.88810.5990.142
20000.1150.9470.59910.674
20010.0310.3470.1420.6741
Note: Probabilities in bold are not different from 0 with a significance level α = 0.05.
Table 6. Ranking of PCWI values.
Table 6. Ranking of PCWI values.
RankRangePCWI of Raw WWNPCWI of Treated WWN
IUCL to UWL8.489 to 5.49534.266 to 2.8902
IIUWL to µ5.496 to −0.489282.891 to 0.14029
IIIµ to LWL−0.490 to −6.473340.141 to −2.61234
IVLWI to LCL−6.474 to −9.4652−2.613 to −3.9882

Share and Cite

MDPI and ACS Style

Praus, P. Principal Component Weighted Index for Wastewater Quality Monitoring. Water 2019, 11, 2376. https://doi.org/10.3390/w11112376

AMA Style

Praus P. Principal Component Weighted Index for Wastewater Quality Monitoring. Water. 2019; 11(11):2376. https://doi.org/10.3390/w11112376

Chicago/Turabian Style

Praus, Petr. 2019. "Principal Component Weighted Index for Wastewater Quality Monitoring" Water 11, no. 11: 2376. https://doi.org/10.3390/w11112376

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop