Next Article in Journal
The Evapotranspiration of Tamarix and Its Response to Environmental Factors in Coastal Saline Land of China
Previous Article in Journal
Pyrolysis Process as a Sustainable Management Option of Poultry Manure: Characterization of the Derived Biochars and Assessment of their Nutrient Release Capacities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improvement of Hargreaves–Samani Reference Evapotranspiration Estimates with Local Calibration

1
Department of Agricultural Engineering, Federal University of Viçosa (UFV), Avenue Peter Henry Rolfs, Viçosa 36570-900, Minas Gerais, Brazil
2
Biosystems Engineering Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Avenue Pádua Dias, n°11, Piracicaba-SP 13418-900, Brazil
*
Author to whom correspondence should be addressed.
Water 2019, 11(11), 2272; https://doi.org/10.3390/w11112272
Submission received: 8 September 2019 / Revised: 10 October 2019 / Accepted: 26 October 2019 / Published: 30 October 2019
(This article belongs to the Section Water Use and Scarcity)

Abstract

:
Improving irrigation water management is an important asset when facing increased water shortages. The Hargreaves–Samani (HS) method is a simple method that can be used as an alternative to the Penman–Monteith (PM) method, which requires only temperature measurements for estimating reference evapotranspiration (ETo). However, the applicability of this method relies on its calibration to local meteorological specificities. The objective of this study was to investigate the effects of local calibration on the performance of the HS equation. The study was carried out for the middle portion of the São Francisco River Basin (MSFB), Brazil, and considered four calibration approaches: A1—single calibration for the entire MSFB; A2—separate calibration by clusters of months; A3—by clusters of stations; and A4—for all contexts resulting by combining A2 and A3. Months from the wet season showed larger improvements by the calibration of the HS model, since mean air temperature and its daily range showed stronger correlations to ETo. On the other hand, the months from the dry season and stations from the eastern region of MSFB performed poorly regardless of the calibration approach adopted. This occurred because, in those cases, ETo presented larger correlation to variables that are missing in the HS equation, and the use of the full PM equation seems unavoidable.

1. Introduction

The severity of water shortage due to increasing demands for inputs is an important issue of socioeconomic and global sustainability [1]. In this context, Brazil is one of the few countries that is able to increase its productivity in a sustainable way [2]; however, despite the water abundance, most of this resource is available in less populated areas. Its heterogeneous distribution results in conflicts over its use being more common than expected. These disputes arise mainly due to the insufficient amount of water to supply for humans and animals [3]. In the Brazilian Northeast, this problem is commonly related to lower rainfall rates and the large amount of water withdrawn by irrigation activities [4].
Irrigation provides water and better growth conditions to crops, resulting in higher yields [5]. Despite being an intensive water use activity, irrigation is one of the main activities responsible for guaranteeing food security for future generations [6]. However, in order to attain the benefits of this activity, it is fundamental to observe criteria such as the economic and environmental integrity [7]. The proper management of irrigation reduces the consumption of electricity, increases the efficiency of water use and boosts agricultural productivity [8,9]. In addition, adequate management strategies reduce water withdrawn from streams and, consequently, reduce the conflicts with the local community.
Irrigation management via climate is widely studied and applied [10,11,12,13,14], especially due to its practicality in large-scales and economic viability. In order to carry out the irrigation management via climate, the amount of water to be supplied to irrigated crops is calculated as a function of the product between the reference evapotranspiration (ETo) and the coefficients associated with crop and soil [10]. However, the quantification of ETo in a representative manner depends on the availability of reliable meteorological data, as well as the adequate use of methodologies for its estimation.
The Penman–Monteith (PM) method is widely used as the standard for estimating ETo [10,15]. This method is based on physical processes and incorporates an aerodynamic term, the energy balance and physiological parameters. Its calculation depends on the acquisition of several meteorological variables as input, which is costly. Therefore, data may not be available or not in range for farmers in remote areas. In this case, the paper 56 from the Food and Agriculture Organization of the United Nations (FAO-56) [10] recommends the use of the Hargreaves–Samani (HS) model [16]. The advantage that this method offers is the need to only measure one input variable, the air temperature. However, empirical limitations and lower performance of simplified methods make it necessary to calibrate them to the localities of interest, as well as the development of strategies that allow greater efficiency in irrigation management.
Several authors have proposed the calibration of the empirical equation of ETo considering as reference, the PM model [17,18,19,20]. The regional calibration is an alternative that aims to improve the performance of ETo estimation, allowing its use in several locations and under different climatic conditions [17,21,22]. Besides these methods, other authors have proposed different forms of calibration for better performance. Aguilar and Polo [23] proposed the calibration of the equation considering the dry and rainy seasons. Cobaner et al. [11] included the wind speed variable to the simple linear regression used in the calibration. Zanetti et al. [24] used different calibration methodologies, from using simple linear regression to segregating the stations in groups that presented the same climatic type and thermal amplitude class. Regarding more complex techniques used to estimate ETo, such as machine learning models, Reis et al. [25] developed models calibrated for each station and for pooled data from all weather stations used. Ferreira et al. [13] clustered stations by similar climatic characteristics prior to calibrating their models.
Literature provides evidence that both the spatial and temporal dynamics of meteorological variables can be decisive in improving the HS performance. Therefore, comprehending these implications in evapotranspiration is fundamental to understand the water balance of the agro-ecosystem [26] and to develop more efficient calibration techniques for methodologies demanding fewer input variables.
In this context, the objective of the present study was to investigate the spatial and temporal dynamics of meteorological data and their implications in the calibration of the HS model.

2. Materials and Methods

2.1. Study Area

The area of this study comprises the middle portion of the São Francisco River Basin (MSFB) (Figure 1). The middle of São Francisco occupies about 402,500 km2, representing 63% of the entire São Francisco River Basin. The climate of the region is tropical semi-arid and dry sub-humid, with rainfall ranging from 600 mm to 1400 mm, average annual temperatures around 24 °C and an average evapotranspiration of 1300 mm [27].
Since it is an extensive area, there is great diversity in terms of climate, types of soils, vegetation and agricultural use. The region is characterized by the transition between three biomes, the Cerrado, Caatinga and Atlantic rainforest (Figure S1), with predominant dry land and savannah-like vegetation. The region is marked for extensive irrigated perimeters [28,29], cultivating banana, mango, citrus, grape, sugar cane, soybean, corn, cotton and coffee [30].

2.2. Weather Station Data Used

The data used referred to daily observations from 12 conventional weather stations (Table 1) covering the period from 1997 to 2016. All stations were located in the study area and their data were obtained from the Meteorological Database for Education and Research (BDMEP), made available by the National Institute of Meteorology (INMET) of Brazil.
The information acquired by the weather stations were mean (T, °C), maximum (Tmax, °C) and minimum air temperature (Tmin, °C); relative humidity (RH, %); wind speed 10 m above ground (u10, m·s−1); and duration of sunshine hours in a day (n). Sunshine hours and the wind speed were later converted to solar radiation (Rs, MJ·m−2·day−1) and wind speed 2 m above ground (u2, m·s−1), following the methodology proposed by Allen et al. [10].
The data obtained were evaluated to eliminate faulty data, e.g., number of hours of sunshine above the observable limit, negative wind speed [31], relative humidity outside the range of 5% to 100% [32] and values of minimum temperature higher than the maximum temperature [33].

2.3. Reference Evapotranspiration

2.3.1. Penman–Monteith

The Penman–Monteith grass reference evapotranspiration method standardized by FAO [10] was used as a base for comparison and subsequent fit of the Hargreaves–Samani method. The Penman–Monteith equation is written as follows:
ETo = 0.408   ( R n G ) + γ   900 T + 273 u 2   ( e s e a ) Δ + γ   ( 1 + 0.34   u 2 )
where: ETo = reference evapotranspiration (mm·day−1); Rn = net radiation (MJ·m−2·day−1); G = soil heat flux (MJ·m−2·day−1); T = mean air temperature (°C); u2 = wind speed 2 m above ground (m·s−1); es = saturation vapor pressure (kPa); ea = actual vapor pressure (kPa); Δ = slope of saturation vapor pressure curve (kPa·°C−1); and, γ = psychrometric constant (kPa·°C−1). All the parameters from Equation (1) were computed according to Allen et al. [10].

2.3.2. Hargreaves–Samani

In order to estimate ETo considering only temperature measurements, we used the Hargreaves–Samani equation [16] as follows:
ETo = 0.0023   ( T max T min ) 0.5 ( T + 17.8 )   Ra
where: Ra = extraterrestrial radiation (mm·day−1). Extraterrestrial radiation was calculated from the latitude and day data of the year [10], and no equipment was needed for its measurement.
The coefficients adopted in the HS equation present an empirical basis, calibrated for Davis, in California. Thus, calibrating Equation (2) is necessary in order to adequately implement this equation for regions of interest.

2.4. Improving Hargreaves–Samani Estimates

A few different approaches were considered in the calibration of the Hargreaves–Samani: A1—general calibration of coefficients for the entire MSFB region and year; A2—separate calibration for clusters of months; A3—separate calibration for clusters of stations (subregions); A4—separate calibration in the contexts that resulted from clustering months within subregions, a combination of approaches A2 and A3.

2.4.1. General Calibration of Coefficients

In order to calibrate the Hargreaves–Samani empirical coefficients, we must respect the form in which the equation was developed. For instance, in the absence of sunshine hours or solar radiation observation, Hargreaves and Samani [34] adopted the concept that the difference between maximum and minimum air temperature (TR = Tmax − Tmin) can be used to estimate the fraction of extraterrestrial radiation reaching the Earth’s surface [10] (Equation (3)).
Rs = k Rs   ( T max T min ) 0.5 Ra
where kRs = empirical coefficient fitted to Rs/Ra (°C−0.5). The original form or the Hargreaves–Samani equation (Equation (2)) adopts kRs equal 0.17, but was here calibrated considering all observations for the middle of São Francisco.
The mean air temperature offset (“17.8” in Equation (2)) was also calibrated considering the fitted value of kRs. Thus, for the general adjustment of the Hargreaves–Samani coefficients, the equation may be written as:
ETo = 0.0135   k Rs   ( T max T min ) 0.5 ( T + offset ) Ra
where offset = mean air temperature offset.

2.4.2. Regional or Seasonal Clustering of Data

There is evidence in the literature indicating that both seasonal [23,24] and spatial [13] patterns of meteorological variables may influence the performance of predictions made by methods simple in input requirement. Therefore, in an attempt to better calibrate the HS equation for the region, we investigated the effects of clustering data by spatial and seasonal similarities.
The investigation was carried out using the k-means algorithm [35] to check for groups of months or stations which presented similar correlation between meteorological parameters and reference evapotranspiration. Therefore, we used as input for the algorithm, the linear correlation between Penman–Monteith reference evapotranspiration (EToPM) and meteorological parameters (T, TR and Ra). This was performed considering all linear correlations by month and station. The decision on the beneficial effects of separating data into groups of months or stations was based in the optimal numbers of cluster by (i) the total within-cluster sum of squares and by (ii) the average silhouette width method [36].
The total within-cluster sum of squares method is based on observing the decrease of total within-cluster sum of squares by increasing the number of clusters; the optimal number of clusters is chosen when the decrease rate becomes less steep (“elbow method”) [37]. The average silhouette method, on the other hand, measures how similar an object is to its own cluster (cohesion) compared to other clusters (separation). The average silhouette values range from −1 to +1, where positive values closer to 1 indicate good cohesion and separation, and negative values are undesired for clustering.

2.5. Validation and Accuracy Indicators

The data was temporally split into calibration (70%) and test set (30%). That is, using observations from 01-01-1997 to 31-12-2010 (14 years) as calibration data and observations from 01-01-2011 to 31-12-2016 (six years) as a test set. The performance of the proposed approaches was evaluated by comparing their predictions for the test set to corresponding EToPM values. Performance was measured with several criteria:
  • The slope of the regression forced to the origin (FTO) considering EToPM as observed data (Oi) and EToHS as predicted values (Pi).
  • Coefficient of determination (R2) of the ordinary least squares (OLS) regression considering Oi and Pi.
  • Mean bias error (ME, mm·day−1), mean absolute error (MAE, mm·day−1) and root mean square error (RMSE, mm·day−1).
Slopes of the FTO regression close to 1.0 indicate that EToPM and EToHS are similar, while values >1.0 suggest overestimation, and <1.0 underestimation. The R2 of the OLS regression indicates how much of the variation of EToPM is explained by the variation of EToHS. ME indicates the average under or overestimation, MAE indicates the average magnitude of absolute errors and RMSE is used as a measure of dispersion, giving higher importance to larger errors. The statistical indices are described in Equations (5)–(8):
ME = 1 n i = 1 n ( P i O i )
MAE = 1 n i = 1 n | P i O i |
RMSE = 1 n i = 1 n ( P i O i ) 2
R 2 = [ i = 1 n ( P i P ¯ ) ( O i O ¯ ) ] 2 i = 1 n ( P i P ¯ ) 2 i = 1 n ( O i O ¯ ) 2
where: Oi = observed/calculated ETo for the test set (EToPM) and, Pi = predicted ETo for the test set (EToHS).

3. Results and Discussion

3.1. Data Analyses

Rainfall from weather stations and observed well-defined wet and dry seasons (Figure S2). The months from May to September were considered in this study as the dry season, for they presented close to no rainfall for all stations and altogether represented only 3.4% of the total annual rainfall. April and October characterized the transition periods from and to wetter months, with monthly rainfall usually above 50 mm·month−1. Other meteorological parameters for each station are presented on Figures S3–S8, while a general overview on the region is shown in Figure 2.
In Figure 2, we notice that there is an evident seasonality to the data. Temperature presents its lower values in winter, but with higher temperature ranges. Higher temperature ranges coincide with the drier period, the behavior of which is well described by Allen et al. [10]. Fewer clouds occur during the dry season, resulting in higher relative sunshine hours (n/N, where N = maximum number of sunshine hours for given day). Thus, most of the incoming solar radiation reaches ground and Tmax will be larger in relation to T. Lesser clouds during nighttime also result in less downward longwave radiation emitted by the atmosphere and lower Tmin during the night. Solar radiation estimated from sunshine hours presented higher values but also with higher variability during the wet season, which relates to the presence of more clouds. Relative humidity decreased along the dry season, while wind speed increased its values until the end of it.
In Figure 3, the coefficient of correlation between EToPM and the meteorological variables used in the HS model by month and by station are presented. Although only TR, T and Ra are considered in the HS equation, we also presented the correlation to other variables, such as u2, RH and Rs, for this would provide better insights on the outcomes from the equation.
Monthly-wise, TR and T presented stronger correlations to EToPM during the rainy season, while Ra presented lower correlation in this period. Ra showed larger correlation to EToPM during the dry months, from May to August, with an exception for June. It may be noted that the correlations between EToPM and TR, T and Ra decrease during the dry season, while the correlation between EToPM and u2 increases. Even prior to the investigation with the clustering algorithm, the seasonality factor seems to be more relevant than regional.

3.2. Cluster Analyses

The total within-cluster sum of squares and average silhouette width from cluster analyses are presented in Figure 4. The meteorological characteristics monthly-wise resulted in better clustering results, which is observed from larger decay in the total within-cluster sum of squares. That is, for the same number of clusters of months and stations, a greater similarity is observed within clusters of months.
As for the optimal number of clusters, adopting the “elbow” method for the total within-cluster sum of squares method could be subjective; however, the average silhouette width method indicates that it would be more effective to partition clusters into two groups—months and stations. Additionally, the higher value of the average silhouette and lower total within-cluster sum of squares indicates that clusters of months are better matched than clusters of stations. Since the Hargreaves–Samani methodology was developed for simplicity, it is appropriate that we adopt two clusters to proceed with the analyses.
While assessing the clustered months considering k = 2, approach A2, it is observed that the clustered months coincided with the wet and dry seasons. One cluster contained the months from October to April, and the other from May to September. This confirms that it is adequate to calibrate the Hargreaves–Samani equation considering the wet and dry season, as suggested by Zanetti et al. [24] and Aguilar and Polo [23]. We may also conclude that for regions with similar climatic characteristics, seasonality are stronger indicators of the relation between meteorological variables and evapotranspiration. However, this may not be true for really extensive areas [13].
When k = 2 for stations clustering, approach A3, one cluster contained the stations of ID number 83338, 83388 and 83395 (Region One), and the other cluster contained the remaining nine stations (Region Two). The three stations forming Region One are all close together in the eastern region of the middle of the São Francisco Basin (Figure 1). This region is characterized by both the transitions between biomes and climate zones (Figure S1), which resulted in their different behavior when compared to the other stations. Although they are close to regions of humid sub-tropical climate type, their climate seems to be more closely related to a semi-arid one (Caatinga biome), experiencing the lowest rainfall among all stations (Figure S2). A more noticeable characteristic of these stations, however, is the much larger daily wind speed variability and magnitude when compared to the other stations (Figure 5). The higher variability seems to also be the cause for higher correlation between wind speed and ETo throughout the year for these stations (Figure 3).
For approaching A4, the combination of approaches A2 and A3, we separately assessed the clustering of months for Regions One and Two. The total within-cluster sum of squares and average silhouette is presented in Figure 6. In this case, the algorithm showed that the optimal number of clusters of months was also equal to two for both regions. The clusters of months could have been different for each region, however, they are grouped in the same way as in approach A2. That is, a cluster for the dry season (May to September) and another for the wet season (October to April). This only reinforces the belief that rainfall is an important factor determining how ETo interacts with meteorological variables and, therefore, it seems adequate to calibrate the HS equation considering the wet and dry season.

3.3. Parameters of Improved Hargreaves–Samani Models

The four calibration approaches resulted in nine sets of coefficients for the HS equations in the Middle São Francisco River Basin (Table 2). Approach A1 presented one set of coefficients, while A2 and A3 presented two sets of coefficients and A4 presented four sets of coefficients.
Average kRs observed for the middle São Francisco River Basin was equal to 0.166, just below the standard value used by Hargreaves and Samani [16] (0.170). All offsets from calibrated equations were smaller than the traditional equation, ranging from 15.08 to 17.12.
We observed higher temperature ranges in the dry season, but lower extraterrestrial solar radiation, resulting in larger kRs for the dry season when compared to the wet season. This is similar to as observed by Paredes et al. [38], where kRs for a diverse number of stations in an archipelago presented lower values for the rainier period.

3.4. Performances of Improved Hargreaves–Samani Models

The performance criteria obtained before and after the calibration are presented in Table 3. The calibration was fundamental in the minimization of errors in ETo estimates, decreasing MAE and RMSE by up to 0.11 and 0.12 mm·day−1, respectively. The original HS was shown to systematically overestimate EToPM. After calibration, despite FTO regression slopes below 1, the ME became a lot closer to 0, indicating much lower bias. R2 values only increased for calibration approaches involving clustered data, with a maximum observed value equal to 0.57.
The performance of the original and improved HS models by month is presented in Tables S1–S5. The monthly performance criteria for the original model ranged from 0.00 to 0.80 mm·day−1 for ME, from 0.58 to 1.01 mm·day−1 for MAE, from 0.73 to 1.19 mm·day−1 for RMSE, 0.03 to 0.72 for R2 and from 0.98 to 1.11 for the FTO regression slope. The very low values of R2 occurred from May to September and are as a result of the overall poor correlation of TR and T to EToPM (Figure 3). The best performing calibration approach, A4, resulted in a decrease in ME of up to 0.55 mm·day−1; the decrease in MAE ranged from 0.02 mm·day−1, in August and September, to 0.25 mm·day−1, in November.
By clustering data in groups of months, a larger improvement of the predictions was expected. However, the poor correlation of the variables to ETo for the dry season resulted in little performance improvement, e.g., a maximum reduction in RMSE equal to 0.06 mm·day−1 when compared to approach A1. It is noted in Figure 3 that the parameters with highest correlations to EToPM during the dry season were wind speed, relative humidity and solar radiation, which are not considered in the HS model. Although approach A2 and A4, overall, reduced predictions bias, it resulted in a large increase for the months of August and September, which previously had low biases. On the other hand, approach A3 and A4 resulted in a better balanced bias among the months. Improvements from using approach A3 were superior to those when using approach A2, with a higher overall R2. This was not expected at first, since months seemed to group in clusters better than stations did (Figure 4).
As for the stations (Tables S6–S10), the performance of the original HS model by station ranged from −0.52 to 0.90 mm day−1 for ME, 0.54 to 1.00 mm·day−1 for MAE, from 0.69 to 1.24 mm·day−1 for RMSE, 0.34 to 0.76 for R2 and from 0.87 to 1.19 for the FTO regression slope. The stations that benefited most for the best performing calibration approach, A4, were stations 83076, 83334, 83452 and 83389, with a decrease in RMSE ranging from 0.28 to 0.35 mm·day−1. In particular, three stations presented much worse performance, overall, when compared to the rest of them, for which ID numbers are 83338, 83388 and 83395. These three stations coincide with the stations from Region One in approaches A3 and A4. Oddly, stations 83338 and 83388 did not improve the performance for any approach, even decreasing performance for approaches A1 and A2, while station 83395 only slightly improved its performance. In fact, Reis et al. [25] also assessed the HS model for these same three stations and observed no improvements in correlation between observed and predicted values, even when performing the HS model calibration station by station.
It is noted in Figure 5 that all three stations in Region One presented much higher values and variance of wind speed, an important parameter for the aerodynamic term in Penman–Monteith’s equation. This alone may be a strong indication that the HS model would not be able to achieve much higher performances after calibration. In addition, stations 83338 and 83388 showed a tendency to underestimate EToPM, while 83395 to overestimate and, perhaps, this opposite behavior hindered the improvement of the region’s performance as well. Even for a small region, the very dynamic climatic characteristic of the transition between different biomes and climates rendered the stations different relation to EToPM. This suggests that a single calibration for the HS model in such a dynamic zone will not supply the region’s demand for better ETo estimates. For instance, station 83395 is the only station with a large but negative correlation between the temperature range and EToPM during the dry season.
The scatterplots for the test set predictions using the single equation calibrated for the MSFB (A1) and the best calibration approach (A4) are shown in Figure 7 and Figure 8 for each station. It may be noted that, for all cases, the fitted line indicates the improved HS method generally overestimates lower values of ETo, while generally underestimating higher values. This behavior is aggravated for the stations that performed worse (83338, 83388 and 83395). A similar behavior has also been observed while reviewing many studies regarding the HS methodology [24,38,39]. For not considering all input variables in the ETo estimation, the HS model is less prone to extreme values and makes predictions in smaller ranges. Therefore, the OLS fitted line becomes more horizontal and its estimates are closer to the observed mean, thereby decreasing the determination coefficient of the fitted model.
By comparing Figure 7 and Figure 8, a larger improvement is noted for stations 83338 and 83388 from A1 to A4. Oddly, station 83395 showed a decrease in performance upon comparing A1 to A4. In fact, by pooling all the data in approach A1, we improved the performance of most stations, but severely decreased the performance for 83338 and 83388. This is likely an outcome from stations 83338 and 83388 not only presenting high wind speed variability, but also higher daily values, resulting in the original model underestimating EToPM for these stations. In Region One, although the stations’ high wind speed variability resulted in a similar correlation to EToPM, lower values of wind speed for station 83395 resulted in a tendency to overestimate EToPM, similar to the stations in Region Two.
Despite the meteorological spatial variability had proven itself significant, the calibration for Region One using approach A4 showed close to no improvement when compared to the original HS equation. On the other hand, the performance improvement from Region Two was very similar to what is observed in approach A1. This indicates that approach A1 is likely the best option, and an alternative would be to not include stations with similar characteristics to Region One in the calibration, as their performance seems to rely on more complex models. Perhaps, performing calibrations using a larger number of cluster of stations, or station by station, could lead to better results in this region. However, this would have resulted in obtaining far more equations for the middle of São Francisco and deviate from the original purpose of the HS model—simplicity. In addition, each equation would be very specific for very small regions and not convenient overall.
Zhu et al. [39] evaluated the original HS and its calibrated form for 838 weather stations in China, and also observed that the HS model accuracy generally decreases in conditions of extreme values because the model ignores relative humidity and wind speed. For humid regions, however, Paredes et al. [38] documented that wind impacts are unimportant for the HS model, but require proper adjustments. To surpass the limitations that the absence of wind speed may impose in tropical and semiarid regions, Cobaner et al. [11] assessed a modified version of the HS model for 275 weather stations in Turkey. The authors modified version included the wind speed in the equation, resulting in better accuracy for all stations. Similarly, Valiantzas [40] developed new equations, including relative humidity alone or in addition to wind speed. Considering 32 stations in California, Florida, Arizona and Greece, the author documented an improved accuracy of about 45% by including relative humidity alone.
Overall, we may conclude that finding periods or regions with similar climatic conditions is important to calibrate the HS model and, generally, results in better performances, but does not necessarily mean that predictions for a given period of the year or region will largely benefit from it. In the case of the middle of São Francisco, (i) a single equation for the entire region and year (approach A1) is a clear improvement on the Davis-calibrated HS equation. (ii) Improvements from further seasonal/regional calibrations were small and seem to be unjustified. The performance of predictions showed stronger dependency on the correlation between input and output variables used in the model. Thus, (iii) the use of the full PM equation, or at least more complex models, seems to be unavoidable for the dry season, when the HS model variables presented lower correlation to EToPM, and for Region One, where wind speed presents high variability throughout the year. In those situations, the models developed for a region should be assessed carefully.

4. Conclusions

In this paper, we investigate the effects of local calibration of the Hargreaves–Samani equation for estimating reference evapotranspiration in the middle portion of the São Francisco River Basin (MSFB). The calibration approaches used considered a single calibration for the entire MSFB and year, a calibration for clusters of months, a calibration for clusters of stations and a calibration for the contexts resulted by clustering months within sub-regions. In order to cluster data, an algorithm of unsupervised learning was used by considering the correlations between meteorological variables and reference evapotranspiration.
The meteorological data from the MSFB were shown to be more effectively segregated by their seasonal similarities, as opposed to regional similarities. The clustered months coincided with the wet and dry seasons; however, the improvement of the HS model predictions did not depend only on finding similar regions. We observed that the mean air temperature and the range between the daily maximum and minimum showed a poor correlation to reference evapotranspiration during the dry season, resulting in only a small improvement on the performance by the calibration for the period.
When clustering data by stations, the clustering algorithm indicated that the optimal number of clusters was two sub-regions. Yet, the cluster with stations located in the eastern region of the middle São Francisco River Basin presented a poor performance by the HS model for all approaches assessed. The cause for this is that the region presents higher variability in terms of the wind speed values, which the model is not sensitive to. In addition, stations within this small region presented different bias behavior when predicting EToPM.
The HS model is one of the most parsimonious and widely used equations for data-scarce regions, and its calibration is fundamental for reducing bias and making more adequate evapotranspiration predictions. In summary, the results suggest that the use of a single HS equation calibrated for the MSFB region is a clear improvement on the original equation, and further calibrations by season or region seem unjustified. Lower correlation between meteorological variables used in the model and reference evapotranspiration during the dry season and high wind speed variability for stations indicate that the use of the full PM equation is necessary.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4441/11/11/2272/s1, Figure S1: Weather stations in relation to climate zones and biomes, Figure S2. Monthly rainfall by month and station, Figure S3. Daily reference evapotranspiration by month and station, Figure S4. Daily solar radiation by month and station, Figure S5. Daily relative humidity by month and station, Figure S6. Daily wind speed by month and station, Figure S7. Daily mean air temperature by month and station, Figure S8. Daily temperature range by month and station, Table S1: Performance by month of reference evapotranspiration predictions made by the Hargreaves-Samani model, Table S2. Performance by month of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A1, Table S3. Performance by month of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A2, Table S4. Performance by month of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A3, Table S5. Performance by month of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A4, Table S6. Performance by station of reference evapotranspiration predictions made by the Hargreaves-Samani model, Table S7. Performance by station of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A1, Table S8. Performance by station of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A2, Table S9. Performance by station of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A3, Table S10. Performance by station of reference evapotranspiration predictions made by the Hargreaves-Samani model calibrated with approach A4.

Author Contributions

Formal analysis, D.A.; methodology, D.A. and R.A.d.S.; writing—original draft, D.A. and R.A.d.S.; writing—review and editing, H.C.B., F.F.d.C. and R.F.

Funding

This study was supported by the Federal University of Viçosa—UFV, the National Council for Scientific and Technological Development—CAPES (grant number—142273/2019-8) and by the Coordination for the Improvement of Higher Education Personnel—CAPES (finance code 001).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Abdulbaki, D.; Al-Hindi, M.; Yassine, A.; Abou Najm, M. An optimization model for the allocation of water resources. J. Clean. Prod. 2017, 164, 994–1006. [Google Scholar] [CrossRef]
  2. Wendt, D.E.; Rodrigues, L.N.; Dijksma, R.; Van Dam, J.C. Assessing groundwater potential use for expanding irrigation in the Buriti Vermelho watershed. IRRIGA 2015, 1, 81–94. [Google Scholar] [CrossRef]
  3. Silva, J.M.D.; Gurgel, I.G.D.; Santos, M.O.D.; Gurgel, A.D.M.; Augusto, L.G.D.S.; Costa, A.M. Conflitos ambientais e as águas do rio São Francisco. Saúde E Soc. 2015, 24, 1208–1216. [Google Scholar] [CrossRef]
  4. Getirana, A.C.V.; de Fátima Malta, V.; de Azevedo, J.P.S. Decision Process in a Water Use Conflict in Brazil. Water Resour. Manag. 2008, 22, 103–118. [Google Scholar] [CrossRef]
  5. Zohaib, M.; Kim, H.; Choi, M. Detecting global irrigated areas by using satellite and reanalysis products. Sci. Total Environ. 2019, 677, 679–691. [Google Scholar] [CrossRef]
  6. Godfray, H.C.J.; Beddington, J.R.; Crute, I.R.; Haddad, L.; Lawrence, D.; Muir, J.F.; Pretty, J.; Robinson, S.; Thomas, S.M.; Toulmin, C. Food security: The challenge of feeding 9 billion people. Science 2010, 327, 812–818. [Google Scholar] [CrossRef]
  7. Tarjuelo, J.M.; Rodriguez-Diaz, J.A.; Abadía, R.; Camacho, E.; Rocamora, C.; Moreno, M.A. Efficient water and energy use in irrigation modernization: Lessons from Spanish case studies. Agric. Water Manag. 2015, 162, 67–77. [Google Scholar] [CrossRef]
  8. Jiménez-Bello, M.A.; Royuela, A.; Manzano, J.; Prats, A.G.; Martínez-Alzamora, F. Methodology to improve water and energy use by proper irrigation scheduling in pressurised networks. Agric. Water Manag. 2015, 149, 91–101. [Google Scholar] [CrossRef] [Green Version]
  9. Pryor, S.W.; Smithers, J.; Lyne, P.; van Antwerpen, R. Impact of agricultural practices on energy use and greenhouse gas emissions for South African sugarcane production. J. Clean. Prod. 2017, 141, 137–145. [Google Scholar] [CrossRef] [Green Version]
  10. Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop Evapotranspiration-Guidelines for Computing Crop Water Requirements-FAO Irrigation and Drainage Paper 56, 9th ed.; Food and Agriculture Organization of the United Nations: Rome, Italy, 1998; ISBN 92-5-104219-5. [Google Scholar]
  11. Cobaner, M.; Citakoğlu, H.; Haktanir, T.; Kisi, O. Modifying Hargreaves–Samani equation with meteorological variables for estimation of reference evapotranspiration in Turkey. Hydrol. Res. 2017, 48, 480–497. [Google Scholar] [CrossRef]
  12. Dou, X.; Yang, Y. Evapotranspiration estimation using four different machine learning approaches in different terrestrial ecosystems. Comput. Electron. Agric. 2018, 148, 95–106. [Google Scholar] [CrossRef]
  13. Ferreira, L.B.; da Cunha, F.F.; de Oliveira, R.A.; Fernandes Filho, E.I. Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM—A new approach. J. Hydrol. 2019, 572, 556–570. [Google Scholar] [CrossRef]
  14. Tabari, H.; Grismer, M.E.; Trajkovic, S. Comparative analysis of 31 reference evapotranspiration methods under humid conditions. Irrig. Sci. 2013, 31, 107–117. [Google Scholar] [CrossRef]
  15. ASCE-EWRI. The ASCE standardized reference evapotranspiration equation. In Technical Committee Rep. to the Environmental and Water Resources Institute of ASCE from the Task Committee on Standardization of Reference Evapotranspiration; ASCE-EWRI: Reston, VA, USA, 2005; p. 173. [Google Scholar]
  16. Hargreaves, G.H.; Samani, Z.A. Reference crop evapotranspiration from temperature. Appl. Eng. Agric. 1985, 1, 96–99. [Google Scholar] [CrossRef]
  17. Gavilán, P.; Lorite, I.J.; Tornero, S.; Berengena, J. Regional calibration of Hargreaves equation for estimating reference ET in a semiarid environment. Agric. Water Manag. 2006, 81, 257–281. [Google Scholar] [CrossRef]
  18. Shiri, J.; Sadraddini, A.A.; Nazemi, A.H.; Marti, P.; Fakheri Fard, A.; Kisi, O.; Landeras, G. Independent testing for assessing the calibration of the Hargreaves–Samani equation: New heuristic alternatives for Iran. Comput. Electron. Agric. 2015, 117, 70–80. [Google Scholar] [CrossRef]
  19. Tabaru, H.; Talaee, P.H. Local Calibration of the Hargreaves and Priestley-Taylor Equations for Estimating Reference Evapotranspiration in Arid and Cold Climates of Iran Based on the Penman-Monteith Model. J. Hydrol. Eng. 2011, 16, 837–845. [Google Scholar] [CrossRef]
  20. Zhang, Q.; Cui, N.; Feng, Y.; Gong, D.; Hu, X. Improvement of Makkink model for reference evapotranspiration estimation using temperature data in Northwest China. J. Hydrol. 2018, 566, 264–273. [Google Scholar] [CrossRef]
  21. Mendicino, G.; Senatore, A. Regionalization of the Hargreaves Coefficient for the Assessment of Distributed Reference Evapotranspiration in Southern Italy. J. Irrig. Drain. Eng. 2013, 139, 349–362. [Google Scholar] [CrossRef]
  22. Ravazzani, G.; Corbari, C.; Morella, S.; Gianoli, P.; Mancini, M. Modified Hargreaves-Samani Equation for the Assessment of Reference Evapotranspiration in Alpine River Basins. J. Irrig. Drain. Eng. 2012, 138, 592–599. [Google Scholar] [CrossRef]
  23. Aguilar, C.; Polo, M.J. Generating reference evapotranspiration surfaces from the Hargreaves equation at watershed scale. Hydrol. Earth Syst. Sci. 2011, 15, 2495–2508. [Google Scholar] [CrossRef] [Green Version]
  24. Zanetti, S.S.; Dohler, R.E.; Cecílio, R.A.; Pezzopane, J.E.M.; Xavier, A.C. Proposal for the use of daily thermal amplitude for the calibration of the Hargreaves-Samani equation. J. Hydrol. 2019, 571, 193–201. [Google Scholar] [CrossRef]
  25. Reis, M.M.; da Silva, A.J.; Zullo Junior, J.; Tuffi Santos, L.D.; Azevedo, A.M.; Lopes, É.M.G. Empirical and learning machine approaches to estimating reference evapotranspiration based on temperature data. Comput. Electron. Agric. 2019, 165, 104937. [Google Scholar] [CrossRef]
  26. Liou, Y.-A.; Kar, S.K. Evapotranspiration Estimation with Remote Sensing and Various Surface Energy Balance Algorithms—A Review. Energies 2014, 7, 2821–2849. [Google Scholar] [CrossRef]
  27. Alvares, C.A.; Stape, J.L.; Sentelhas, P.C.; de Moraes, G.; Leonardo, J.; Sparovek, G. Köppen’s climate classification map for Brazil. Meteorol. Z. 2013, 22, 711–728. [Google Scholar] [CrossRef]
  28. Pousa, R.; Costa, M.H.; Pimenta, F.M.; Fontes, V.C.; Brito, V.F.A.D.; Castro, M. Climate change and intense irrigation growth in western Bahia, Brazil: The urgent beed for hydroclimatic monitoring. Water 2019, 11, 933. [Google Scholar] [CrossRef]
  29. Filgueiras, R.; Reis, M.M.; Lopes, É.M.G.; Ribeiro, R.B.; Ramos, M.C.A. Spatio-temporal dynamics of climatological variables in the aid of decision making for irrigated agriculture. J. Agric. Sci. 2019, 11, 292. [Google Scholar] [CrossRef]
  30. EMBRAPA. Relatório técnico e de atividades da Embrapa Semiárido 2011. In Technical Report and Activities from the Semiarid Embrapa 2011; Embrapa Semiárido: Petrolina, Brazil, 2012; p. 69. (In English) [Google Scholar]
  31. Shafer, M.A.; Fiebrich, C.A.; Arndt, D.S.; Fredrickson, S.E.; Hughes, T.W. Quality assurance procedures in the Oklahoma mesonetwork. J. Atmos. Ocean. Technol. 2000, 17, 474–494. [Google Scholar] [CrossRef]
  32. Gavilán, P.; Estévez, J.; Berengena, J. Comparison of standardized reference evapotranspiration equations in Southern Spain. J. Irrig. Drain. Eng. 2008, 134, 1–12. [Google Scholar] [CrossRef]
  33. Estévez, J.; Gavilán, P.; Giráldez, J.V. Guidelines on validation procedures for meteorological data from automatic weather stations. J. Hydrol. 2011, 402, 144–154. [Google Scholar] [CrossRef] [Green Version]
  34. Hargreaves, G.H.; Samani, Z.A. Estimating potential evapotranspiration. J. Irrig. Drain. Div. 1982, 108, 225–230. [Google Scholar]
  35. Hartigan, J.A.; Wong, M.A. Algorithm AS 136: A K-Means Clustering Algorithm. J. R. Stat. Soc. Ser. C Appl. Stat. 1979, 28, 100–108. [Google Scholar] [CrossRef]
  36. Charrad, M.; Ghazzali, N.; Boiteau, V.; Niknafs, A. NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set. J. Stat. Softw. 2014, 61, 1–36. [Google Scholar] [CrossRef]
  37. Bholowalia, P.; Kumar, A. EBK-means: A clustering technique based on elbow method and k-means in WSN. Int. J. Comput. Appl. 2014, 105, 17–24. [Google Scholar]
  38. Paredes, P.; Fontes, J.C.; Azevedo, E.B.; Pereira, L.S. Daily reference crop evapotranspiration in the humid environments of Azores islands using reduced data sets: Accuracy of FAO-PM temperature and Hargreaves-Samani methods. Theor. Appl. Climatol. 2018, 134, 595–611. [Google Scholar] [CrossRef]
  39. Zhu, X.; Luo, T.; Luo, Y.; Yang, Y.; Guo, L.; Luo, H.; Fang, C.; Cui, Y. Calibration and Validation of the Hargreaves-Samani Model for Reference Evapotranspiration Estimation in China. Irrig. Drain. 2019, 68, 822–836. [Google Scholar] [CrossRef]
  40. Valiantzas, J.D. Temperature-and humidity-based simplified Penman’s ET0 formulae. Comparisons with temperature-based Hargreaves-Samani and other methodologies. Agric. Water Manag. 2018, 208, 326–334. [Google Scholar] [CrossRef]
Figure 1. Location of study area and weather stations in relation to the São Francisco Basin and Brazil.
Figure 1. Location of study area and weather stations in relation to the São Francisco Basin and Brazil.
Water 11 02272 g001
Figure 2. Descriptive statistics of meteorological parameters for the middle São Francisco from 1997 to 2016.
Figure 2. Descriptive statistics of meteorological parameters for the middle São Francisco from 1997 to 2016.
Water 11 02272 g002
Figure 3. Correlation between Penman–Monteith reference evapotranspiration and meteorological parameters by month and station. Color scale relates to correlation coefficient values, while size scale relates to absolute values of the correlation coefficient.
Figure 3. Correlation between Penman–Monteith reference evapotranspiration and meteorological parameters by month and station. Color scale relates to correlation coefficient values, while size scale relates to absolute values of the correlation coefficient.
Water 11 02272 g003
Figure 4. Total within-cluster sum of squares and average silhouette width versus number of clusters (k) while considering meteorological characteristics monthly and station-wise.
Figure 4. Total within-cluster sum of squares and average silhouette width versus number of clusters (k) while considering meteorological characteristics monthly and station-wise.
Water 11 02272 g004
Figure 5. Daily wind speed by month and station.
Figure 5. Daily wind speed by month and station.
Water 11 02272 g005
Figure 6. Total within-cluster sum of squares and average silhouette width versus the number of clusters (k) of months while considering their meteorological characteristics for two different regions.
Figure 6. Total within-cluster sum of squares and average silhouette width versus the number of clusters (k) of months while considering their meteorological characteristics for two different regions.
Water 11 02272 g006
Figure 7. Daily reference evapotranspiration scatterplots between Penman–Monteith and Hargreaves–Samani improved by fitting new coefficients (approach A1). The red and black lines are the ordinary least squares (OLS) and forced to the origin (FTO) regressions.
Figure 7. Daily reference evapotranspiration scatterplots between Penman–Monteith and Hargreaves–Samani improved by fitting new coefficients (approach A1). The red and black lines are the ordinary least squares (OLS) and forced to the origin (FTO) regressions.
Water 11 02272 g007
Figure 8. Daily reference evapotranspiration scatterplots between Penman–Monteith and Hargreaves–Samani improved by clustering data by stations and months (approach A4). The red and black lines are the ordinary least squares (OLS) and forced to the origin (FTO) regressions.
Figure 8. Daily reference evapotranspiration scatterplots between Penman–Monteith and Hargreaves–Samani improved by clustering data by stations and months (approach A4). The red and black lines are the ordinary least squares (OLS) and forced to the origin (FTO) regressions.
Water 11 02272 g008
Table 1. Weather stations information.
Table 1. Weather stations information.
MunicipalityStateWMO *LatitudeLongitudeAltitude (m)
BarraBA83179−11.08°−43.16°402
Bom Jesus da LapaBA83288−13.26°−43.41°440
EspinosaMG83338−14.91°−42.80°570
FormosoMG83334−14.93°−46.25°840
JanaúbaMG83395−15.80°−43.29°516
JanuáriaMG83386−15.45°−44.00°474
JuramentoMG83452−16.77°−43.66°648
MocambinhoMG83389−15.08°−44.01°452
Monte AzulMG83388−15.16°−42.86°625
Montes ClarosMG83437−16.68°−43.84°652
ParacatuMG83479−17.24°−46.88°712
Santa Rita de CássiaBA83076−11.01°−44.51°450
* Weather stations ID according to the World Meteorological Organization (WMO).
Table 2. Summary of coefficients for Hargreaves–Samani equation calibration.
Table 2. Summary of coefficients for Hargreaves–Samani equation calibration.
ETo ModelskRsOffset
Hargreaves and Samani [16]0.17017.80
A1—General calibration0.16615.30
A2—Clusters by months—wet season0.16115.55
A2—Clusters by months—dry season0.17215.43
A3—Clusters by region—Region One0.17515.88
A3—Clusters by region—Region Two0.16315.18
A4—Clusters by region and months—Region One/wet season0.16815.79
A4—Clusters by region and months—Region One/dry season0.18417.12
A4—Clusters by region and months—Region Two/wet season0.15915.49
A4—Clusters by region and months—Region Two/dry season0.16915.08
kRs = empirical coefficient of the Hargreaves–Samani equation; offset = mean air temperature offset.
Table 3. Performance criteria of original and improved Hargreaves–Samani models for the period from 2011 to 2016 (test set).
Table 3. Performance criteria of original and improved Hargreaves–Samani models for the period from 2011 to 2016 (test set).
ETo ModelsMEMAERMSER2FTO
Hargreaves and Samani [16]0.370.760.960.531.05
A1—General calibration−0.030.670.880.530.96
A2—Clusters by months−0.030.670.870.540.96
A3—Clusters by stations−0.030.660.850.560.96
A4—Clusters by stations and months−0.030.650.840.570.96
ME = mean bias error (mm·day−1); MAE = mean absolute error (mm·day−1); RMSE = root mean square error (mm·day−1); R2 = coefficient of determination; FTO = slope of the forced to the origin regression.

Share and Cite

MDPI and ACS Style

Althoff, D.; Santos, R.A.d.; Bazame, H.C.; Cunha, F.F.d.; Filgueiras, R. Improvement of Hargreaves–Samani Reference Evapotranspiration Estimates with Local Calibration. Water 2019, 11, 2272. https://doi.org/10.3390/w11112272

AMA Style

Althoff D, Santos RAd, Bazame HC, Cunha FFd, Filgueiras R. Improvement of Hargreaves–Samani Reference Evapotranspiration Estimates with Local Calibration. Water. 2019; 11(11):2272. https://doi.org/10.3390/w11112272

Chicago/Turabian Style

Althoff, Daniel, Robson Argolo dos Santos, Helizani Couto Bazame, Fernando França da Cunha, and Roberto Filgueiras. 2019. "Improvement of Hargreaves–Samani Reference Evapotranspiration Estimates with Local Calibration" Water 11, no. 11: 2272. https://doi.org/10.3390/w11112272

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop