According to the Table 1, both data sets showed statistically similar features, for example, BD and sand, silt and clay contents were 1.058-1.940 g.cm
-3, 02-68%, 02-94% and 03-85%, respectively for the training data set. FC and PWP ranged from 0.03 to 0.42 g.g
-1 and from 0.016 to 0.36g.g
-1, respectively. It is obvious that the wide range of physical characteristics in Biskra province is caused by the presence of various soil types.
Fig 3 shows that, despite having opposite polarity, both FC and PWP have a reasonably strong and smooth relationship (r>0.70) with clay and sand contents. Thus, it might be the most accurate indicator of FC and PWP. The relationships between BD, silt and OM and FC and PWP are rather slender. Additionally, it seems that BD has a bad relationship with PWP and FC. As a result, these characteristics by themselves are unlikely to be reliable indicators of FC and PWP; however, when combined with clay and sand in a multiple prediction model, they might enhance model performance. The relationship between FC and PWP seems to be solid and seamless. Because of its collinearity with clay, the sand content was eliminated. In order to predict FC and PWP, the following independent variables were selected for modeling: BD, silt, clay and OM content.
According to the results displayed in Table 2, the model 2 created using clay, silt and BD seems to perform better than the others during the training data sets. These predictor variables explained 76% and 62% of variation for FC and PWP respectively. Additionally, when combined with silt, BD seems to enhance the model’s performance, however, OM only slightly enhances the regression. Overall, FC outperformed PWP in terms of evaluation indices (R
2, RMSE and ME). For FC, the R
2 and RMSE values were 0.76 and 0.095, while for PWP were 0.62 and 0.1161. Both models included the indipendent variables silt, bulk density, BD-silt, for FC and silt- BD and quadratic clay for PWP.
As indicated in Table 3, the model 2 for FC and PWP, respectively, had the lowest level of
RMSE and the highest level of
R2. In general, a lower
RMSE and higher
R2 value are statistical indicators of the model’s strong performance. In our investigation, the
R2 and
RMSE values for the training data set, are 0.761 and 0.095, for FC, while the values for PWP are 0.639 and 0,114, respectively. For the testing data set, the R
2 and RMSE values are 0.798 and 0.080 for FC, while for the PWP these values are 0.652 and 0.105, respectively.
For forecasting soil water contents at FC and PWP, the particle size distribution (PSD), OM and BD were frequently employed as predictors
(Chakraborty et al., 2011; Wösten et al., 2001). In the present study, compared to BD and OM, particle size distribution demonstrated a stronger correlation with soil water content. Consequently, using clay, silt and BD provide a good prediction. Our results were consistent with those of
Ostavari et al. (2015)
Ghorbani et al., (2017) and
Li et al. (2019). In MLR modeling, clay, silt and BD are the variables that explain 76 and 62% of variation in FC and PWP respectivly. The observed correlations between clay, FC and PWP are consistent with those reported for other soil types
Cosby et al., (1984) and
Rab et al., (2011). In our case, OM is unlikely to have an impact on the FC and PWP predictions. This could be as a result of the clay or sand content variation being significantly greater than the OM content variation. As a result, any connection between FC and OM would have been concealed. Our findings are comparable to those obtened by
Minasny and McBratney (2002);
Khlosi et al., (2016) and
Santra et al., (2018). Rab et al., (2011) noted that Australian soils have very little organic matter (OM) and did not include it in their PTFs.
From Table 4, it can be clearly apparent that the ANN produced low RMSE and high R
2 values during the training phase when compared to the conventional method MLR
. The
RMSE values varied from 0.082 to 0.090 g.g
-1 and from 0.082 to 0.090 g.g
-1 for FC and PWP, respectively. These values were lower than those of the MLR models, which ranged from 0.038 to 0.044 g.g
-1. The
R2 values of the ANN models ranged from 0.713 to 0.879 and 0.713 to 0.843, respectively, whereas the MLR models showed the lower values with 0.715-0.788. Nonetheless, the t-statistic (P<0.05) indicates that there was no significant difference in the two methods’ performance for either FC or PWP. Similarity,
Skalová et al. (2011) found that ANN performed slighly better than MLR with limeted data.
In contrast to MLR models, ANN models were able to generate accurate predictions during the testing phase (Table 4). The
RMSE of ANN models varied from 0.002 to 0.048 and were smaller than those of the MLR models, witch ranged from 0.031 to 0.057 and from 0.035 to 0.040, respectively. The R
2 values of the ANN model ranged from 0.739 to 0.879 indicating a more accurate prediction than MLR models, for wich the values ranged from 0.647 to 0.821 and 0.692 to 0.849, respectively.
Overall, Fig 4 displays the performance of every prediction model created during the testing phase in the research area. The best-fit model for predicting FC and PWP was determined to be an ANN model with clay, silt and BD input variables. For all models, evaluation indices performed better at FC than PWP.
Haghverdi et al., (2012) demonstrated that the better performance of either of data mining methods could be related to the PTF type and database characteristics rather than to inherent supremacy of either of the data mining methods. According to the developed PTFs, it was observed that ANN models with silt, clay and BD as predictors and four neurons in hidden layer had better performance in predicting soil water content at FC and PWP.
FC = 0.146-1.131×H
1-0.207×H
2-1.14×H
3+0.292×H
4
PWP = 0.719+0.355×H
1-0.584×H
2+1.131×H
3-0.889×H
4
Where H
1 = Tanh (0.5× (-0.254-1.747×Clay-0.554×Silt-0.174×BD), H
2 = Tanh (0.5×(0.032-1.048×Clay-2.980×Silt-0.075×BD), H
3=Tanh (0.5×(-0.656+1.349×Clay+0.136´Silt-1.016×BD), H
4 = Tanh (0.5× (1.815+1.555×Clay+0.049×Silt-0.180×BD)
Where Tanh stands for hyperbolic tangent of any given number.