Integrated Watershed Management

Integrated Watershed Management

Evaluation of the Efficiency of Artificial Neural Network and Random Forest Models in Predicting Groundwater Quality Parameters of Yazd-Ardakan Plain

Document Type : Original Article

Authors
1 Ph.D, Combating Desertification, Department of Management Arid and Desert regions, Faculty of Natural Resources and Desert, Yazd University, Yazd, Iran
2 Associate Professor, Department of Arid and Ddesert Regions Management, Faculty of Natural Resources and Desert, Yazd University, Iran
Abstract
Extended Abstract
Introduction: Groundwater resources are one of the main natural resources that support socio-economic development of countries (Siebert et al., 2010). Management and protection of groundwater resources is of great importance in countries such as Iran, which are located in arid and semi-arid regions without surface water resources. Therefore, resource conservation, in addition to short-term and long-term planning, is essential to optimizing their productivity (Abu-Khalaf et al., 2013). These natural resources face a variety of issues that threaten their sustainability, such as the effects of climate change, human activities and natural processes (Alabjah et al., 2018; Baghvand et al., 2010; Burri et al., 2019; El Asri et al., 2019; Houemenou et al., 2020; Mountadar et al., 2018). Considering the great importance of recognizing the quality characteristics of groundwater in desert areas and the need for its proper management, predicting groundwater quality for the management and exploitation of water resources in the Yazd-Ardakan plain seems urgent. Therefore, the aim of this study was to evaluate and compare the efficiency of artificial neural network and random forest models in predicting EC, SAR, SO4- and TDS values. In this research, modeling will be done based on the relationship between environmental (auxiliary) data and groundwater quality parameters.
Materials and Methods: The study area is spread over ​​482900 ha of land, which in terms of position is 53° 08´ 36˝ and 54° 85´ 32˝ E longitude, is located in the central plateau of Iran and the central part of Yazd province. The maximum height of the area is 2677 m and the minimum height is 997 m from the sea level. To evaluate the quality of groundwater sample, 201 wells of the monitoring network of Yazd Regional Water Company in 2016, the measured parameters EC, SAR, SO4- and TDS were used. The factors of environmental (auxiliary) data in this research include geological data, land use, vegetation indices, soil salinity indices, derivation of digital elevation model, distance from mines, distance from road, distance from river, distance from residential areas, rainfall and population.
After preparing the groundwater quality parameters and environmental data, the values of EC, SAR, SO4- and TDS were predicted using artificial neural network and random forest models. In order to validate the random forest model, the cross-validation method (10-fold) was used in R statistical software. In this method, the data is divided into 10 parts. 9 parts of the data are used for modeling and the remaining part is used to validate the obtained model. MAE, RMSE and R2 statistical indices were used to evaluate the efficiency of artificial neural network and random forest models for groundwater parameters.
Results and Discussion: In this research, in simulating EC, SAR, SO4- and TDS parameters, the best structure obtained from 100 repetitions of artificial neural network learning has 2 hidden layers and 10 hidden neurons in each layer. The results of random forest sensitivity analysis show the high importance of the parameters extracted from the DEM. The use of satellite image data to investigate the groundwater quality parameters is also a convenient and cost-effective method. Furthermore, combining satellite data with DEM to investigate groundwater quality parameters and their zoning makes the results more efficient and increases its accuracy. A very important point in the analysis of parameters is the important role of parameters such as distance from the road and distance from the mines. This indicates the direct effect of man-made environmental factors on the groundwater quality of the study area. The results of modeling the EC, SAR, SO4- and TDS parameters using artificial neural network and random forest models show the appropriate accuracy of the artificial neural network model in predicting these parameters. Accordingly, the neural network model has predicted the parameters of EC, SAR, SO4- and TDS of groundwater with coefficient of determination of 0.82, 0.92, 0.92 and 0.97, respectively, while the model Stochastic forest has predicted EC, SAR, SO4- and TDS parameters of groundwater with coefficients of determination of 0.37, 0.65, 0.80 and 0.68, respectively. Based on the obtained results, it can be concluded that the artificial neural network model has a higher accuracy than the random forest model in predicting groundwater quality parameters. EC, SAR, SO4- and TDS parameters have the highest values in the north, center and southwest, and the lowest values in the southeast and southwest.
Conclusion: It can be said that the technique is highly efficient for estimating EC, SAR, SO4 and TDS parameters in Yazd-Ardakan plain as long as a proper and a sufficient number of input elements, a proper and compatible artificial neural network, and an appropriate calibration are used.
Keywords

Abu-Khalaf, N., Khayat, S. & Natsheh, B. (2013). Multivariate data analysis to identify the groundwater pollution sources in Tulkarm area/Palestine. Science and Technology, 3(4), 99-104. Doi: 10.5923/j.scit.20130304.01.
Adimalla, N. & Taloor, A. K. (2020). Hydrogeochemical investigation of groundwater quality in the hard rock terrain of South India using Geographic Information System (GIS) and groundwater quality index (GWQI) techniques. Groundwater for Sustainable Development, 10, 100288. Doi: 10.1016/j.gsd.2019.100288.
Adimalla, N., Qian, H. & Li, P. (2019). Entropy water quality index and probabilistic health risk assessment from geochemistry of groundwaters in hard rock terrain of Nanganur County, South India. Geochemistry, 125544. Doi: 10.1016/j.chemer.2019.125544.
Alabjah, B., Amraoui, F., Chibout, M. & Slimani, M. (2018). Assessment of saltwater contamination extent in the coastal aquifers of Chaouia (Morocco) using the electric recognition. Journal of Hydrology, 566, 363-376. Doi: 10.1016/j. jhydrol.2018.09.003.
Almasri, M. N. & Kaluarachchi, J. J. (2005). Modular neural networks to predict the nitrate distribution in ground water using the on-ground nitrogen loading and recharge data. Environmental Modelling & Software, 20(7), 851-871. Doi: 10.1016/j.envsoft.2004.05.001.
Arzani, H. & King, G. W. (2008). Application of remote sensing (Landsat TM data) for vegetation parameters measurement in western division of NSW. In International Grassland Congress. Hohhot, China. ID (No. 1083).
Baghvand, A., Nasrabadi, T., Bidhendi, G. N., Vosoogh, A., Karbassi, A. & Mehrdadi, N. (2010). Groundwater quality degradation of an aquifer in Iran central desert. Desalination, 260(1-3), 264-275. Doi: 10.1016/j.desal.2010.02.038.
Bilali, A. E., Taleb, A., Mazigh, N. & Mokhliss, M. (2020). Prediction of chemical water quality used for drinking purposes based on artificial neural networks. Moroccan Journal of Chemistry, 8(3), 8-3. Doi: 10.48317/IMIST.PRSM/morjchem-v8i3.19786.
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. Doi: 10.1023/A:1010933404324.
Burri, N. M., Weatherl, R., Moeck, C. & Schirmer, M. (2019). A review of threats to groundwater quality in the anthropocene. Science of the Total Environment, 684, 136-154. Doi: 10.1016/j.scitotenv.2019.05.236.
Castrillo, M. & Garcia, A. L. (2020). Estimation of high frequency nutrient concentrations from water quality surrogates using machine learning methods. Water Research, 172, 115490. Doi: 10.1016/j.watres.2020.115490.
Chou, J. S., Ho, C. C. & Hoang, H. S. (2018). Determining quality of water in reservoir using machine learning. Ecological Informatics, 44, 57-75. Doi: 10.1016/j.ecoinf.2018.01.005.
Chowdury, M. S. U., Emran, T. B., Ghosh, S., Pathak, A., Alam, M. M., Absar, N. & Hossain, M. S. (2019). IoT based real-time river water quality monitoring system. Procedia Computer Science, 155, 161-168. Doi: 10.1016/j.procs.2019.08.025.
Conrad, O., Bechtel, B., Bock, M., Dietrich, H., Fischer, E., Gerlitz, L. & Bohner, J. (2015). System for automated geoscientific analyses (SAGA) v. 2.1.4. Geoscientific Model Development, 8(7), 1991-2007. Doi: 10.5194/gmd-8-1991-2015.
Crippen, R. E. (1990). Calculating the vegetation index faster. Remote Sensing of Environment, 34(1), 71-73. Doi: 10.1016/0034-4257(90)90085-Z.
Danielsson, P. E. (1980). Euclidean distance mapping. Computer Graphics and Image Processing, 14(3), 227-248. Doi: 10.1016/0146-664X(80)90054-4.
Das, A., Das, S. S., Chowdhury, N. R., Joardar, M., Ghosh, B. & Roychowdhury, T. (2020). Quality and health risk evaluation for groundwater in Nadia district, West Bengal: An approach on its suitability for drinking and domestic purpose. Groundwater for Sustainable Development, 10, 100351. Doi: 10.1016/j.gsd.2020.100351.
Derakhshan, Z., Faramarzian, M., Miri, M. & Shokouhi, M. R. (2015). The zoning of groundwater quality for drinking purpose using scholler model and geographic information system (GIS). Journal of Community Health Research, 4(2), 138-147.
Di, Z., Chang, M. & Guo, P. (2019). Water quality evaluation of the Yangtze River in China using machine learning techniques and data monitoring on different time scales. Water, 11(2), 339. Doi: 10.3390/w11020339.
El Asri, H., Larabi, A. & Faouzi, M. (2019). Climate change projections in the Ghis-Nekkor region of Morocco and potential impact on groundwater recharge. Theoretical and Applied Climatology, 138(1), 713-727. Doi: 10.1007/s00704-019-02834-8.
El Mountassir, O., Bahir, M., Ouazar, D., Ouhamdouch, S., Chehbouni, A. & Ouarani, M. (2020). The use of GIS and water quality index to assess groundwater quality of krimat aquifer (Essaouira; Morocco). SN Applied Sciences, 2(5), 1-16. Doi: 10.1007/s42452-020-2653-z.
Fathizad, H., Ardakani, M. A. H., Sodaiezadeh, H., Kerry, R. & Taghizadeh-Mehrjardi, R. (2020). Investigation of the spatial and temporal variation of soil salinity using random forests in the central desert of Iran. Geoderma, 365, 114233. Doi: 10.1016/j.geoderma.2020.114233.
Fijani, E., Barzegar, R., Deo, R., Tziritis, E. & Skordas, K. (2019). Design and implementation of a hybrid model based on two-layer decomposition method coupled with extreme learning machines to support real-time environmental monitoring of water quality parameters. Science of the Total Environment, 648, 839-853. Doi: 10.1016/j.scitotenv.2018.08.221.
Foody, G. M., Cutler, M. E., McMorrow, J., Pelz, D., Tangki, H., Boyd, D. S. & Douglas, I. A. N. (2001). Mapping the biomass of Bornean tropical rain forest from remotely sensed data. Global Ecology and Biogeography, 10(4), 379-387. Doi: 10.1046/j.1466-822X.2001.00248.x.
Genuer, R., Poggi, J. M. & Tuleau, C. (2008). Random forests: some methodological insights. arXiv preprint arXiv:0811.3619.
Ha, H. & Stenstrom, M. K. (2003). Identification of land use with water quality data in stormwater using a neural network. Water Research, 37(17), 4222-4230. Doi: 10.1016/S0043-1354(03)00344-0.
Haykin, S. & Network, N. (2004). A comprehensive foundation. Neural Networks, 2(2004), 41.
Heib, L., Bouchaou, L., Tadoumant, S. & Reichert, B. (2020). Index-based groundwater vulnerability and water quality assessment in the arid region of Tata city (Morocco). Groundwater for Sustainable Development, 10, 100344. Doi: 10.1016/j.gsd.2020.100344.
Houemenou, H., Tweed, S., Dobigny, G., Mama, D., Alassane, A., Silmer, R., Babic, M., Ruy, S., Chaigneau, A., Gauthier, P., Socohou, A., Dossou, H.J., Badou, S. & Leblanc, M. (2020). Degradation of groundwater quality in expanding cities in West Africa. A case study of the unregulated shallow aquifer in Cotonou. Journal of Hydrology, 582, 124438. Doi: 10.1016/j.jhydrol.2019.124438.
Huete, A. R. (1988). A soil-adjusted vegetation index (SAVI). Remote Sensing of Environment, 25(3), 295-309. Doi: 10.1016/0034-4257(88)90106-X.
Jahin, H. S., Abuzaid, A. S. & Abdellatif, A. D. (2020). Using multivariate analysis to develop irrigation water quality index for surface water in Kafr El-Sheikh Governorate, Egypt. Environmental Technology & Innovation, 17, 100532. Doi: 10.1016/j.eti.2019.100532.
Jamaa, H., El Achheb, A. & Namr, K. I. (2020). Spatial variation of groundwater quality and assessment of water table fluctuations in Plio-Quaternary aquifer formations in Doukkala Plain, Morocco. Groundwater for Sustainable Development, 11, 100398. Doi: 10.1016/j.gsd.2020.100398.
Jordan, C. F. (1969). Derivation of leaf‐area index from quality of light on the forest floor. Ecology, 50(4), 663-666. Doi: 10.2307/1936256.
Khan, N. M., Rastoskuev, V. V., Sato, Y. & Shiozawa, S. (2005). Assessment of hydrosaline land degradation by using a simple approach of remote sensing indicators. Agricultural Water Management, 77(1-3), 96-109. Doi: 10.1016/j.agwat.2004.09.038.
Krupkova, L., Havrankova, K., Krejza, J., Sedlak, P. & Marek, M. V. (2019). Impact of water scarcity on spruce and beech forests. Journal of Forestry Research, 30(3), 899-909. Doi: 10.1007/s11676-018-0642-5.
Kullberg, E. G., DeJonge, K. C. & Chavez, J. L. (2017). Evaluation of thermal remote sensing indices to estimate crop evapotranspiration coefficients. Agricultural Water Management, 179, 64-73. Doi: 10.1016/j.agwat.2016.07.007.
Lu, H. & Ma, X. (2020). Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere, 249, 126169. Doi: 10.1016/j.chemosphere.2020.126169.
Major, D. J., Baret, F. & Guyot, G. (1990). A ratio vegetation index adjusted for soil brightness. International journal of Remote Sensing, 11(5), 727-740. Doi: 10.1080/01431169008955053.
Meyers, G., Kapelan, Z. & Keedwell, E. (2017). Short-term forecasting of turbidity in trunk main networks. Water Research, 124, 67-76. Doi: 10.1016/j. watres.2017.07.035.
Mountadar, S., Younsi, A., Hayani, A., Siniti, M. & Tahiri, S. (2018). Groundwater salinization process in the coastal aquifer sidi abed-ouled ghanem (province of el Jadida, Morocco). Journal of African Earth Sciences, 147, 169-177. Doi: 10.1016/j.jafrearsci.2018.06.025.
Nicolas, H. & Walter, C. (2006). Detecting salinity hazards within a semiarid context by means of combining soil and remote-sensing data. Geoderma, 134(1-2), 217-230. Doi: 10.1016/j.geoderma.2005.10.009.
Nield, S. J., Boettinger, J. L. & Ramsey, R. D. (2007). Digitally mapping gypsic and natric soil areas using Landsat ETM data. Soil Science Society of America Journal, 71(1), 245-252. Doi: 10.2136/sssaj2006-0049.
Nielsen, D. R. & Bouma, J. (1985). Soil spatial variability. proceedings of a workshop of the ISSS (Int. Society of Soil Science) and the SSSA (Soil Science Society of America), Las Vegas (USA), 30 Nov-1 Dec 1984.
Pettorelli, N., Vik, J. O., Mysterud, A., Gaillard, J. M., Tucker, C. J. & Stenseth, N. C. (2005). Using the satellite-derived NDVI to assess ecological responses to environmental change. Trends in Ecology & Evolution, 20(9), 503-510. Doi: 10.1016/j.tree.2005.05.011.
Pijanowski, B. C., Brown, D. G., Shellito, B. A. & Manik, G. A. (2002). Using neural networks and GIS to forecast land use changes: a land transformation model. Computers, Environment and Urban Systems, 26(6), 553-575. Doi: 10.1016/S0198-9715(01)00015-1.
Plumb, A. P., Rowe, R. C., York, P. & Brown, M. (2005). Optimisation of the predictive ability of artificial neural network (ANN) models: a comparison of three ANN programs and four classes of training algorithm. European Journal of Pharmaceutical Sciences, 25(4-5), 395-405. Doi: 10.1016/j.ejps.2005.04.010.
Rajaee, T., Ebrahimi, H. & Nourani, V. (2019). A review of the artificial intelligence methods in groundwater level modeling. Journal of Hydrology, 572, 336-351. Doi: 10.1016/j.jhydrol.2018.12.037.
Rondeaux, G., Steven, M. & Baret, F. (1996). Optimization of soil-adjusted vegetation indices. Remote Sensing of Environment, 55(2), 95-107. Doi: 10.1016/0034-4257(95)00186-7.
Sabahattin I., Latif K., Jon, S., Puneet, S. & Graeme, L. (2013). Modeling effects of changing land use/cover on daily streamflow: An Artificial Neural Network and curve number based hybrid approach. Journal of Hydrology, 485, 103-112.
Sakai, N., Mohamad, Z. F., Nasaruddin, A., Abd Kadir, S. N., Salleh, M. S. A. M. & Sulaiman, A. H. (2018). Eco-Heart Index as a tool for community-based water quality monitoring and assessment. Ecological Indicators, 91, 38-46. Doi: 10.1016/j.ecolind.2018.03.079.
Siebert, S., Burke, J., Faures, J. M., Frenken, K., Hoogeveen, J., Doll, P. & Portmann, F. T. (2010). Groundwater use for irrigation–a global inventory. Hydrology and Earth System Sciences, 14(10), 1863-1880. Doi: 10.5194/hess-14-1863-2010.
Taloor, A. K., Pir, R. A., Adimalla, N., Ali, S., Manhas, D. S., Roy, S. & Singh, A. K. (2020). Spring water quality and discharge assessment in the Basantar watershed of Jammu Himalaya using geographic information system (GIS) and water quality Index (WQI). Groundwater for Sustainable Development, 10, 100364. Doi: 10.1016/j.gsd.2020.100364.
Tung, T. M. & Yaseen, Z. M. (2020). A survey on river water quality modelling using artificial intelligence models: 2000–2020. Journal of Hydrology, 585, 124670. Doi: 10.1016/j.jhydrol.2020.124670.
Tyagi, S., Sharma, B., Singh, P. & Dobhal, R. (2013). Water quality assessment in terms of water quality index. American Journal of Water Resources, 1, 34-38. Doi: 10.12691/ajwr-1-3-3.
Wagh, V. M., Panaskar, D. B., Muley, A. A., Mukate, S. V., Lolage, Y. P. & Aamalawar, M. L. (2016). Prediction of groundwater suitability for irrigation using artificial neural network model: a case study of Nanded tehsil, Maharashtra, India. Modeling Earth Systems and Environment, 2(4), 1-10. Doi: 10.1007/s40808-016-0250-3.
Wilson, E. H. & Sader, S. A. (2002). Detection of forest harvest type using multiple dates of Landsat TM imagery. Remote Sensing of Environment, 80(3), 385-396. Doi: 10.1016/S0034-4257(01)00318-2.
Wu, J., Zhang, Y. & Zhou, H. (2020). Groundwater chemistry and groundwater quality index incorporating health risk weighting in Dingbian County, ordos basin of northwest China. Geochemistry, 80(4), 125607. Doi: 10.1016/j.chemer.2020.125607.
Yaseen, Z. M., Sulaiman, S. O., Deo, R. C. & Chau, K. W. (2019). An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. Journal of Hydrology, 569, 387-408. Doi: 10.1016/j.jhydrol.2018.11.069.
Zhu, J. J., Yu, L. Z., Xu, T. L., Wei, X. & Yang, K. (2019). Comparison of water quality in two catchments with different forest types in the headwater region of the Hun River, Northeast China. Journal of Forestry Research, 30(2), 565-576. Doi: 10.1007/s11676-018-0688-4.

  • Receive Date 15 December 2021
  • Revise Date 08 February 2022
  • Accept Date 10 February 2022