Prediction of nitrate nitrogen content in soil using machine learning
https://doi.org/10.26898/0370-8799-2021-5-11
Abstract
The possibilities and feasibility of using the Bayesian network of trust and logistic regression to predict the content of nitrate nitrogen in the 0-40 cm soil layer before sowing have been investigated. Data from long-term multifactor field experience at the Siberian Research Institute of Farming and Agricultural Chemization of SFSCA RAS for 2013-2018 were used to train the models. The experiment was established on leached chernozem in the central forest-steppe subzone in 1981 in the Novosibirsk region. Considering the characteristics of the statistical sample (observation and analysis data), the main predictors of the models affecting nitrate nitrogen content in soil were identified. The Bayesian trust network is constructed as an acyclic graph, in which the main (basic) nodes and their relationships are denoted. Network nodes are represented by qualitative and quantitative plot parameters (soil subtype, forecrop, tillage, weather conditions) with corresponding gradations (events). The network assigns a posteriori probability of events for the target node (nitrate-nitrogen content in the 0-40 cm soil layer) as a result of experts completing the conditional probability table, taking into account the analysis of empirical data. Two scenarios were analyzed to test the sustainability of the network and satisfactory results were obtained. The result of the logistic regression is the coefficients characterizing the closeness of the relationship between the dependent variable and the predictors. The coefficient of determination of the logistic regression is 0.7. This indicates that the quality of the model can be considered acceptable for forecasting. A comparative assessment of the predictive capabilities of the trained models is given. The overall proportion of correct predictions for the Bayesian confidence network is 84%, for logistic regression it is 87%.
About the Authors
V. K. KalichkinRussian Federation
Vladimir K. Kalichkin, Doctor of Science in Agriculture, Professor, Head Researcher
Krasnoobsk, Novosibirsk Region
T. A. Luzhnykh
Russian Federation
Tatyana A. Luzhnykh, Junior Researcher
PO Box 463, SFSCA RAS, Krasnoobsk, Novosibirsk Region, 630501
V. S. Riksen
Russian Federation
Vera S. Riksen, Junior Researcher
Krasnoobsk, Novosibirsk Region
N. V. Vasilyeva
Russian Federation
Nadezhda V. Vasilyeva, Candidate of Science in Biology, Senior Researcher
Krasnoobsk, Novosibirsk Region
V. A. Shpak
Russian Federation
Vladimir A. Shpak, Candidate of Science in Physics and Mathematics, Researcher
Krasnoobsk, Novosibirsk Region
References
1. Zhai Z., Fernán J., Beltran M.V., Martínez N.L. Decision support systems for agriculture 4.0: Survey and challenges. Computers and Electronics in Agriculture, 2020, vol. 170, p. 105256. DOI: 10.1016/j.compag.2020.105256.
2. Bendre M., Manthalkar R. Time series decomposition and predictive analytics using MapReduce framework. Expert Systems with Applications, 2019, vol. 116, pp. 108–120. DOI: 10.1016/j.eswa.2018.09.017.
3. Khaki S., Wang L., Archontoulis S.V. A cnn-rnn framework for crop yield prediction. Frontiers in Plant Science, 2020, vol. 10, p. 1750. DOI: 10.3389/fpls.2019.01750.
4. Benos L., Tagarakis A.C., Dolias G., Remigio B. Machine Learning in Agriculture: A Comprehensive Updated Review. Sensors, 2021, vol. 21, no. 11, p. 3758. DOI: 10.3390/s21113758.
5. Cravero A., Sepúlveda S. Use and Adaptations of Machine Learning in Big Data – Applications in Real Cases in Agriculture. Electronics, 2021, vol. 10, no. 5, p. 552. DOI: 10.3390/electronics10050552.
6. Paudel D., Boogaard H., Wit A. Janssen S., Osinga S., Pylianidis C., Athanasiadis I. Machine learning for large-scale crop yield forecasting. Agricultural Systems, 2021, vol. 187, p. 103016. DOI: 10.1016/j.agsy.2020.103016.
7. Drury B., Valverde-Rebaza J., Moura M.-F., Andrade Lopes, A. A survey of the applications of Bayesian networks in agriculture. Engineering Applications of Artificial Intelligence, 2017, vol. 65, pp. 29–42. DOI: 10.1016/j.engappai.2017.07.003.
8. Fenton N., Neil M., Lagnado D., Yet B., Constantinou A.C. How to model mutually exclusive events based on independent causal pathways in Bayesian network models. Knowledge-Based Systems, 2016, vol. 113, pp. 39–50. DOI: 10.1016/j.knosys.2016.09.12.
9. Aivazyan S.A. Bayesian approach in econometric analysis. Applied econometrics. Prikladnaya ekonometrika = Applied Econometrics, 2008, no. 1 (9), pp. 93–130. (In Russian).
10. Toropova A.V. Approaches to Diagnosing Data Consistency in Bayesian Trust Networks. Trudy SPIIRAN = SPIIRAS Proceedings, 2015, vol. 6, no. 43, pp. 156–178. (In Russian).
11. Prishchepov A.V., Pon'kina E.V., Sun Zh., Myuller D. Identification of determinants of wheat yield in Western Siberia using Bayesian networks. Prostranstvennaya ekonomika = Spatial Economics, 2019, vol. 15, no. 1, pp. 39–83. (In Russian). DOI: 10.14530/se.2019.1.039-083.
12. Zvizdojevic J., Vukotic M. Application of statistical methods in analysis of agriculture-correlation and regression analysis. Poljoprivreda i Sumarstvo, 2015, vol. 61, no. 1, pp. 309–323. DOI: 10.17707/AgricultForest.61.1.38.
13. Litvinenko N.G., Litvinenko A.G., Mamyrbaev O.Zh. Bayesian networks. Theory and practice. Almaty: Institute of Information and Computational Technologies, CS MES RK, 2020, p. 197. (In Russian).
14. Nasledov A.D. IBM SPSS Statistics 20 and AMOS. Professionalny Statistichesky analiz dannikh = Professional statistical data analysis, St. Petersburg, Piter, 2013, p. 416. (In Russian).
Review
For citations:
Kalichkin V.K., Luzhnykh T.A., Riksen V.S., Vasilyeva N.V., Shpak V.A. Prediction of nitrate nitrogen content in soil using machine learning. Siberian Herald of Agricultural Science. 2021;51(5):91-100. https://doi.org/10.26898/0370-8799-2021-5-11