Campus Units

Industrial and Manufacturing Systems Engineering, Bioeconomy Institute (BEI), Agronomy

Document Type


Publication Version

Submitted Manuscript

Publication Date


Journal or Book Title


Research Focus Area(s)

​Operations Research


This study investigates whether coupling crop modeling and machine learning (ML) improves corn yield predictions in the US Corn Belt. The main objectives are to explore whether a hybrid approach (crop modeling + ML) would result in better predictions, investigate which combinations of hybrid models provide the most accurate predictions and determine the features from the crop modeling that are most effective to be integrated with ML for corn yield prediction. Five ML models and six ensemble models have been designed to address the research question. The results suggest that adding simulation crop model variables (APSIM) as input features to ML models can make a significant difference in the performance of ML models, and it can boost ML performance by up to 29%. Furthermore, we investigated partial inclusion of APSIM features in the ML prediction models, and we found that soil and weather-related APSIM variables are most influential on the ML predictions followed by crop-related and phenology-related variables. Finally, based on feature importance measure, it has been observed that simulated APSIM average drought stress and average water table depth during the growing season are the most important APSIM inputs to ML. This result indicates that weather information alone is not sufficient, and ML models need more hydrological inputs to make improved yield predictions.


This is a pre-print of the article Shahhosseini, Mohsen, Guiping Hu, Sotirios V. Archontoulis, and Isaiah Huber. "Coupling Machine Learning and Crop Modeling Improves Crop Yield Prediction in the US Corn Belt." arXiv preprint arXiv:2008.04060 (2020). Posted with permission.

Copyright Owner

The Author(s)



File Format


Published Version