The Austrian soil study compared machine learning algorithms like Random Forest and Support Vector Machines with process-based models such as RothC and ICBM. The research found that machine learning algorithms performed better with large datasets, while process-based models provided a better understanding of the underlying mechanisms of soil organic carbon dynamics. Combining both approaches was recommended for more accurate predictions across different scales and conditions.
The Austrian experiments included various soil management practices such as organic amendments, crop rotations, aboveground crop residue handling, mineral fertilizer application, and tillage systems. These practices were studied in long-term field experiments to evaluate their impact on litter decomposition and soil properties.
The study found that the performance of ML algorithms varied with dataset size. Generally, larger datasets led to better performance, as they provided richer information for the models to capture detailed patterns and enhance their generalization capabilities. However, the relationship between dataset size and ML performance was not always direct, as other factors like data quality and model complexity also played significant roles.