Forest Ecosystems

Table 2 Mean squared error (MSE) and squared bias (bias2) of the different machine learning and statistical models depending on different resampling strategies, namely, k-fold, environmental, and spatial cross-validation. ran (random forest), xgb (extreme gradient boosting), svm (support vector machine), ann (artificial neural network), glmm (generalized linear mixed models) and gwr (geographically weighted regression). 05 and 15 refer to the models trained with five and fifteen variables, respectively

From: Performance of statistical and machine learning-based methods for predicting biogeographical patterns of fungal productivity in forest ecosystems

	Environmental cv		Spatial cv		k-fold cv
	MSE	Bias²	MSE	Bias²	MSE	Bias²
ran.05	22,941	88	18,096	37	12,677	1
ran.15	19,875	33	18,356	10	12,148	2
xgb.05	21,778	178	17,433	62	13,744	1
xgb.15	28,654	148	18,473	93	13,231	1
svm.05	30,901	2556	19,930	1226	14,140	657
svm.15	22,910	1214	21,032	797	12,824	392
ann.05	28,950	5206	25,021	4511	20,128	3087
ann.15	26,815	5619	29,516	8033	24,487	5946
glmm	28,318	9528	24,460	5228	21,086	3394
gwr	28,214	9789	20,590	2553	16,078	204

Back to article page