Skip to main content

Table 2 Mean squared error (MSE) and squared bias (bias2) of the different machine learning and statistical models depending on different resampling strategies, namely, k-fold, environmental, and spatial cross-validation. ran (random forest), xgb (extreme gradient boosting), svm (support vector machine), ann (artificial neural network), glmm (generalized linear mixed models) and gwr (geographically weighted regression). 05 and 15 refer to the models trained with five and fifteen variables, respectively

From: Performance of statistical and machine learning-based methods for predicting biogeographical patterns of fungal productivity in forest ecosystems

 

Environmental cv

Spatial cv

k-fold cv

MSE

Bias2

MSE

Bias2

MSE

Bias2

ran.05

22,941

88

18,096

37

12,677

1

ran.15

19,875

33

18,356

10

12,148

2

xgb.05

21,778

178

17,433

62

13,744

1

xgb.15

28,654

148

18,473

93

13,231

1

svm.05

30,901

2556

19,930

1226

14,140

657

svm.15

22,910

1214

21,032

797

12,824

392

ann.05

28,950

5206

25,021

4511

20,128

3087

ann.15

26,815

5619

29,516

8033

24,487

5946

glmm

28,318

9528

24,460

5228

21,086

3394

gwr

28,214

9789

20,590

2553

16,078

204