 Research
 Open Access
 Published:
Longitudinal heightdiameter curves for Norway spruce, Scots pine and silver birch in Norway based on shape constraint additive regression models
Forest Ecosystems volume 5, Article number: 9 (2018)
Abstract
Background
Generalized heightdiameter curves based on a reparameterized version of the Korf function for Norway spruce (Picea abies (L.) Karst.), Scots pine (Pinus sylvestris L.) and silver birch (Betula pendula Roth) in Norway are presented. The Norwegian National Forest Inventory (NFI) is used as data base for estimating the model parameters. The derived models are developed to enable spatially explicit and site sensitive tree height imputation in forest inventories as well as future tree height predictions in growth and yield scenario simulations.
Methods
Generalized additive mixed models (gamm) are employed to detect and quantify potentially nonlinear effects of predictor variables. In doing so the quadratic mean diameter serves as longitudinal covariate since stand age, as measured in the NFI, shows only a weak correlation with a stands developmental status in Norwegian forests. Additionally the models can be locally calibrated by predicting random effects if measured heightdiameter pairs are available. Based on the model selection of nonconstraint models, shape constraint additive models (scam) were fit to incorporate expert knowledge and intrinsic relationships by enforcing certain effect patterns like monotonicity.
Results
Model comparisons demonstrate that the shape constraints lead to only marginal differences in statistical characteristics but ensure reasonable model predictions. Under constant constraints the developed models predict increasing tree heights with decreasing altitude, increasing soil depth and increasing competition pressure of a tree. A twodimensional spatially structured effect of UTMcoordinates accounts for the potential effects of large scale spatially correlated covariates, which were not at our disposal. The main result of modelling the spatially structured effect is lower tree height prediction for coastal sites and with increasing latitude. The quadratic mean diameter affects both the level and the slope of the heightdiameter curve and both effects are positive.
Conclusions
In this investigation it is assumed that model effects in additive modelling of heightdiameter curves which are unfeasible and too wiggly from an expert point of view are a result of quantitatively or qualitatively limited data bases. However, this problem can be regarded not to be specific to our investigation but more general since growth and yield data that are balanced over the whole data range with respect to all combinations of predictor variables are exceptional cases. Hence, scam may provide methodological improvements in several applications by combining the flexibility of additive models with expert knowledge.
Background
The prediction of tree height is of central importance, not only for the calculation of growing stock from sample inventories, but also in the prognosis of middle and longterm forest development for forest planning and in the analysis of timber supply. The estimation of single tree volume and assortment is made using speciesspecific taper functions which use tree height and the diameter at breast height (dbh) (and occasionally other stem diameters) as input parameters. Single tree volumes are then the basis for expanding total timber volume from sample forest inventories for any given evaluation and planning unit. With respect to dbh, information from fully documented experimental plots, or at least from concentric sample plots, can frequently be relied upon for tree height imputation in sample forest inventories and for generating realistic start values to initialize forest growth simulators. Measurements of tree height, however, are considerably more costly to obtain, so that often little or no data is available. If one to several height measurements are available in a stand or sample plot, heightdbh curves based on simple mixed models are applied which allow for local calibration of a mean population relationship and thus local prediction (e.g. CorralRivas et al. 2014). These models that employ exclusively diameter as predictor are purely data imputation tools and do not, for example, describe explicitly the effects of site or competition on the heightdbh relationship. Generalized height curves describe these effects (Larsen and Hann 1987; López et al. 2003; Temesgen and Gadow 2004). However, frequently the information on measured heightdbh pairs is not used for local calibration of the height predictions. A combination of both model approaches leads to generalized heightdiameter (hd) models, which can be locally and temporarily calibrated. Hence, these models are developed using either linear or nonlinear mixed models in which site, stand, competition variables but also regional units or geographic coordinates are used as covariates (Lappi 1997; Eerikäinen 2003; Calama and Montero 2004; Mehtätalo 2004; Nanos et al. 2004; Hökkä, 1997; Schmidt et al. 2011). From a more general point of view mixed models also provide a solution to the problem of correlated errors that results from grouped data structures and they quantify the variability between groups via random effects (Pinheiro and Bates 2000). This is highly relevant in hd modeling since in most forest growth and yield data bases several measurements origin from the same sample plot or trial and measurement occasion.
In forest growth simulations the actual projection of future tree heights is frequently based not on heightdbh curves but on height growth functions of dominant trees. These can be adapted for single trees using additional tree covariates like competition indices (Pretzsch 2009). However, if a longitudinal covariate, such as age or quadratic mean diameter, is used then valid future height projections can be obtained directly using the generalized hd model.
Hd models for Norway spruce (Picea abies (L.) Karst.), Scots pine (Pinus sylvestris L.) and silver birch (Betula pendula Roth) are presented in this paper, which allow an optimal height prediction for any given dbh in all of the example situations described below. This is true regardless of the number of available height measurements, as well as in those cases in which only measurements from an earlier inventory are available. Furthermore, an optimal combination of information from stand and site variables together with local height measurements is ensured. These requirements are fulfilled by using a generalized heightdiameter model which has been parameterized as a mixed model. Mixed models facilitate the local or temporal calibration of global models which have been determined using fixed covariate effects (Lappi 1997; Mehtätalo et al. 2015). As in our investigation only few causal site variables were available the covariates used are mainly proxies. In order to guarantee the highest possible accuracy of prediction in those cases for which no heightdbh pairs were available at all, complex linear predictors for the fixed effects are parameterized in generalized additive mixed models (gamm). Moreover for forecasts of future height development, it seems advantageous to describe the highest possible variance partition as a function of dynamic (i.e. timevarying) covariates through their fixed effects, because it can be assumed that the information from measured dbhheight pairs becomes less meaningful with increasing simulation period. The use of longitudinal variables, such as age or quadratic mean diameter (qmD) increases the possible applications of the models from purely data imputation to height projection in growth simulations. Finally implausible effects, resulting for certain data ranges of covariates in the gamm, are forced into plausible (as decided by experts) patterns by defining monotonicityrestrictions in shapeconstraint additive models (scam).
The developed longitudinal hd models provide solutions for the following applications throughout Norway:

Height imputation, taking into account site and tree effects for single trees in the NFI and also for initializing growth simulators, when no representative height measurements are available. For the application, a measured or estimated dbh must be available.

Mediumterm future height predictions for the analysis of timber supply, forest development scenarios and silviculture scenario simulations, taking (fixed) site and tree effects into account.

Ensuring plausible height predictions for the whole data range of covariates by applying monotonicityconstraints where necessary for the fixed model effects.

Model calibration, i.e. local adaption of height predictions and projections using heightdiameter measurements.
Data
Data from the Norwegian national forest inventory (NFI) for the period 1986–2012 were available. For all three tree species studied, steep gradients were evident in the data for the potential covariates and their combinations. This is extremely advantageous for the development of statistical models, or rather for generating generally acceptable, stable and plausible estimations of model effects (Tables 1 and 2).
Annually, ca. one fifth of the sample plots are inventoried in the NFI (interpenetrating panel design). Below the coniferous forest limit, the permanent sample plots are laid out in a systematic 3 km × 3 km grid. Since 2005, sample plots in high mountain areas above the coniferous limit and in Finnmark are measured on a 3 km × 9 km and 9 km × 9 km sampling grid, respectively. Sample plots in Finnmark below the coniferous limit are measured on a 3 km × 3 km grid as in other parts of the country. Between 1986 and 1993 concentric circular plots of 100 m^{2} (for trees with a dbh of less than 20 cm) and 250 m^{2} (for trees with a dbh greater and equal to 20 cm) were used. From 1994 on, simple circular plots with an area of 250 m^{2} were used. Over the complete inventory period only trees with a dbh of 5 cm or greater were sampled. Tree height measurements in the NFI are made with Vertex inclimeters for a subsample of trees. The subsample of trees is selected proportional to tree diameter. While the expected number of height trees per sample plot was three per species until 2004, the expected number of height trees per plot was 10 independent of tree species from 2005 onwards. Therefore, and due to the inclusion of high mountain areas and Finnmark, there was a clear increase in the number of hd value pairs with time (Fig. 1, left). The greatest numbers of trees were sampled between 200 and 250 m above sealevel (Fig. 1, right). Spruce and birch are more evenly distributed across the altitude gradient than pine. The relatively small proportion of spruce in the lower altitudes (predominantly coastal areas) and its dominance between 300 and 600 m stands out, while above 800 m birch is the most frequently occurring species.
Birch has the greatest regional range, the highest natural tree line (Fig. 1, right) and the most northerly range limit (Fig. 2, right) of the three species. Although pine also has a large regional range (Fig. 2, middle) the natural tree line lies at a much lower altitude (Fig. 1, right) and it occurs much less frequently in the provinces of Nordland, Troms and Finnmark. The data for spruce show that it has a higher natural tree line than pine (Fig. 1, right) but the northern limit of its range lies at a lower latitude (Fig. 2, left). More clearly than for the other two species, a separation of spruce into two distinct ranges can be seen, one in southeast Norway east of the main watershed divide, the other lying in the province of NordTrøndelag and parts of SørTrøndelag. The limited spruce distribution in the coastal regions of south and mid Norway is due to the fact that spruce is not part of the potential natural vegetation in these regions. In total there are 68,426 spruce, 50,852 pine and 59,112 birch hd data pairs and respective covariate vectors available (Tables 1 and 2).
Methods
Model development was a multistep process. In a first step, gamm were parameterized in order to identify covariates with significant effects and to test model effects for nonlinearity. Based on this unrestricted model selection the model effects were tested for plausibility. If necessary, conditions such as monotonicity were specified and scam parameterized. The scam were then validated against the unconstrained models by comparing the fitting statistics standard error, explained deviance and Akaike information criterion (AIC). Because of the computational intensity, a direct parameterization of shape constraint additive mixed models (scamm) was only possible for small datasets given the available computing facilities. To develop models which can be locally calibrated, generalized linear mixed models (glmm) are therefore parameterized in which, on the basis of the scam predictions, conditional expectation values are entered as “a priori” information. The specification as a mixed model enables the partitioning of the total variance on different levels, and thereby, the calibration of a mean population model using additional hd measurements (Mehtätalo et al. 2015). Moreover the mixed model approach accounts for the grouped structure of the used NFIdata and the related correlated errors. The integration of the qmD as a covariate gives the models their longitudinal character and, consequently, the shifting hd relationship with time can be described as a function of the developmental stage of a stand. Even if the shift in the hd relationship should not be confused with incremental height growth, the approach opens up the possibility of sitesensitive height projections in growth simulations.
The choice of the basic model, or rather, of the specific heightdiameter function, is crucial for the longitudinal hd model that is developed from it. Here, a special form of the Korffunction developed by Lappi (1997) is used, which is distinguished by the biological interpretability and comparatively low correlation of its parameters. These qualities are particularly advantageous when, as is the case here, the parameters, and thereby the realisation, of height curves are to be described as a function of site, stand, and single tree variables. Mehtätalo (2004, 2005) built on the work of Lappi (1997) and adapted and validated the model for spruce, pine and birch in Finland, a country with partly similar growth conditions. Additionally it is very important that the model is linear which enables the estimation of site, stand and tree effects and their validation for nonlinearity in a onestep procedure using gamm. Finally in our application the modified Korffunction showed an adequate flexibility which is illustrated in the results chapter. The basic version of the Korffunction used here (Eq. 1) is an alternative to the more frequently used variant, in which breast height (1.3 m) is subtracted from the tree height. In order to prevent the expected height values from taking on the value “zero” when the dbh is very small, Lappi added a small constant λ to the dbh, where dbh + λ can be interpreted as the diameter at ground height. Lappi (1997) then reparameterized the function (Eq. 2) because the expected values and the standard error of the “linear” parameters A and B are strongly correlated and the trend of B with age is difficult to interpret. This reparameterization, on the basis of expected values of the logarithmic tree height for trees with dbh of 30 and 10 cm (Eqs. 2 and 2.1), yields biologically meaningful parameters, as well as a clear reduction in correlation. Parameter A can then be interpreted as the expected value of the natural logarithm [ln(.)] of the height of a tree with dbh = 30 cm, while parameter B is the difference between the expected values of ln(tree height) between trees with dbh = 30 cm and dbh = 10 cm of the respective tree species. The parameters A, B, C and λ are referred to in this paper as first order parameters (of the Korffunction) in order to distinguish them from the second order parameters which describe the effects of site, stand, and single tree variables that are integrated into the model later.
with:
h_{ kti }: height of tree i at time of inventory t at sample plot k;
dbh_{ kti }: dbh of tree i at time of inventory t at sample plot k;
x_{ kti }: reparameterized dbh of tree i at inventory date t at sample plot k;
A_{ kt }, B_{ kt }, C, λ: first order parameters of the heightdiameter model at time of inventory t at sample plot k;
ln(.): natural logarithm.
In keeping with Lappi (1997), the function (Eq. 2) is subsequently linearized by iteratively determining the combination of λ and C for which the corresponding model has the lowest AIC. Differences in the underlying data result, at this point, in a fundamental difference to the approach of Lappi (1997) and Mehtätalo (2004). Lappi (1997) used experimental plots and Mehtätalo (2004) a subsample of the Finnish NFI which, because of the large or least sufficient number of hd value pairs, allow an ordinary least squares estimate of separate hd curves for each plot and inventory date. From these individual parameterizations Lappi (1997) derived not only the optimal parameter combination of λ and C, but also the age trends for the parameters A and B. In contrast, the choice of optimal combinations of λ and C in this study was made using a glmm based on the reparameterized Korffunction (Eq. 2) because the number of measurements per plot in the Norwegian NFI rarely allowed fitting of stable, separate plotspecific models. This glmm includes plotlevel random effects with mean 0 and constant variance for the parameters A and B (Eq. 2.2). Moreover during model development it turned out that the variance of random effects for an inventory date level nested within plot level was extremely low and almost zero for all 3 tree species. Hence all further model selection was restricted to plot level mixed models.
h_{ kti } ~ Gamma(μ,ν) with dispersion parameter ϕ = 1/ν = σ^{2}.
with additionally:
A, B: Fixed effects for the first order parameters of the heightdiameter model (reparameterized Korffunction);
α_{ k }, β_{ k }: Random effects for sample plot k with the vector of random parameters b_{ k } = (α_{ k }, β_{ k })′~N(0, D) and D denoting the corresponding variancecovariance matrix.
In this study all models are paramaterized as glmm or gamm with loglink function and Gamma as distribution assumption. By employing the Gamma distribution we assume a constant coefficient of variation σ with [Var(h_{ kti })]^{1/2} = σ E(h_{ kti }) and Var(h_{ kti }) = ϕ [E(h_{ kti })]^{2}. This corresponds to a loglinear model, but, using generalized models no transformation bias occurs when the prediction is backtransformed. We show in the results chapter that assuming a Gamma distribution leads to a sufficient variance stabilization in our case and in contrast to Lappi (1997) and Mehtätalo (2004, 2005) we did not model the residual variance explicitly. However, the ongoing development of approaches like gam for location and scale (Wood et al. 2016) will allow for a more flexible variance modelling in the future.
The iterative search for the parameters λ and C for spruce, pine, and birch, with respectively 5613, 5219, and 7606 sample plots, proved to be too computationally intensive given the available computing facilities. Instead 20 samples, each containing 500 sample plots, where drawn from the dataset and models with different combinations of λ and C were parameterized (Eq. 2.2). Based on the optimal values determined by Mehtätalo (2004, 2005) for spruce (λ = 7, C = 1.564) and birch (λ = 6, C = 1.809), the value for λ was varied between 3 and 20 (in increments of 1), the value for C was varied between 0.3 and 2.5 (in increments of 0.1) and all of the resulting combinations tested. The AIC values of the resulting 20 models for each parameter combination were then averaged and the optimal parameter combination determined using the lowest average AIC value.
In contrast to Lappi (1997), further model selection in this study followed in a onestep procedure with the help of gamm, without the effects of the longitudinal covariate (age or qmD) on the first order parameters A and B being first approximated. Lappi (1997), on the other hand, assumed that the effects of further covariates would be linear and affect the before approximated age effects. In this study, all further covariate effects are estimated simultaneously with the effect of the longitudinal covariate qmD, whereby, because of the loglink function, the effects act multiplicative exponential on tree height (Eq. 3). Model effects on the first order parameter A are indicated by f_{ 1a }…f_{ na } or f_{ sp a } the latter one indicating a structured spatial effect. Terms affecting the first order parameter B are described by the varying coefficient terms f_{ 1 }....f_{ nb }. Through the simultaneous estimation of the parameters of the 2dimensional trend function f_{ sp a }(east_{ k }, north_{ k }) and the plot level random effects (α_{ k }, β_{ k }), the spatial autocorrelation is separated into a structured and an unstructured spatial effect (Brezger and Lang 2006). The first captures the largescale autocorrelation, while the second describes the small scale correlation within sample plots. The models were fit using software default values (R package mgcv, Wood 2006) for the spline basis dimensions of k = 10 for the 1dimensional and k = 30 for the 2dimensional splines.
h_{ kti } ~ Gamma(μ, ν) with dispersion parameter ϕ = 1/ν = σ^{2}.
with additionally:
x_{ 1 }…x_{ n }: Covariates with 1dimensional effects on the hd relationship;
east_{ k }, north_{ k }: Easting and northing of sample plot k (UTMcoordinates);
f_{ 1a }(x_{ 1 })…f_{ na }(x_{ n }): 1dimensional penalised regression Psplines describing the level of the hd relationship (first order Parameter A);
f_{ 1b }(x_{ 1 })…f_{ nb }(x_{ n }): 1dimensional penalised regression Pspline describing the slope of the hd relationship (first order Parameter B);
f_{ sp a }: 2dimensional isotropic penalised thinplate regression spline capturing the structured spatial effect on the level of the hd relationship (first order Parameter A).
The estimated model parameters were checked for logical validity. If deemed necessary, monotonicity constraints were defined to enforce plausible patterns. A scamm describing the hd relationship under conditions of monotonicity for all 1dimensional effects can be written as follows, with all monotonic model effects denoted by m instead of f:
h_{ kti } ~ Gamma(μ, ν) with dispersion parameter ϕ = 1/ν = σ^{2}.
with differing to Eq. 3:
m_{ 1a }(x_{ 1 })…m_{ na }(x_{ n }): 1dimensional monotonic penalised regression Psplines describing the level of the hd relationship (first order Parameter A);
m_{ 1b }(x_{ 1 })…m_{ nb }(x_{ n }): 1dimensional monotonic penalised regression Psplines describing the slope of the hd relationship (first order Parameter B).
Due to the extensive dataset, with many thousand sample plots, a parallelization is necessary. The parameterization of all plotlevel gamm was made using the R (R Core Team 2016) package mgcv (Wood 2004, 2006, 2011), which can handle parallel calculations. The investigations concerning 2level gamm with additional inventory date level were conducted by combining functions from packages mgcv, nlme (Pinheiro et al. 2013) and MASS (Venables and Ripley 2002). The parameterization of the scam was done using the R package scam (Pya 2015), which is based on the mgcv library and also allows a parameterization of scamm. Parallel computing has not been supported by the scam package up to now, so in this study a 2 step procedure is used. In a first step scam were parameterized (Eq. 5) whose estimates of conditional expected values of ln(tree height) were the only covariate in subsequent glmm (Eq. 6). The resulting glmm makes possible a local calibration using heightdbh measurements, by which the pattern of the fixed shape constrained model effects remain the same. Because the glmm builds on the scam, it will henceforth be labeled as scam_m.
h_{ kti } ~ Gamma(μ, ν) with dispersion parameter ϕ = 1/ν = σ^{2}.
with additionally:
\( {\widehat{\mathrm{In}\left(\mathrm{E}\left[{h}_{kti}\right]\right)}}_{scam} \): Prediction of ln(tree height) using scam (Eq. 5) of tree i at inventory date t and sample plot k.
Results and discussion
Within the studied parameter boundaries the optimal combination for spruce was λ = 20 and C = 2.5, for pine λ = 19 and C = 2.5 and for birch λ = 16 and C = 2.4 (Fig. 3). For each of the 3 tree species studied several different parameter combinations resulted in AIC values near the minimum. The optimal values were, depending on species, for one or both of the parameters near the upper boundary of the studied parameter ranges. It can, therefore, be assumed that the true optima lie at higher values of λ and C. However, further improvements would be marginal as can be seen from the development of the AIC values within the studied parameter boundaries (Fig. 3). There were relatively large differences between the optimal parameter combinations and those determined by Mehtätalo (2004, 2005), although those optima would also lead to relatively low AIC values if applied to the NFI data (Fig. 3). For pine, Mehtätalo (2005) modelled parameter C dependent on qmD, so that in this case there were no constant valuepairs available for comparison.
In the course of model selection of the unrestricted gamm, quadratic mean diameter qmD, the competition index BAL (basal area larger; the sum of basal areas of all trees larger than the reference tree), altitude Alt, soil depth SD, as well as regional location (easting, northing) were all selected as covariates with a significant effect on the first order parameter A. Only qmD showed an additional significant effect on the slope of the hd relationship, the first order parameter B.
h_{ kti } ~ Gamma(μ, ν) with dispersion parameter ϕ = 1/ν = σ^{2}.
with additionally:
qmD_{ kt }: Quadratic mean diameter of the tree species at inventory date t at sample plot k;
Alt_{ k }: Altitude of sample plot k;
BAL_{ kti }: Basal area larger (sum of the basal area of all trees larger than the reference tree) of tree i at inventory date t at sample plot k;
SD_{ k }: Soil depth category of sample plot k: I (0–25 cm), II (25–50 cm), III (50–100 cm), IV (> 100 cm);
east_{ k }, north_{ k }: Easting and northing of sample plot k (UTMcoordinates);
f_{ 1a }(.)…f_{ 3a }(.): 1dimensional penalised regression Psplines describing the level of the hd relationship (first order Parameter A);
f_{ 1b }(.): 1dimensional penalised regression Pspline describing the slope of the hd relationship (first order Parameter B);
ϕ_{ SD }: Vector of the regression coefficients for soil depth categories;
f_{ sp a }: 2dimensional isotropic penalised thinplate regression spline capturing the structured spatial effect on the level of the hd relationship (first order Parameter A);
α_{ k }, β_{ k }: Random effects for sample plot k with the vector of random parameters b_{ k } = (α_{ k }, β_{ k })′~N(0, D) with D denoting the corresponding variancecovariance matrix.
Two level mixed models were excluded from the further process of model selection since the estimated variances of random effects for an additional inventory date level nested within plot level for all 3 tree species were extremely low and hence irrelevant. For all three tree species the 1dimensional effects of all continuous covariates on the first order parameters A and B were more or less nonlinear whereas the effects of BAL showed only minor deviations from linearity (Fig. 4). Based on expert knowledge the flexibility of the 1dimensional splines was validated as sufficient and the default spline basis dimension of k = 10 was not increased. In modeling using scam an assessment of the unrestricted model effects is part of the model building process, as a decision must be made as to what degree plausible model effects could be forced by imposing restrictions. Since hd curves are fitted the model effects have to be validated with regard to their effects on tree height as a surplus to the effects on diameter growth.
For all three tree species the effect of qmD on the first order parameter A decreases in the range of large values (Fig. 4). This was seen as unfeasible since for spruce from a qmD of ca. 25 cm and for pine and birch from a qmD of ca. 40 cm onwards, a decreasing level of the height curve with increasing qmD would be predicted (Fig. 4). It is assumed that the cause of this frequently occurring pattern is that the share of unfavourable sites is much higher in stands in advanced development stages than in younger stands, because such sites have on average poorer access, lower management intensity and a lower timber felling rate. It can also be assumed that, because of lower tree heights, unfavourable sites are less vulnerable to storm damage and that their share will therefore increase with advancing stand development stage.
The effects of BAL, Alt and the different categories of SD on the first order parameters seem plausible. All three species show monotone decreasing effects with increasing Alt (Fig. 4). Under Norwegian growth conditions it can be assumed that Alt is primarily a proxy variable for temperature, which decreases with increasing Alt. Precipitation, which increases with increasing Alt, is not able to fully compensate for the limiting factor of the temperaturesum. The weak gradient between 0 and 150 m Alt for all three tree species seems plausible, because it can be assumed that the growth conditions at these altitudes are relatively uniform, if all other influence factors are constant.
All three tree species show increasing effects with increasing soil depth SD (Fig. 4), although for pine and birch between categories III and IV and for spruce between categories II, III and IV the differences were not significant. The ranking is plausible, because with increasing soil depth better conditions with respect to water regime and nutrient supply can be assumed. Also the much lower level of the SD category I sites with very shallow soils can be judged to be plausible.
The effect of BAL is uniformly monotone increasing with a, depending on the tree species, more or less weak degressive tendency, which is most pronounced for pine (Fig. 4). BAL is a simple index describing the social rank and competition pressure of a tree within a sample plot. In assessing the effect it was assumed, that with increasing BAL (or increasing competition), light would become a growthlimiting factor. Thus, with increasing BAL the relation of heightgrowth to diametergrowth shifts in favour of height growth and greater tree heights are predicted, if all other factors remain constant (Fig. 4). With decreasing BAL the social rank of a tree increases and lower tree heights are predicted, if all other variables are equal because dominant, and as extreme cases solitary trees, invest more into diameter than height growth for stability reasons. Another way of interpreting the BAL effect is to compare trees that grow under equal site conditions but under different competition. These trees will have similar heights but different dbh as a function of growing space. Hence larger hd ratios can be assumed in denser stands and for higher competition (Zhang and Burkhart 1997; Zeide and Vanderschaaf 2002; Calama and Montero 2004). The model effect of competition is in accordance with several investigations about the effects of stand density on the h/d ratio (Calama and Montero 2004). Through the choice of BAL as the competition index it is implied that the hd relationship is not influenced by the harvest or death of trees which are smaller than the reference tree.
The effect of qmD on the first order parameter B is montone increasing with an asymptotic tendency from ca. qmD 45 cm for spruce (Fig. 4). For pine this effect is nearly linear increasing, while for birch it is monotone increasing with a degressive tendency.
The spatial trendfunction f_{ sp a }(east_{ k }, north_{ k }) captured largescale correlated differences of the hd relationship which were not described by the other covariates and lead to a clear improvement of the model accuracy for all tree species (Fig. 4). Apart from the northsouth gradients of temperaturesum and length of the vegetation period, it is perhaps above all the effect of distance to the coast and the resulting site differences that are modelled by this effect. It can also be supposed that the effects of further causal factors like, for instance, large scale geological differences, are accounted for by the regional location proxy variable. No further investigation concerning optimal variance partition between spatial trendfunction and random plot effects was made at this point.
If stand age was used instead of the mean basal tree area as the longitudinal covariate (Eq. 7.1) there is a clear decrease in the model accuracy for all tree species. This is illustrated here using spruce as an example (Fig. 5).
h_{ kti } ~ Gamma(μ, ν) with dispersion parameter ϕ = 1/ν = σ^{2}.
with differing to Eq. 7:
Age_{ kt }: Non speciesspecific stand age at inventory date t at sample plot k.
The effects of stand age on the first order parameters A and B are not very sensitive and display implausible patterns (Fig. 5). In this context it must be mentioned that the specification of conditions in scam should only be applied in those cases in which the patterns of unconstrained effects seem basically plausible. The specification of conditions serves solely to suppress a too great and implausible flexibility, especially at the boundaries of the covariate data ranges. The problem of insensitive model effects and entirely implausible patterns, especially in those dataranges with many available datapoints, cannot be solved using the scam approach.
Hence, the subsequent integration of shapeconstraints in scam to ensure plausible model effects is done on the basis of the gamm, in which qmD (Eq. 7) instead of stand age (Eq. 7.1) is used as the longitudinal covariate. For this monotone increasing effects of qmD on the first order parameters A and B and a monotone increasing and concave effect of BAL on the first order parameter A were parameterized. The remaining model effects were included without shapeconstraints in the model (Eq. 8).
h_{ kti } ~ Gamma(μ, ν) with dispersion parameter ϕ = 1/ν = σ^{2}.
with differing to Eq. 7:
m_{ 1a }(qmD_{ kt }): 1dimensional penalised regression Pspline describing the effect of qmD on the level of the hd relationship;
mcc_{ 3a }(BAL_{ kti }): 1dimensional monotone increasing, and concave, penalised regression Pspline describing the effect of BAL on the level of the hd relationship;
m_{ 1b }(qmD_{ kt }): 1dimensional monotone increasing penalised regression Pspline describing the effect of qmD on the slope of the hd relationship.
In addition to the forced monotone or monotoneconcave patterns clearer, and mostly significant, contrasts of the effects of the soil depth categories for all tree species occur as a sideeffect of this process. Now, solely the soil depth categories III and IV display for spruce no significant differences. The clearer contrasts in the effects of the soil depth categories can be seen as an indication that, with respect to the combinations of qmD and SD the data base is unbalanced. Only after monotonic restriction of the effect of qmD more distinct, significant differences in the hd relationship are depicted by the causal covariate SD. The basic pattern of the unconstrained effects of Alt and of regional location is scarcely changed by the shape constrains (Fig. 6).
The prediction accuracy of the models (standard error of tree height estimation) is only slightly influenced by the specification of shape constraints (Table 3). If only the fixed effects are taken into account (mean population model), the standard error of the gamm (Eq. 7) differs only slightly from the standard error of the scam (Eq. 8). For spruce and pine the scam standard errors are even a little lower. For the purpose of comparison, additional generalized additive models (gam) were parameterized, because the prediction accuracy of the mean population model of mixed models is normally a little lower. When compared to the scam, the prediction accuracy of the gam is only marginally higher (spruce, pine) or the same (birch).
A comparison of the prediction accuracy of the gamm and scam_m, taking both fixed and random effects into account, also shows only minimal differences. The standard errors of the gamm for spruce and pine are slightly lower, and for birch slightly higher, than those of the scam_m. Comparing the gam and scam (or gamm and scam_m) based on their explained deviance confirms again, that shape constraints only result in marginal differences. The AICvalues of the gam are also only slightly lower than those of the scam (Table 3). Because of the stepwise parameterization of the scam_m, a comparison of scam_m and gamm by means of AIC is not possible.
The standard errors of height prediction applying only fixed effects are the highest in pine followed by spruce and birch (Table 3). In comparison the reduction in standard error using the full mixed models are about 1 m for pine, 0.7 m for spruce and 0.6 m for birch. The standard errors of the mixed models are rather similar for spruce and pine and the lowest for birch. The explained deviance using only fixed effects is considerably higher for spruce compared to pine and birch whereas the values of the mixed models are similar for spruce and pine and the lowest for birch.
A comparison of variance components shows that pine has the highest interplot variability for both first order parameters A and B and birch has a higher variability in A than spruce whereas the variability in parameter B is higher for spruce than for birch (Table 4). However the plotlevel variance components estimated by Mehtätalo (2004, 2005) for the same tree species in Finland are considerably lower even if our linear predicator is more flexible. This might be a result of the much more variable growth conditions in Norway with its mountain ranges and complex coastal lines.
Based on a residual analysis the predictions of the scam_m can be validated as more or less unbiased (Fig. 7) which confirms the suitability of the modified Korffunction as basic model. Only for Scots pine a very slight overestimation is present for standardized dbh greater than or equal to 1, whereas the unsystematic deviations at the edges of the dbh ranges are assumed to be random because of very few underlying observations.
The analysis of residuals on logarithmic scale indicates that the assumption of a Gamma distribution stabilizes the variance sufficiently (Fig. 8). However, this finding is in contrast to the investigations of Lappi (1997) and Mehtätalo (2004, 2005).
Conclusions
Based on hd models for spruce, pine and birch in Norway a model comparison of unconstrained gamm and scam_m was made. For the hd models it was shown that scam_m combines the flexibility of gamm with the assurance that all model effects will be plausible. Plausible model effects can be forced by setting conditions such as monotonicity, convexity, concavity or combinations thereof. The full flexibility of additive regression models remains within the constraint conditions. As was shown in the cases studied, constrained and unconstrained effects can be combined within the same model. There were only marginal differences in the predictive accuracy of hd models which had been parameterized as gamm or scam_m. At the same time, the scam_m models are made more generally applicable, especially for predictions based on external data, by the ability to take expert knowledge into account. Because forest growth data is normally more or less unbalanced, the number of potential uses for scam_m models is large.
Abbreviations
 AIC:

Akaike information criterion
 Alt:

Altitude
 BAL:

Basal area larger
 dbh:

Diameter at breast height
 east:

UTMeasting coordinate
 gam:

Generalized additive model
 gamm:

Generalized additive mixed model
 glmm:

Generalized linear mixed model
 h:

Tree height
 NFI:

National forest inventory
 north:

UTMnorthing coordinate
 qmD:

Quadratic mean diameter
 scam:

Shape constrained additive models
 scam_m:

2step shape constrained additive mixed models
 scamm:

Shape constrained additive mixed models
 SD:

Soil depth category
References
Brezger A, Lang S (2006) Generalized structured additive regression based on Bayesian Psplines. Comput Stat Data Anal 50(4):967–991
Calama R, Montero G (2004) Interregional nonlinear heightdiameter model with random coefficients for stone pine in Spain. Can J For Res 34:150–163
CorralRivas S, ÁlvarezGonzález JG, CrecenteCampo F, CorralRivas JJ (2014) Local and generalized heightdiameter models with random parameters for mixed, unevenaged forests in Northwestern Durango, Mexico. Forest Ecosystems 1:6. https://doi.org/10.1186/2197562016
Eerikäinen K (2003) Predicting the heightdiameter pattern of planted Pinus kesiya stands in Zambia and Zimbabwe. Forest Ecol Manag 175(1–3):355–366. https://doi.org/10.1016/S03781127(02)00138X
Hökkä H (1997) Height–diameter curves with random intercepts and slopes for trees growing on drained peatlands. For Ecol Manag 97:63–72
Lappi J (1997) A longitudinal analysis of height/diameter curves. For Sci 43(4):555–570
Larsen DR, Hann DW (1987) Heightdiameter equations for seventeen tree species in southwest Oregon, vol 49. Oregon State University, College of Forestry, Forest Research Laboratory, Corvallis, p 16
López Sánchez CA, Gorgoso JJ, Castedo F, Rojo A, Rodríguez R, Álvarez González JG, Sánchez Rodríguez F (2003) A height–diameter model for Pinus radiata D. Don in Galicia (Northwest Spain). Ann Forest Sci 60:237–245
Mehtätalo L (2004) A longitudinal heightdiameter model for Norway spruce in Finland. Can J For Res 34:131–140
Mehtätalo L (2005) Heightdiameter models for Scots pine and birch in Finland. Silv Fenn 39(1):55–66
Mehtätalo L, deMiguel S, Gregoire T (2015) Modeling heightdiameter curves for prediction. Can J For Res 45(7):826–837. https://doi.org/10.1139/cjfr20150054
Nanos N, Calama R, Montero G, Gil L (2004) Geostatistical prediction of height/diameter models. For Ecol Manag 195(1–2):221–235
Pinheiro JC, Bates DM (2000) Mixedeffects models in S and Splus. Springer, Berlin Heidelberg New York
Pinheiro JC, Bates DM, DebRoy S, Sarkar D, R Development Core Team (2013) nlme: linear and nonlinear mixed effects models. R package version 3, pp 1–108 https://CRAN.Rproject.org/package=nlme. Accessed 12 May 2017
Pretzsch H (2009) Forest dynamics, growth and yield. Springer Verlag, Berlin
Pya N (2015) Scam: shape constrained additive models. R package version 1, pp 1–9 https://cran.rproject.org/web/packages/scam/index.html. Accessed 12 May 2017
R Core Team (2016) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria https://www.Rproject.org/. Accessed 12 May 2017
Schmidt M, Kiviste A, Gadow K (2011) A spatially explicit heightdiameter model for Scots pine in Estonia. Eur J For Res 130:303–315. https://doi.org/10.1007/s1034201004348
Temesgen H, Gadow K (2004) Generalized heightdiameter models–an application for major tree species in complex stands of interior British Columbia. Eur J Forest Res 123(1):45–51
Venables WN, Ripley BD (2002) Modern applied statistics with S, 4th edn. Springer, New York
Wood SN (2004) Stable and efficient multiple smoothing parameter estimation for generalized additive models. J Am Stat Assoc 99:673–686
Wood SN (2006) Generalized additive models: an introduction with R. Chapman and Hall/CRC, Florida
Wood SN (2011) Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J Royal Stat Soc (B) 73(1):3–36
Wood SN, Pya N, Säfken B (2016) Smoothing parameter and model selection for general smooth models. J Am Stat Assoc. https://doi.org/10.1080/01621459.2016.1180986
Zeide B, Vanderschaaf C (2002) The effect of density on the heightdiameter relationship, Gen Tech Rep SRS48. U.S. Department of Agriculture, Forest Service, Southern Research Station, Asheville, pp 463–466
Zhang S, Burkhart HE (1997) The influence of thinning on tree height and diameter relationships in loblolly pine plantations. South J Appl For 21:199–205
Acknowledgements
We would like to thank two anonymous reviewers for constructive comments.
Funding
This study was supported by the Norwegian Institute of Bioeconomy Research (NIBIO).
Availability of data and materials
Data are available upon request under certain constraints.
Author information
Affiliations
Contributions
The first author (MS) conducted the data analysis, model development and wrote the first draft. JB substantially contributed to the data preparation. All authors jointly discussed the results, drew conclusions and finalized the manuscript. All authors read and approved the final manuscript.
Corresponding author
Correspondence to Matthias Schmidt.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
The authors give the consent for publication.
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Received
Accepted
Published
DOI
Keywords
 Heightdiameter curve
 Norway spruce
 Scots pine
 Silver birch
 Norwegian national forest inventory
 Shape constrained additive models