
Influence of sampling intensity on performance of two-phase forest inventory using airborne laser scanning



Forest inventories have always been a primary source of information on the state of forest ecosystems. The variety of survey approaches in use reflects the many factors that must be weighed when planning a sampling scheme. Paramount aspects include the survey goal and scale, the inherent variation and spatial patterns of the target population, and the available resources. The last factor commonly constrains the goal, and compromises have to be made. Airborne laser scanning (ALS) has been intensively tested as a cost-effective option for forest inventories. Despite these existing foundations, research has produced disparate results. Environmental conditions are among the factors that greatly influence inventory performance; therefore, the need for site-related sampling optimization is well founded. Moreover, although stands are the basic operational unit of managed forest holdings, few related studies have presented stand-level results. We therefore tested the influence of the sampling intensity on the performance of an ALS-enhanced stand-level inventory.


Distributions of possible errors were obtained by comparing ALS model estimates with reference values derived from field surveys of 3300 sample plots and more than 300 control stands located in 5 forest districts. A higher scanning density did not improve the results. The variance of the obtained errors stabilized in the interval of 200–300 sample plots, with the bias remaining within +/− 5% and the precision above 80%. The sample plot area affected the scores mostly in the transition from 100 to 200 m2; only a slight gain was observed when larger plots were used.


ALS-enhanced inventories effectively address the demand for comprehensive and detailed information on the structure of single stands over vast areas. Knowledge of the relation between the sampling intensity and accuracy of ALS estimates allows the determination of certain sampling intensity thresholds. This should be useful when matching the required sample size and accuracy with available resources. Site optimization may be necessary, as certain errors may occur due to the sampling scheme, estimator type or forest site, making these factors worth further consideration.


Forest inventory

A comprehensive description of the state and structure of woodlands is crucial information, for forest owners and decision-makers in particular. Details on the availability of current resources form the basis for sustainable planning of forest use and development. Hence, a number of requirements concerning forest management have been imposed on forest administrators. In Poland, the Forest Act (1991) is an official state document that requires the development of a forest management plan (FMP) every 10 years for each forest holding. Similar binding documents are in effect in other countries (Redmond et al. 2016). For instance, the creation of an FMP, or at least of local forest inventory data, is mandatory in the following European countries: Bosnia and Herzegovina, Bulgaria, Croatia, the Czech Republic, Estonia, France, Hungary, Latvia, Macedonia, Poland, Portugal, Romania, Slovakia, Slovenia and Switzerland (Nichiforel et al. 2018). These examples show how crucial knowledge of forest resources is.

Growing stock volume

The growing stock volume (GSV) is one of the most important stand characteristics to be determined as a result of forest inventories (Tonolli et al. 2010). The GSV is a traditional indicator of wood resources, carbon stock, management efficiency, and sustainability in the forest sector (Jung and Mui 2010; EEA 2017). GSV-related indices, e.g., biomass or carbon, must also be reported according to international agreements, e.g., in national forest inventories. Many forest inventories use a design-based approach, where survey crews are contracted to collect data by means of field measurements on sample plots, entailing the use of increasingly expensive human resources (Eurostat 2018). This approach is widely applied across many European countries (FAO 2004; McInerney et al. 2011; FMM 2012; Redmond et al. 2016; Vidal et al. 2016). Although the design-based approach may be sufficient to describe attributes of an entire population, e.g., the forest district, it may not be precise enough to sufficiently describe attributes of elements or small areas, for instance, single trees or stands, especially in areas with a low sample coverage (McRoberts et al. 2013). This issue is becoming more relevant as detailed knowledge of stand-level resources becomes increasingly required (Johnson et al. 2004; Mäkelä & Pekkarinen 2004; Kauranne et al. 2017).

Forest stand

A common definition of a forest stand is as follows: a basic operational unit that is considered to be a spatially consistent part of the forest, homogeneous in terms of species composition, age, site type, tree origin, canopy cover, etc. (Helms 1998; Koivuniemi and Korhonen 2006; Pasalodos-Tato 2010; FMM 2012; Bolton et al. 2018). In Poland, single-stand GSV estimation may be performed for two reasons: once every 10 years for long-term forest planning and 1 or 2 years prior to any cutting operations for a given stand, mostly to evaluate the potential harvest. Such estimation can be conducted by means of a total survey, an approximation based on a similar stand, a visual assessment, or the angle-count method (FMM 2012; DGLLP 2015). None of the above-listed methods appear to provide a balanced trade-off between the accuracy, precision and cost, especially in regard to surveying vast areas. Therefore, there seems to be a well-founded need for more universal solutions, which would provide objective and detailed forest inventory data at large scales (Wulder 1998; Mäkelä & Pekkarinen 2004; Mozgeris 2008; Stereńczak 2010; Tonolli et al. 2010; Holopainen et al. 2014; White et al. 2016; Kankare et al. 2017; Zygmunt et al. 2017).

Remote sensing

The potential of remote sensing (RS) techniques for forest management has been considered since the start of the twentieth century (Hugershoff 1911; Wilson 1920; Gieruszyński 1948). RS techniques are capable of providing continuous information over vast and hardly accessible sites. Nevertheless, the level of technology at that time and awareness of needs versus possibilities in forest communities did not facilitate either research or its practical implementation. The spread of computers and increased computational capacity have led to notable progress in the development of methods related to forest inventories. Aerial images (Bolduc et al. 1999), satellite data (Tomppo 1991) and later laser scanning (Næsset 1997; Næsset et al. 2004) have been intensively tested in Scandinavia and North America, with promising results in terms of their operational implementation, as is the case today in Nordic countries (Næsset et al. 2004; Maltamo and Packalen 2014; Vauhkonen et al. 2014; Kangas et al. 2018), Canada (Woods et al. 2011), the USA (Evans et al. 2006), New Zealand (Pont et al. 2012; Coomes et al. 2018) and Australia (Turner et al. 2011; Pont et al. 2012). In Poland, a joint project is being conducted by the state forests and scientific units to provide RS data as a mandatory part of inventories and forest management planning.

Among RS techniques, airborne laser scanning (ALS) is known to be particularly applicable for the assessment of wood resources (Even et al. 2015). By testing the accuracy of ALS for biomass estimation, Ene et al. (2013) showed that ALS-aided surveys can be an economical alternative to conventional inventories. ALS can provide a 3D representation of an entire forest district in the form of a point cloud (PC). Although such a PC might be a relatively accurate visualization of the forest structure, its quantification (e.g., the GSV) is usually determined through statistical modelling (Wulder et al. 2008; Leeuwen & Nieuwenhuis 2010; Balenović et al. 2013) instead of direct PC measurements. A common design applied to forest attribute extraction from ALS data is two-phase sampling with regression estimators or the area-based approach (Næsset & Bjerknes 2001; Næsset 2002; Köhl et al. 2006; Næsset 2014, Even et al. 2015). In the first phase of this sampling design, a complete wall-to-wall representation of the surveyed area is provided by ALS metrics (auxiliary variables). In the second phase, a limited set of ground sample plots is employed to establish a statistical relationship between the auxiliary variables and target variable (here, the GSV), such that the latter variable can be estimated over the entire area of interest, e.g., the forest district, or its part, viz. the stand.

Sampling intensity

The performance of ALS-enhanced forest inventories may depend on several factors, such as the sampling design, sample size, estimator, PC parameters, and variation in the target variable within the population (Köhl et al. 2006; Yang et al. 2019). There are many theoretical studies concerning the estimation of the required sample size, which chiefly depends on the goal and means of analysis. Recommendations for predictive multiple linear regression modelling (as used in this research) may be found in Knofczynski (2017), who applied a series of Monte Carlo simulations to artificial data to determine the minimum sample size needed to obtain regression parameters close to population parameters. He linked the required sample size to the number of predictors and correlation strength between the most reliable predictor and criterion. Similar findings were obtained by Bujang et al. (2017), who employed power analysis of real data to determine the minimum sample size for multiple linear regression. There are also certain rules of thumb for rapid sample size evaluation. Most of them are based on the principles of power analysis. With regard to regression, Voorhis and Morgan (2007) examined N > 50 observations. Harris (1985) advocated 50 + m (where m is the number of predictors) as the minimum sample size. In contrast, Green (1991) criticized the use of constant values (e.g., 100) and instead supported N > 50 + 8 m as the minimal sample size required to verify the overall fit of a regression model.
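The rules of thumb cited above are easy to evaluate for a given number of predictors. The sketch below does so for m = 3, the number of predictors in the estimator used in this study (Eq. 1); the function names are illustrative, and reading Green's N > 50 + 8 m as a strict inequality (smallest admissible integer) is an assumption.

```python
def harris_min_n(m: int) -> int:
    """Harris (1985), as cited in the text: minimum sample size of 50 + m."""
    return 50 + m


def green_min_n(m: int) -> int:
    """Green (1991), as cited in the text: N > 50 + 8m.

    Assumption: the inequality is strict, so the smallest integer N is 50 + 8m + 1.
    """
    return 50 + 8 * m + 1


m = 3  # number of predictors in Eq. 1 (X1, X2, X3)
print(harris_min_n(m))  # 53
print(green_min_n(m))   # 75
```

Both thresholds lie far below the several hundred plots examined here, which is why the empirical error distributions, rather than these a priori rules, are the focus of the study.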

A priori sample size considerations undoubtedly constitute a firm base for planning either a research project or a commercial inventory. However, the complex forest structure may not always match all the theoretical assumptions. Related issues have already been considered (Adams et al. 2011; White et al. 2013, 2017) based on empirical data published in good-practice guidelines for the generation of inventory attributes from ALS data. Despite the use of similar methods, different researchers have reported fairly different results in this regard. Notable discrepancies in the number of sample plots utilized have occurred in other studies, starting from 15 plots (Tompalski et al. 2015), up to almost 800 plots (White et al. 2014). General trends regarding the influence of the number and area of sample plots have been presented in Gobakken and Næsset (2008) and Stereńczak et al. (2018). Saarela et al. (2015) tested the effects of the estimator and sample size on the precision of GSV predictions when light detection and ranging (LiDAR) data are adopted in a two-phase sampling design, with the simulated study area resembling the structure of Finnish forests. Having tested various models, they found minor to moderate effects of this forest inventory element. Moreover, they reported that the variance in the model error remains the same regardless of the number of first-phase sample plots; however, the error variance is sensitive to the number of second-phase plots. Similar tests were performed by Fassnacht et al. (2018) to evaluate the effect of the sampling intensity on ALS-based forest biomass prediction (closely related to the GSV). Having simulated Scots pine and European beech stands, they reported little influence of the area and number of sample plots on the root mean square error (RMSE), i.e., 4–7 Mg∙ha− 1, but quite a notable effect on the bias. Ruiz et al. (2014) performed a trial to assess the influence of the sample plot area. 
Having tested plots from 100 to 3600 m2, they proposed 500 m2 as the minimum area for GSV estimation. The opposite conclusion was drawn by Watt et al. (2013), who found LiDAR models for GSV prediction to be insensitive to the plot size and PC density. Regarding ALS PCs, Montealegre et al. (2016) stated that even densities of circa 1 pulse∙m−2 seemed to generate relatively accurate GSV estimates, provided that no systematic error was introduced by an insufficient number of sample plots (Næsset et al. 2004). On the other hand, Smreček and Danihelová (2013) noted that low-density PC data should be carefully assessed. Although these examples were obtained with somewhat different methods and for various study areas, the discrepancies remain noteworthy.

Aim and scope

The earlier discrepancies provided the main foundations of this research, particularly since there are few studies concerning Scots pine-dominated stands in Europe. Moreover, if RS methods of forest inventory are to be widely applied in forestry practices, further studies should be prioritized, given their substantial impact on inventory costs. Furthermore, access to extant large datasets urged us to test the methods described in the literature. At our disposal were field measurements from 3305 sample plots and 305 control stands along with ALS PCs acquired from 5 different forest districts. Our sole target variable in this study was the GSV. Of course, many other variables can be determined as a result of forest inventories, e.g., the tree/stand height, diameter at breast height (DBH), basal area, tree density, and species. Many of these variables can be directly measured using RS data, unlike the GSV, which is usually a product of a statistical model incorporating one or more of the above-listed variables. Due to this complexity, estimation of the GSV requires higher sampling intensity, thus making this variable a robust study case. Moreover, the GSV is a universal index of the state of forest ecosystems (Jung and Mui 2010; EEA 2017). It is also a commonly reported index to both state and international environmental agencies.

Given the above, this study aimed to test the influence of the sampling intensity (i.e., the number and area of sample plots, as well as the ALS PC density) on the performance of the two-phase forest stand inventory using an ALS regression estimator.


Study areas

The following forest districts served as investigation areas: Milicz, Suprasl, Katrynka, Piensk and Herby. Most of the stands were dominated by Scots pine (Pinus sylvestris). Nevertheless, the districts vary in site quality and environmental conditions, which allows a more reliable validation of the results. The Katrynka forest district is situated in a lowland region comprising coniferous sites, where Pinus sylvestris is the dominant tree species. Suprasl is located in the same region but contains more broadleaved sites. The other districts, such as Herby and Piensk, lie in a strip of the Polish highlands. Pinus sylvestris-dominated stands constitute approximately 70% of the Herby sites. A more diverse share of mixed coniferous and broadleaved sites (approximately 30%–40%) distinguishes Milicz and Piensk from the other sites. Table 1 contains a quantitative summary of the listed districts. Their locations are shown in Fig. 1.

Table 1 Quantitative description of the study areas according to the field inventory data
Fig. 1 Investigated forest districts

Field data

Two types of conventional forest inventories were established for each district: (i) circular sample plots and (ii) a total survey of the control stands. Field measurements were collected in the summer of 2015 for all objects, except Katrynka and Herby, for which surveys were conducted in 2016.

In the first design, a regular network of circular sample plots was deployed across each forest district (Fig. 1). The total number of sample plots varied between the districts (Table 1). In the field, all plots had a fixed area of 500 m2. The following properties were determined for those trees with a DBH exceeding 7 cm: species, DBH, height, age, and position. The DBH was measured with callipers, and the tree height was measured with rangefinders. The position of each tree was determined by the azimuth and distance relative to the centre point of a given sample plot. The coordinates of each centre point were recorded for 20 min utilizing global navigation satellite system (GNSS) static measurements. Having captured all of the above properties, the volume of every single tree was estimated according to equations commonly applied by the Polish state forests (Bruchwald et al. 2000). The volume of each sample plot was then determined via summation of tree volumes inside that plot and normalized to per hectare values, according to the level of factor II, i.e., the sample plot area (refer to the section: Factors). Bruchwald (1999) claimed that at the stand level, the expected mean error of those applied equations should not exceed 8% with a 95% confidence interval. Field-based GSV estimates from this type of inventory were utilized to regress the relationship between the ALS metrics and GSV.
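The per-hectare normalization described above is a simple scaling of the plot total. A minimal sketch, assuming tree volumes in m3 and the plot area in m2; the function name and the example volumes are illustrative and not taken from the study data.

```python
def plot_gsv_per_ha(tree_volumes_m3, plot_area_m2):
    """Sum single-tree volumes on a plot and scale the total to one hectare."""
    if plot_area_m2 <= 0:
        raise ValueError("plot area must be positive")
    return sum(tree_volumes_m3) * 10_000 / plot_area_m2


# Hypothetical plot: five trees measured on a 500 m2 plot
volumes = [0.5, 0.75, 1.0, 1.25, 1.5]
print(plot_gsv_per_ha(volumes, 500))  # 100.0 (m3 per ha)
```

When a smaller plot area is simulated (factor II), only the trees falling inside the smaller radius would be summed, and the same scaling would be applied with the smaller area.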

In addition to the circular sample plots, a portion of the control stands was surveyed in each forest district (Table 1, Fig. 1). As the number of control stands was considerably smaller than that of the sample plots, the former were chosen by stratified sampling, with the use of the stand type as a division criterion (Köhl et al. 2006). The approximate distribution of the various forest types was obtained from previous inventories. The reasoning behind this approach was to capture the most common forest types, given the limited resources in terms of the available number of control stands to be surveyed. The same attributes were determined for the trees within the control stands as for those measured within the sample plots. The only difference was the methodology adopted to determine the tree height. Having measured the DBH of all trees within a given control stand, the height of at least 20 trees per every major species was captured in order to cover the entire DBH range. Eventually, the height of every tree was estimated based on Näslund’s DBH-height curve (Siipilehto 2000). As the investigated control stands generally exhibited a one-layer vertical structure, no substantial error was expected from this simplification. Moreover, Bouvier et al. (2019) stressed that DBH/height measurement errors impose a minor to negligible effect upon LiDAR-based biomass estimation for even-aged pine stands. The stand boundaries were first delineated with a GNSS receiver and (if required) manually adjusted based on the canopy height model derived from the ALS data. The mean area of the control stands was approximately 1 ha. The reference variable of interest for the ith stand—GSVREFi (m3∙ha− 1)—was calculated as the sum of the single-tree volumes within the stand and normalized to a per hectare value. The data acquired from this type of inventory served as a validation set for the ALS-based GSV estimation.
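Näslund's DBH-height curve is commonly written as h = 1.3 + d^2 / (a + b∙d)^2, where d is the DBH (cm), h is the tree height (m) and 1.3 m is breast height. The sketch below only evaluates this form. The parameter values are hypothetical and were not fitted to the study data, and the exact variant applied by the authors is not stated in this section.

```python
def naslund_height(dbh_cm, a, b):
    """Evaluate the commonly used form of Naslund's height curve.

    h = 1.3 + d^2 / (a + b*d)^2, with a and b fitted per stand and species
    from the sample trees whose height was measured.
    """
    return 1.3 + dbh_cm ** 2 / (a + b * dbh_cm) ** 2


# Hypothetical parameters for one species in one control stand
a, b = 1.5, 0.2
for dbh in (10, 25, 40):
    print(dbh, round(naslund_height(dbh, a, b), 1))
```

With this form, the predicted height increases with DBH and approaches 1.3 + 1/b^2 for very large trees, which is why sample trees covering the entire DBH range are needed to fit it.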

Airborne laser scanning data

ALS PCs were acquired in August 2015 for all study areas except Katrynka and Herby, where flight missions were performed 1 year later. The PCs were acquired with a Riegl LiteMapper LMSQ680i scanner operating at frequencies ranging from 300 to 400 kHz. The average flight altitude above ground level was 550 m, and the PC tile overlap was 30%. These settings provided a PC density of approximately 12–13 pulse∙m−2. Although the ALS data did not vary much between the study areas, standardization measures were applied to minimize the influence of ALS data variation. For more details concerning the standardization routines, refer to the Factors section below.


Factors

The first factor analysed was the number of circular sample plots used to calibrate the model for GSV estimation (Eq. 1). The second factor was the area of a single sample plot. In addition to fixed-radius plots, the age-dependent radius (ADR) approach was tested. In the ADR approach, the area of a given sample plot is determined by the age of the stand in which the plot is located. The following age-area intervals were assumed: 20–40 (100 m2), 40–60 (200 m2), 60–80 (300 m2), 80–100 (400 m2), and > 100 (500 m2). The ADR approach is commonly applied by the Polish state forests (FMM 2012), as the lower variation in tree dimensions within younger stands calls into question the need for large sample plots, which are more expensive to survey.
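The age-area intervals listed above can be expressed as a small look-up. This is a sketch under two assumptions that the text does not settle: each interval includes its upper bound (so that a stand aged exactly 100 receives 400 m2, consistent with "> 100" for 500 m2), and stands younger than 20 are not assigned a plot area. The function names are illustrative.

```python
import math
from typing import Optional

# (upper age bound, plot area in m2), taken from the intervals listed in the text
ADR_CLASSES = [(40, 100), (60, 200), (80, 300), (100, 400)]


def adr_plot_area(stand_age: float) -> Optional[int]:
    """Return the ADR plot area (m2) for a stand age, or None below age 20.

    Assumption: intervals are closed at the upper bound.
    """
    if stand_age < 20:
        return None
    for upper_age, area_m2 in ADR_CLASSES:
        if stand_age <= upper_age:
            return area_m2
    return 500  # stands older than 100


def plot_radius(area_m2: float) -> float:
    """Radius (m) of a circular plot with the given area."""
    return math.sqrt(area_m2 / math.pi)


for age in (35, 70, 120):
    area = adr_plot_area(age)
    print(age, area, round(plot_radius(area), 2))
```

The radius grows from about 5.6 m for 100 m2 plots to about 12.6 m for 500 m2 plots, which gives a sense of the difference in field effort between the youngest and oldest age classes.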

The third factor analysed was the PC density (PCd). For this factor, two sets of ALS metrics were computed for all the plots and control stands: the first at a density of 7 pulse∙m−2 and the second at a density of 1 pulse∙m−2. Thinning was performed with the authors' original R script. First, the pulses were ordered by emission time. In each iteration of the thinning loop, every second pulse (by time) was removed, after which the PC density was checked. Iterations were repeated until the desired PC density was reached for a given plot. Afterwards, both sets of PCs were independently classified into ground and non-ground returns and normalized with a 1-m digital terrain model (DTM) interpolated from the corresponding ground returns. The inverse distance weighting k-nearest neighbour approach from the lidR package (Roussel et al. 2018) was adopted for DTM interpolation.
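The thinning loop described above can be sketched as follows. This is not the authors' R script, only one possible reading of the description, written in Python for illustration: pulses are represented by their emission times alone, and the stopping rule (stop as soon as the density is at or below the target) is an assumption.

```python
def thin_pulses(pulse_times, plot_area_m2, target_density):
    """Drop every second pulse (by emission time) until density <= target.

    Assumption: 'desired density reached' means at or below the target,
    so the result can fall somewhat short of the nominal value.
    """
    pulses = sorted(pulse_times)
    while len(pulses) > 1 and len(pulses) / plot_area_m2 > target_density:
        pulses = pulses[::2]  # keep the 1st, 3rd, 5th, ... pulse
    return pulses


# Hypothetical plot: 1250 pulses on 100 m2, i.e. 12.5 pulses per m2
times = [i * 0.001 for i in range(1250)]
print(len(thin_pulses(times, 100, 7)) / 100)  # 6.25
print(len(thin_pulses(times, 100, 1)) / 100)  # 0.79
```

Because each pass roughly halves the density, the reachable densities form a geometric sequence, so the nominal levels of 7 and 1 pulse∙m−2 would be matched only approximately under this reading.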

General concept

The designed methodology relied on a series of scenarios (simulations), which were unique in terms of the levels of the factors. After the levels had been set for a given scenario, two-phase sampling (Köhl et al. 2006; Miścicki and Stereńczak 2013; White et al. 2013; Næsset 2014) was applied to estimate the GSV of the single stands with the regression estimator (Eq. 1). For example, single-scenario estimates were derived from the model calibrated on 100 sample plots, each 300 m2 in size, where the ALS metrics were calculated at a PC density of 1 pulse∙m− 2. The estimated stand-level GSV values from a single scenario were then compared with reference values (GSVREF) derived from ground measurements. As sample plots can be deployed across the investigation area in an infinite number of configurations, 1000 random plot draws were performed under each scenario. Finally, the error distributions were established based on the scores received from every single draw per scenario. Figure 2 shows the above concept. All analyses and data processing steps were performed in the R programming environment (R Core Team 2016).
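The repeated-draw idea behind each scenario can be sketched on synthetic data. The example below is a strong simplification made only for illustration: it uses a one-predictor linear model instead of Eq. 1, represents each control stand by a single metric value instead of a grid of cells, and draws all numbers from an arbitrary random generator. None of the values relate to the study data.

```python
import random
import statistics


def fit_line(xs, ys):
    """Least-squares fit of y = a0 + a1 * x (a one-predictor stand-in for Eq. 1)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    a1 = sxy / sxx
    return my - a1 * mx, a1


def run_scenario(plots, stands, n_plots, n_draws, rng):
    """Return one nBIAS value (%) per random draw of n_plots calibration plots."""
    ref_mean = statistics.fmean([ref for _, ref in stands])
    scores = []
    for _ in range(n_draws):
        sample = rng.sample(plots, n_plots)  # simple random sampling without replacement
        a0, a1 = fit_line([x for x, _ in sample], [y for _, y in sample])
        residuals = [a0 + a1 * x - ref for x, ref in stands]
        scores.append(statistics.fmean(residuals) / ref_mean * 100)
    return scores


def synthetic(n, rng):
    """Synthetic (ALS metric, GSV) pairs; not real inventory data."""
    data = []
    for _ in range(n):
        x = rng.uniform(5, 30)
        data.append((x, 15 * x + rng.gauss(0, 40)))
    return data


rng = random.Random(1)
plots, stands = synthetic(600, rng), synthetic(60, rng)
for n_plots in (25, 100, 300):
    scores = run_scenario(plots, stands, n_plots, 200, rng)
    print(n_plots, round(statistics.stdev(scores), 2))
```

The printed spread of nBIAS across draws shrinks as more calibration plots are used, which is the kind of pattern the error distributions in this study are designed to reveal.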

Fig. 2 Graphical representation of the designed methodology

GSV estimation routine and error computation

The two-phase sampling design with the ALS regression estimator (i.e., the area-based approach or ABA) was applied to estimate the stand GSV. Our implementation of the ABA relied on the determination of the relationship between the dependent variable (here, the GSV) measured on the ground sample plots and LiDAR-derived sample metrics (independent/auxiliary variables), both representing the same spatial locations. Over 200 different ALS metrics calculated for a total of 3305 circular sample plots from all investigation sites were analysed to create a general form of the model (Eq. 1). First, a boosted regression trees (BRT) algorithm (Elith et al. 2008) was applied to reduce the high initial number of predictors by depicting those that were highly correlated and the most significant to the dependent variable. Next, the auto-correlated groups of predictors were eliminated by retaining the most frequently occurring ones in particular BRT iterations. With the reduced set of predictors, using stepwise regression, we developed a relatively simple model that would describe the general relationship and would not favour any of the analysed factors. Equation 1 presents the general form of the proposed estimator. In general, the model was capable of explaining approximately 70%–80% (R2 ≈ 0.7–0.8) of the GSV variance among the sample plots.

$$ \hat{y}={a}_0+{a}_1{\left({X}_1\right)}^{1.5}+{a}_2{X}_2+{a}_3{X}_3 \tag{1} $$


where \( \hat{y} \) – dependent variable, \( \hat{GSV} \) (m3∙ha−1); a0, a1, a2, a3 – regression coefficients; X1 – cubic mean of the height values from all returns over a plot; X2 – ratio of the number of first returns above the 2nd height stratum* to the number of all first returns; X3 – ratio of the number of last returns above the 10th height stratum* to the number of all last returns over a plot. X2 and X3 are akin to the metrics of Næsset (2002) and Gobakken et al. (2012).

* For each plot/cell, the range from 2 m above ground level up to the 95th height percentile was divided into 10 equal strata.
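To make the form of Eq. 1 concrete, the sketch below computes the cubic mean of return heights (X1) and evaluates the estimator for one cell. The coefficient values and the example inputs are hypothetical, chosen only so that the arithmetic is easy to follow; they are not the calibrated parameters of this study.

```python
def cubic_mean(heights):
    """Cubic mean of return heights: cube root of the mean of cubed heights."""
    return (sum(h ** 3 for h in heights) / len(heights)) ** (1 / 3)


def predict_gsv(x1, x2, x3, a0, a1, a2, a3):
    """Evaluate Eq. 1: y = a0 + a1 * X1^1.5 + a2 * X2 + a3 * X3."""
    return a0 + a1 * x1 ** 1.5 + a2 * x2 + a3 * x3


# Hypothetical coefficients and metric values for a single cell
coefficients = dict(a0=10.0, a1=2.0, a2=40.0, a3=20.0)
print(predict_gsv(x1=16.0, x2=0.5, x3=0.25, **coefficients))  # 163.0
```

The cubic mean weights tall returns more heavily than the arithmetic mean does, so X1 responds mainly to the upper canopy, while X2 and X3 describe how returns are distributed across the height strata.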

Having linked the relationship between the ALS metrics and the target variable with Eq. 1, the next step was to estimate the \( \hat{GSV} \) over the control stands under each scenario. A regular wall-to-wall grid was drawn over the boundaries of each control stand, as shown in Fig. 2, step 3. The size of a single grid cell under a particular scenario was identical to the size of the circular sample plots that were used to calibrate the model under this scenario. For instance, if under the sth scenario the circular sample plots used for model calibration had an area of 400 m2 each, the single grid element was a square with a 20-m side length. Next, the spatial extent of the edge cells was truncated to the borders of the stand in which they were located. Each cell's GSV (\( {\hat{GSV}}_c \)) was estimated based on the model predictors (X1, X2, and X3), which were computed according to the cell area (factor II), the level of the PC density (factor III) set under a given scenario, and the model parameters (a0, a1, a2, and a3) calibrated for the sample plots, which were selected in a given draw (factor I). There were 1000 independent sample draws per scenario to account for the variability of a given forest district structure (simple random sampling without replacement from the grid). No plots from outside the forest district given the specific scenario were considered in model calibration. The final GSV estimate of the ith control stand under the sth scenario and dth draw (\( {\hat{GSV}}_{isd} \)) was the mean of all \( {\hat{GSV}}_c \) values determined for that stand, weighted based on the cell area (Fig. 2, steps 3 and 4). Area weighting was performed to reduce the influence of the edge cells (the silver cells) (Næsset 1997), which often exhibit a tree structure different from that observed in the inner parts of the forest stand.
Moreover, edge cells are commonly assigned a larger estimation error, as they usually border roads, meadows, or other different structures. Whenever applied in this study, area weighting decreased the magnitude of the estimation error. The estimated \( {\hat{GSV}}_{isd} \) value was juxtaposed with the corresponding reference value—GSVREFi. Based on the obtained residuals, the errors expressed as Eqs. 2 and 3 were calculated for the dth draw and sth scenario. Having computed the errors from all the draws, their distribution under a given scenario could be easily generated, as shown in Fig. 2, step 7.
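The area-weighted stand estimate described above can be sketched in a few lines. The cell values below are hypothetical; the only elements taken from the text are the square grid cell whose area equals the calibration plot area and the weighting of each cell estimate by the cell area remaining after truncation to the stand boundary.

```python
import math


def cell_side(plot_area_m2):
    """Side length (m) of a square grid cell with the same area as the sample plots."""
    return math.sqrt(plot_area_m2)


def stand_gsv(cells):
    """Area-weighted mean of cell-level GSV estimates.

    cells: list of (gsv_estimate_m3_per_ha, cell_area_m2) pairs, where the area
    of edge cells is the part remaining inside the stand boundary.
    """
    total_area = sum(area for _, area in cells)
    return sum(gsv * area for gsv, area in cells) / total_area


print(cell_side(400))  # 20.0

# Hypothetical stand: two full 400 m2 cells and one 100 m2 edge remnant
cells = [(300.0, 400.0), (300.0, 400.0), (100.0, 100.0)]
print(round(stand_gsv(cells), 1))                         # 277.8
print(round(sum(g for g, _ in cells) / len(cells), 1))    # 233.3 (unweighted)
```

In this example, the small edge cell with a low estimate pulls the unweighted mean down by more than 40 m3∙ha−1, whereas the area-weighted mean limits its influence to its actual share of the stand.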

$$ {nRMSE}_{ds}=\sqrt{\frac{\sum_{i=1}^N{\left({\hat{GSV}}_{isd}-{GSV}_{REF i}\right)}^2}{N}}\times \frac{100\%}{{\overline{GSV}}_{REF}} \tag{2} $$
$$ {nBIAS}_{ds}=\frac{\sum_{i=1}^N\left({\hat{GSV}}_{isd}-{GSV}_{REF i}\right)}{N}\times \frac{100\%}{{\overline{GSV}}_{REF}} \tag{3} $$


\( {nRMSE}_{ds} \) – normalized root mean square error for the N control stands under the sth scenario and dth draw;

\( {nBIAS}_{ds} \) – normalized systematic error for the N control stands under the sth scenario and dth draw;

N – number of control stands in a forest district;

\( {GSV}_{REF i} \) – reference growing stock volume of the ith control stand (m3∙ha−1);

\( {\hat{GSV}}_{isd} \) – estimated growing stock volume of the ith control stand under the sth scenario and dth draw (m3∙ha−1);

\( {\overline{GSV}}_{REF} \) – mean reference growing stock volume from all the control stands in a forest district (m3∙ha−1).
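The two error measures defined above translate directly into code. A minimal sketch with hypothetical stand values; the function names are illustrative.

```python
import math


def nrmse(estimates, references):
    """Normalized RMSE (%): RMSE of the residuals divided by the mean reference GSV."""
    n = len(references)
    ref_mean = sum(references) / n
    rmse = math.sqrt(sum((e - r) ** 2 for e, r in zip(estimates, references)) / n)
    return rmse / ref_mean * 100


def nbias(estimates, references):
    """Normalized bias (%): mean residual divided by the mean reference GSV."""
    n = len(references)
    ref_mean = sum(references) / n
    mean_residual = sum(e - r for e, r in zip(estimates, references)) / n
    return mean_residual / ref_mean * 100


# Hypothetical control stands (m3 per ha): estimated and reference GSV
estimated = [310.0, 190.0, 260.0]
reference = [300.0, 200.0, 250.0]
print(round(nrmse(estimated, reference), 2))  # 4.0
print(round(nbias(estimated, reference), 2))  # 1.33
```

A positive nBIAS indicates overestimation relative to the field reference, because the residual is defined as the estimate minus the reference value.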


Common trends

Figures 3 (nBIAS) and 4 (nRMSE) reveal the general influences of all 3 factors on the performance of the two-phase ABA approach under the applied estimator. The range of the errors obtained is shown according to the levels of particular factors. The gain due to the number of sample plots is shown along the vertical axis of each graph. The gain due to their area is shown from left to right. The whiskers delineate the range where 95% of the best scores were found for a given scenario. The boxplots enclose the interquartile range of the errors, and the centreline indicates the median error. The colours denote the levels of factor III, i.e., the PC density. To ensure figure clarity, we show the results only up to the level of 600 sample plots, as we did not observe any notable improvement in performance when more sample plots were considered.

Fig. 3 nBIAS distributions of the stand-level GSV estimations across the analysed factors. Factor III – point cloud density: 1 pulse∙m−2 – grey boxplots, 7 pulse∙m−2 – blue boxplots. Objects: S – Suprasl, P – Piensk, M – Milicz, H – Herby, K – Katrynka. *ADR – age-dependent radius, ** Milicz and Herby only

The most common trend visible in Figs. 3 and 4 is the lack of significant differences among the distributions with respect to factor III, the PC density, which implies that this factor had little influence on the implemented methodology. Next, the number of sample plots (factor I) affected the error distributions more than the sample plot area (factor II). An increase in the number of sample plots improved both the precision, i.e., the range of errors obtained due to repeated application of the sampling procedure, and the accuracy of the estimates, i.e., small absolute error values. This gain was notable only up to a certain level. For instance, above the level of 300 sample plots of 200 m2, we did not observe an nBIAS beyond +/− 5% or an nRMSE above 20%. However, the GSV was generally slightly overestimated. The distributions across the sites were similar but not exactly the same. Excluding the most parsimonious scenarios, i.e., < 25 plots of 100 m2, the best results were observed for the Katrynka and Herby forest districts, whereas the largest errors occurred for Milicz and Piensk, regardless of the scenario. Moreover, the widest range of errors was observed for Milicz, which might be due to its slightly more complex stand structure compared with the other sites (≈ 40% of the broadleaved stands).

Fig. 4 nRMSE distributions of the stand-level GSV estimations across the analysed factors. Factor III – point cloud density: 1 pulse∙m−2 – grey boxplots, 7 pulse∙m−2 – blue boxplots. Objects: S – Suprasl, P – Piensk, M – Milicz, H – Herby, K – Katrynka. *ADR – age-dependent radius, ** Milicz and Herby only


In general, the nBIAS gradient was consistent across the investigated sites (Table 2, Fig. 3). A small overestimation was observed for all objects except Katrynka, where the transition from overestimation to underestimation occurred between the 100- and 300-m2 levels of the sample plot area. We did not observe any substantial change in the nBIAS distribution above the 200-m2 plot level in the other objects. Of the three factors, the number of sample plots narrowed the range of the obtained nBIAS values the most. The narrowing effect was most visible up to a level of approximately 300 sample plots, above which the gain due to this factor diminished. However, part of this effect is combinatorial: when k elements are drawn from a finite set of n elements, the number of distinct possible subsets, given by the binomial coefficient, decreases once k exceeds n/2. Large samples drawn from a finite pool of plots therefore overlap heavily, which by itself reduces the spread between draws, so the obtained results should be interpreted with a certain degree of caution. Nevertheless, as the sample size increases, strongly biased results (e.g., > 8%) should become less probable.
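The combinatorial caveat can be checked numerically. The sketch below counts the distinct samples of k plots that can be drawn without replacement from a pool of n plots; the pool size of 20 is hypothetical and far smaller than a real district, chosen only to keep the numbers readable.

```python
import math


def n_possible_samples(n_pool, k):
    """Number of distinct k-plot samples drawn without replacement from n_pool plots."""
    return math.comb(n_pool, k)


n_pool = 20  # hypothetical pool of sample plots
for k in (5, 10, 15, 19):
    print(k, n_possible_samples(n_pool, k))
# 5 15504
# 10 184756
# 15 15504
# 19 20
```

The count peaks at k = n/2 and then falls symmetrically, so scenarios that use nearly all available plots in a district can yield only a limited variety of calibration samples, regardless of how many draws are performed.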

Table 2 nBIAS (%) distributions of the stand-level GSV estimations across the analysed factors. Objects: S – Suprasl, P – Piensk, M – Milicz, H – Herby, K – Katrynka. *ADR – Age-Dependent Radius


Regarding nRMSE, the distributions (Table 3, Fig. 4) were not as coherent across the districts as those of nBIAS. Excluding certain scenarios, e.g., < 100 sample plots of 100 m2, the Katrynka forest district attained the lowest nRMSE (≤ 15%) with the narrowest range (≤ 2%), whereas Piensk had the highest nRMSE (≤ 20%) and Milicz the widest nRMSE range (≤ 5%). Again, the number of sample plots had the strongest impact on nRMSE up to a level of 300 plots, above which we did not observe nRMSE values higher than 20% in any object, provided that sample plots of at least 200 m2 were used. No remarkable change in the nRMSE distribution was observed for plot areas above 300 m2. A negligible improvement (up to 1% under most scenarios) was observed with the transition from 1 to 7 pulse∙m−2, in contrast to nBIAS, where no gain due to the PC density was observed.
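For readers who wish to compute comparable summaries, the two error measures can be sketched as follows. The definitions are an assumption, since this section does not restate them: both measures are normalised by the mean reference GSV, and a positive nBIAS denotes overestimation. The function names and the example values are hypothetical.

```python
from math import sqrt

def nbias(predicted, reference):
    """Mean error as a percentage of the mean reference value (assumed definition)."""
    n = len(reference)
    mean_ref = sum(reference) / n
    mean_err = sum(p - r for p, r in zip(predicted, reference)) / n
    return 100.0 * mean_err / mean_ref

def nrmse(predicted, reference):
    """Root mean square error as a percentage of the mean reference value (assumed definition)."""
    n = len(reference)
    mean_ref = sum(reference) / n
    mse = sum((p - r) ** 2 for p, r in zip(predicted, reference)) / n
    return 100.0 * sqrt(mse) / mean_ref

# Hypothetical stand-level GSV values in m3/ha (not data from the study).
reference_gsv = [200.0, 300.0, 400.0]
predicted_gsv = [210.0, 310.0, 410.0]
print(round(nbias(predicted_gsv, reference_gsv), 2))
print(round(nrmse(predicted_gsv, reference_gsv), 2))
```

In this toy case every stand is overestimated by the same amount, so the two measures coincide; with errors of mixed sign, nRMSE exceeds the absolute nBIAS.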

Table 3 nRMSE (%) distributions of the stand-level GSV estimations across the analysed factors. Objects: S – Suprasl, P – Piensk, M – Milicz, H – Herby, K – Katrynka. *ADR – Age-Dependent Radius



This study points to the wide benefits of applying ALS data in forest inventories. The analysed method enables the acquisition of practical information on the wood resources of single stands over vast forest areas. The reduction in sampling intensity made possible by ALS support could likely compensate for the cost of flight missions. Ene et al. (2013) stated that ALS-aided inventories can be a cost-efficient alternative to conventional approaches. Lower inventory costs could allow more frequent data updates than conventional surveys, which are conducted at intervals of a few years. Moreover, traditional design-based methods aim to assess the mean and variance of the entire population (forest district) or of specific strata (forest types) (Köhl et al. 2006; Ståhl et al. 2016), providing little or even no knowledge of the attributes of specific population elements. If detailed inventory data are needed, a full field survey may be conducted; for economic reasons, however, such a survey must be limited to particular stands or small areas. Therefore, the main improvement lies in the possibility of acquiring knowledge of every single stand or even every single tree (Maltamo et al. 2004; Packalén et al. 2008; Bergseng et al. 2015) across entire regions, including poorly accessible sites, at reasonable cost. To this end, the results of this study provide look-up tables relating the commitment (the sampling intensity) to the expected outputs in terms of the error distributions.


Our results show that a relatively low sampling intensity can be maintained, and there seems to be no point in increasing it above a certain threshold. First, increasing the PC density from 1 to 7 pulse∙m−2 brings no practical benefit if the GSV is the target variable to be assessed at the stand level. Similar conclusions have been drawn by Watt et al. (2013) and Bouvier et al. (2019). Second, practical improvement due to the sample plot area was attained only up to approximately 200–300 m2. Care is needed when using smaller plots in Scots pine-dominated stands, as the precision of the GSV estimates may decrease (Fig. 4, Table 3), even to a nRMSE value of approximately 40%, as in Næsset (1997). The ADR also seems to be a promising sampling design, as it can reduce the workload, and the errors obtained did not differ much from those of the fixed-radius scenarios.

The number of sample plots was found to be the factor with the strongest influence on the results. With 300 sample plots, it was possible to maintain nRMSE (Fig. 4) below 20% for Piensk and below 12% for Katrynka, with a minimal gain due to additional sample plots. The nRMSE value varied between the objects, which suggests that the stand structure has a significant influence on the estimation precision. At the level of 300 sample plots, the model produced systematic errors within the range of +/− 5%, indicating a slight overestimation tendency (Fig. 3). This bias may have arisen, among other reasons, from the slightly lower GSV values of the control stands compared with those of the sample plots (Table 1). Although the bias issue is known and looms large over many model-based estimators (Köhl et al. 2006), we also obtained unbiased scores. As shown in Fig. 3, quite a few draws yielded unbiased results, particularly in certain sparse scenarios, e.g., 50 plots of 200 m2. These results could be due to chance. In this study, simple random sampling was simulated on a regular network of circular sample plots in the second phase of the sampling design; fewer chance outcomes would be expected if ALS variables were used for stratification and allocation of the sample plots, as in Gobakken et al. (2013). Moreover, we would not expect any substantial change in the error distribution across the analysed factors even if the validation dataset were a perfect representation of the training sample plots.
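The effect of the number of sample plots on the spread of nBIAS can be illustrated with a small simulation. This is only a sketch under loud assumptions: the data are synthetic, a single-predictor least-squares line stands in for the model used in the study, and the grid size, coefficients and noise levels are hypothetical values chosen for illustration.

```python
import random

def fit_line(x, y):
    """Ordinary least squares fit of y = a + b * x; returns (a, b)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx
    return mean_y - b * mean_x, b

def nbias_range(n_plots, n_draws=200, seed=1):
    """Return (min, max) nBIAS in % over repeated simple random draws."""
    rng = random.Random(seed)
    # Synthetic candidate plots: x = ALS height metric (m), y = GSV (m3/ha).
    plot_x = [rng.uniform(5.0, 30.0) for _ in range(3000)]
    plot_y = [15.0 * x * (1.0 + rng.gauss(0.0, 0.2)) for x in plot_x]
    # Synthetic control stands, used only for validation.
    stand_x = [rng.uniform(5.0, 30.0) for _ in range(300)]
    stand_y = [15.0 * x * (1.0 + rng.gauss(0.0, 0.1)) for x in stand_x]
    mean_ref = sum(stand_y) / len(stand_y)
    results = []
    for _ in range(n_draws):
        # Simple random sample of plots, drawn without replacement.
        drawn = rng.sample(range(len(plot_x)), n_plots)
        a, b = fit_line([plot_x[i] for i in drawn], [plot_y[i] for i in drawn])
        mean_err = sum(a + b * x - y for x, y in zip(stand_x, stand_y)) / len(stand_x)
        results.append(100.0 * mean_err / mean_ref)
    return min(results), max(results)

for n in (25, 100, 300):
    low, high = nbias_range(n)
    print(f"{n:>3} plots: nBIAS from {low:+.1f}% to {high:+.1f}%")
```

Because the same seed rebuilds the same synthetic population for every sample size, differences in the printed ranges come only from the number of plots drawn.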

Relation to existing studies

How does our research correspond to the results published in other studies? According to Knofczynski (2017), n = 95 would be the minimum recommended sample size if the model used in this research were tested with his method, in which the sample size estimate depends on the number of predictors (3 in this case) and the correlation of the strongest predictor (X1 in Eq. 1) with the criterion (r ≈ 0.8). As a rule of thumb, Bujang et al. (2017) stated that for multiple linear regression, the minimum sample size should be 300 to derive statistics that sufficiently represent population parameters in non-experimental study designs. Following the recommendations by Voorhis and Morgan (2007), Harris (1985) and Green (1991), the minimum recommended sample size in our research would have been 51, 53 and 74, respectively. Regarding empirical studies, Gobakken et al. (2013) reported that 40 sample plots of 250 m2 allocated with the support of ALS data were enough to provide reliable GSV estimates, i.e., a nBIAS level between 2% and 6% and nRMSE ranging from 15% to 18% (average values after 300 iterations). Similar results were found by Bouvier et al. (2019), who also recommended the use of at least 40 plots (but larger than 530 m2) to obtain a robust model for pine plantation biomass estimation; however, as in our study, this led to slightly overestimated results. Junttila et al. (2010) applied non-parametric and Bayesian methods utilizing extra plots from previous missions. In that study, only 30–60 new plots of 250 m2 were sufficient to provide satisfactory GSV estimates, i.e., a nBIAS SD of 3.5% and a mean nRMSE of 21% out of 50 iterations, at a PC density below 1 pulse∙m−2. In Stereńczak et al. (2018), 100–200 plots of 500 m2 produced nBIAS and nRMSE values below +/− 5% and 20%, respectively. Fassnacht et al. (2018) bridged theoretical and empirical approaches for sampling intensity evaluation in a novel manner, by combining real and simulated data. Although biomass was the target variable in that study, the reported trends were similar to those in our study.
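Two of the rule-of-thumb figures quoted above can be reproduced with a few lines of arithmetic. The formulas below are the commonly quoted forms of the Harris (1985) and Green (1991) rules as we understand them, stated here as an assumption rather than taken from this article; the Voorhis and Morgan (2007) figure is not reproduced. The function names are hypothetical.

```python
def harris_1985(n_predictors: int) -> int:
    """Commonly quoted form: sample size exceeds the number of predictors by 50."""
    return 50 + n_predictors

def green_1991(n_predictors: int) -> int:
    """Commonly quoted form for testing the overall model: N >= 50 + 8 * m."""
    return 50 + 8 * n_predictors

# Three predictors, as in the model discussed in the text.
M_PREDICTORS = 3
print("Harris (1985):", harris_1985(M_PREDICTORS))
print("Green (1991):", green_1991(M_PREDICTORS))
```

With three predictors these forms give 53 and 74, matching the values cited in the paragraph above.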

As is evident from the above examples, practice is not far removed from theory. Many studies (including ours) have reported nRMSE values below 20% and nBIAS values up to 5% with sample sizes in line with those derived from theoretical assumptions. Research has yet to specify the acceptable intervals for these kinds of errors. The acceptable error magnitude should depend on the data quality, field specification, available resources, and, above all, the inventory goal. In the presented case, the goal was to estimate the stand-level GSV. Ideally, whether the achieved accuracy is adequate should be judged through a comparison with other methods. It is important to recognize the needs and alternative solutions. This study focused solely on the GSV, as it is closely related to carbon and biomass stocks, which currently seem to be crucial indices for many activities related to climate change mitigation. Nevertheless, ALS-based inventory methods can provide a number of other variables, e.g., the tree height, stem density, basal area, and species identification, many of which may be important to foresters. Therefore, the end-users should also have a say in this regard.


However, prior findings cannot be directly compared with ours because we investigated the error rate at the single-stand level, whereas in many other studies the models were validated using sample plots only. Ultimately, foresters are interested in stands, not merely sample plots. We regard this as a novelty in the field. Moreover, the sampling designs and estimators applied in the cited examples were not exactly the same as ours. Some limitations of our test should also be mentioned. The presented results should be considered with a degree of caution because the following error components were not isolated: (i) only one instance of the general form of the model was used (note: Saarela et al. (2015) found the type of regression model to have a moderate effect on the precision); (ii) only one aggregation function (the area-weighted mean) was used for the transition from the sample plot level to the stand level; (iii) the reference data were derived with dendrometric equations (Bruchwald 1999), which carry their own intrinsic error; (iv) only object-specific stand structures (mostly Pinus sylvestris-dominated stands) were studied; and (v) only one sampling method was applied. Therefore, the obtained distributions can only be considered expected results, although the entire dataset was large. The above-listed issues deserve to be investigated in greater detail, for example, by testing non-parametric estimators such as random forest or regression trees, especially considering the notable results obtained by Yang et al. (2019), who found random forest imputations to be efficient for small sample sizes (e.g., below 50). The stratification of the area according to the stand structure should also be investigated, as we observed its potential influence, which would perhaps be more relevant in more species-diversified forest districts.

Lastly, we assumed that the applied sampling method would not favour any analysed factor, as the draws were made at random from a very dense grid of sample plots. This also allowed us to obtain many possible outcomes. We do not recommend any sampling method in this article, although the choice of method certainly has some influence on the final performance. Addressing it would, however, dramatically expand the article and make it more convoluted. In this study, we wanted to present only general trends and possibilities. Nevertheless, accounting for these factors would probably validate the results more reliably and make them more generally applicable.
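The area-weighted mean named among the limitations above can be sketched as follows. The sketch assumes that each stand is covered by prediction cells whose areas inside the stand boundary are known; the function name and the example values are hypothetical and do not come from the study.

```python
def area_weighted_mean(values, areas):
    """Aggregate cell-level predictions to one stand-level value, weighting by area."""
    if len(values) != len(areas):
        raise ValueError("values and areas must have the same length")
    total_area = sum(areas)
    if total_area <= 0:
        raise ValueError("total area must be positive")
    return sum(v * a for v, a in zip(values, areas)) / total_area

# Hypothetical stand: three cells with predicted GSV (m3/ha) and in-stand area (m2).
cell_gsv = [250.0, 300.0, 100.0]
cell_area = [400.0, 400.0, 200.0]
print(area_weighted_mean(cell_gsv, cell_area))
```

Cells that are only partly inside the stand contribute in proportion to their in-stand area, so edge cells do not dominate the stand-level estimate.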


A considerable number of studies dedicated to the use of ALS data in aid of contemporary forest inventories have emphasized the relevance of RS techniques for environmental surveys. Knowledge of the relationship between the sampling intensity and the attainable accuracy may be relevant for decision-makers prior to survey campaigns. Having outlined this relationship, we can draw the following conclusions. Even a low scanning density such as 1 pulse∙m−2 may be sufficient for the growing stock inventory. The analysed inventory method allows the sampling intensity to be maintained at 200–300 sample plots of 100–200 m2. Negligible improvement is attained above these thresholds. Reduced inventory costs can allow more frequent data updates than conventional surveys, the data from which become outdated quickly.

In this study, simple random sampling was applied to a systematic network of plots, which implies that no prior knowledge of the surveyed object was used. However, in the case of managed forest districts, such information should usually be available. Moreover, certain draws at sampling levels below the above thresholds generated results comparable to those obtained at higher levels. This could be related to factors such as the sampling scheme and intensity, the estimator type, and their synergistic effect on the overall performance of the ALS-aided stock inventory. These aspects support the need for further sampling optimization with respect to other sampling methods and local ecosystem conditions.

Availability of data and materials

The data that support the findings of this study are available from the Forest Research Institute (Poland), but restrictions apply to the availability of these data, which were used under licence for the current study and are thus not publicly available. The data are, however, available from the authors upon reasonable request and with permission from the Forest Research Institute (Poland).



ABA: Area-Based Approach

GSV: Growing Stock Volume

ADR: Age-Dependent Radius

LiDAR: Light Detection and Ranging

ALS: Airborne Laser Scanning

nBIAS: Normalized Systematic Error

BRT: Boosted Regression Trees

nRMSE: Normalized Root Mean Square Error

DBH: Diameter at Breast Height

PC: Point Cloud

FMP: Forest Management Plan

GNSS: Global Navigation Satellite System

  • Adams T, Brack C, Farrier T, Pont D, Brownlie R (2011) So you want to use LiDAR? A guide on how to use LiDAR in forestry. N Z J Forest 55(4):19–23

  • Balenović I, Alberti G, Marjanović H (2013) Airborne laser scanning - the status and perspectives for the application in the South-East European Forestry. South-East Eur For 4(2):59–79.

  • Bergseng E, Ørka HO, Næsset E, Gobakken T (2015) Assessing forest inventory information obtained from different inventory approaches and remote sensing data sources. Ann Forest Sci 72(1):33–45

  • Bolduc P, Lowell K, Edwards G (1999) Automated estimation of localized forest volume from large-scale aerial photographs and ancillary cartographic information in a boreal forest. Int J Remote Sens 20:3611–3624.

  • Bolton DK, White JC, Wulder MA, Coops NC, Hermosilla T, Yuan X (2018) Updating stand-level forest inventories using airborne laser scanning and Landsat time series data. Int J Appl Earth Obs Geoinf.

  • Bouvier M, Durrieu S, Fournier R, Saint-Geours N, Guyon D, Grau E, De Boissieu F (2019) Influence of sampling design parameters on biomass predictions derived from airborne LiDAR data. Can J Remote Sens.

  • Bruchwald A (1999) Dendrometria. Wydawn, Warszawa ISBN:83-00-02889-7

  • Bruchwald A, Dudek A, Michalak K, Rymer-Dudzińska T, Wróblewski L, Zasada M (2000) Wzory empiryczne do określania wysokości i pierśnicowej liczby kształtu grubizny drzewa (empirical formulae for defining height and dbh shape figure of thick wood). Sylwan 10:5–13 (in Polish)

  • Bujang MA, Sa’at N, Sidik TMITAB (2017) Determination of minimum sample size requirement for multiple linear regression and analysis of covariance based on experimental and non-experimental studies. Epidemiol Biostat Public Health.

  • Coomes DA, Safka D, Shepherd J, Dalponte M, Holdaway R (2018) Airborne laser scanning of natural forests in New Zealand reveals the influences of wind on forest carbon. Forest Ecosyst 5:10.

  • DGLLP (2015) Appendix 1 of order no. 33. The State Forests National Forest Holding (in Polish)

  • EEA (2017) Forest: growing stock, increment and fellings. Accessed 15 Jun 2018

  • Elith J, Leathwick JR, Hastie T (2008) A working guide to boosted regression trees. J Anim Ecol 77(4):802–813.

  • Ene LT, Næsset E, Gobakken T, Gregoire TG, Göran S, Holm S (2013) A simulation approach for accuracy assessment of two-phase post-stratified estimation in large-area LiDAR biomass surveys. Remote Sens Environ 133:210–224.

  • Eurostat (2018) Labour cost levels by NACE Rev. 2 activity. Accessed 10 Apr 2018

  • Evans D, Roberts S, Parker R (2006) LiDAR - a new tool for forest measurements? Forest Chron.

  • FAO (2004) National forest inventory. Field manual template. Accessed 14 May 2018

  • Fassnacht FE, Latifi H, Hartig F (2018) Using synthetic data to evaluate the benefits of large field plots for forest biomass estimation with LiDAR. Remote Sens Environ.

  • FMM (2012) Forest management manual. In: Święcicki Z (ed) Instrukcja Urządzania Lasu cz. 1. Ośrodek Rozwojowo-Wdrożeniowy Lasów Państwowych w Bedoniu, Andrespol (in Polish)

  • Gieruszyński T (1948) Zastosowanie fotogrametrii przy urządzaniu gospodarstw leśnych. Wydawnictwa pomocnicze i techniczno-gospodarcze, Instytut Badawczy Leśnictwa, Seria B, Nr 16 (in Polish)

  • Gobakken T, Korhonen L, Næsset E (2013) Laser-assisted selection of field plots for an area-based forest inventory. Silv Fenn 47(5):943.

  • Gobakken T, Næsset E (2008) Assessing effects of laser point density, ground sampling intensity, and field sample plot size on biophysical stand properties derived from airborne laser scanner data. Can J For Res 38:1095–1109.

  • Gobakken T, Næsset E, Nelson R, Bollandsås OM, Gregoire TG, Ståhl G, Holm S, Ørka HO, Astrup R (2012) Estimating biomass in Hedmark County, Norway using national forest inventory field plots and airborne laser scanning. Remote Sens Environ.

  • Green SB (1991) How many subjects does it take to do a regression analysis? Multivar Behav Res 26:499–510.

  • Harris RJ (1985) A primer of multivariate statistics, 2nd edn. Academic Press, New York

  • Helms JA (1998) The dictionary of forestry. Society of American Foresters, Bethesda

  • Holopainen M, Vastaranta M, Juha H (2014) Outlook for the next generation’s precision forestry in Finland. Forests. 5:1682–1694.

  • Hugershoff R (1911) Die Photogrammetrie und ihre Bedeutung für das Forstwesen. Tharander forstliches Jahrbuch 62:123–132 (in German)

  • Johnson L, Debora & Norman JK, Hann D (2004) The importance of forest stand-level inventory to sustain multiple forest values in the presence of endangered species. Develop change. Accessed 10 Apr 2018

  • Jung SL, Mui HP (2010) Estimation of stand volume of conifer forest: a Bayesian approach based on satellite-based estimate and forest register data. Forest Sci Technol 6(1):7–17.

  • Junttila V, Kauranne T, Leppänen V (2010) Estimation of forest stand parameters from airborne laser scanning using calibrated plot databases. For Sci 56:257–270

  • Kangas A, Gobakken T, Puliti S, Hauglin M, Næsset E (2018) Value of airborne laser scanning and digital aerial photogrammetry data in forest decision making. Silv Fenn.

  • Kankare V, Ivan I, Singleton A, Horák J, Inspektor T (2017) Outlook for the single-tree-level forest inventory in Nordic countries. In: Igor I, Alex S, Jiri H, Tomas I (eds) The rise of big spatial data. Lecture notes in geoinformation and cartography. Springer, Cham.

  • Kauranne T, Pyankov S, Junttila V, Kedrov A, Tarasov A, Kuzmin A, Peuhkurinen J, Villikka M, Vartio V-M, Sirparanta S (2017) Airborne laser scanning based forest inventory: comparison of experimental results for the perm region, Russia and prior results from Finland. Forests 8:72.

  • Knofczynski TG (2017) Sample sizes for predictive regression models and their relationship to correlation coefficients. J Math Sci Math Educ 12(2) Accessed 10 Apr 2018

  • Köhl M, Magnussen SS, Marchetti M (2006) Sampling methods, remote sensing and GIS multiresource forest inventory. Trop Forest ISBN: 3540325727, 9783540325727

  • Koivuniemi J, Korhonen KT (2006) Inventory by compartments. In: Kangas A, Maltamo M (eds) Forest inventory – methodology and applications, Managing Forest ecosystems, vol 10. Springer, Dordrecht, pp 271–278

  • Leeuwen M, Nieuwenhuis M (2010) Retrieval of forest structural parameters using LIDAR remote sensing. Eur J Forest Res 129:749–770.

  • Mäkelä H, Pekkarinen A (2004) Estimation of forest stand volumes by Landsat TM imagery and stand-level field-inventory data. Forest Ecol Manag 196(2–3):245–255.

  • Maltamo M, Eerikäinen K, Pitkänen J, Hyyppä J, Vehmas M (2004) Estimation of timber volume and stem density based on scanning laser altimetry and expected tree size distribution functions. Remote Sens Environ 90(3):319–330. ISSN 0034-4257.

  • Maltamo M, Packalen P (2014) Species-specific management inventory in Finland. Forest Appl Airborne Laser Scan.

  • McInerney D, Suarez MJ, Nieuwenhuis M (2011) Extending forest inventories and monitoring programmes using remote sensing: a review. Irish Forest 68:6–22

  • Mcroberts R, Næsset E, Gobakken T (2013) Inference for lidar-assisted estimation of forest growing stock volume. Remote Sens Environ 128:268–275.

  • Miścicki S, Stereńczak K (2013) Określanie miąższości i zagęszczenia drzew w drzewostanach centralnej Polski na podstawie danych lotniczego skanowania laserowego w dwufazowej metodzie inwentaryzacji zasobów drzewnych. Leśne Prace Badawcze 74:127–136 (in Polish)

  • Montealegre A, Lamelas M, Riva J, García-Martín A, Escribano F (2016) Use of low point density ALS data to estimate stand-level structural variables in Mediterranean Aleppo pine forest. Forestry.

  • Mozgeris G (2008) Estimation and use of continuous surfaces of forest parameters: options for Lithuanian forest inventory. Baltic Forest 14(2):176–184

  • Næsset E (1997) Estimating timber volume of forest stands using airborne laser scanner data. Remote Sens Environ 61:246–253

  • Næsset E (2002) Predicting forest stand characteristics with airborne scanning laser using a practical two-stage procedure and field data. Remote Sens Environ 80(1):88–99.

  • Næsset E (2014) Area-based inventory in Norway – from innovation to an operational reality. In: Matti M, Erik N, Jari V (eds) Forestry applications of airborne laser scanning: concepts and case studies, vol 27, pp 215–240.

  • Næsset E, Bjerknes KO (2001) Estimating tree heights and number of stems in young forest stands using airborne laser scanner data. Remote Sens Environ 78:328–340

  • Næsset E, Gobakken T, Holmgren J, Hyyppä H, Hyyppä J, Maltamo M, Nilsson M, Olsson H, Persson Å, Söderman U (2004) Laser scanning of forest resources: the Nordic experience. Scand J Forest Res 19(6):482–499.

  • Nichiforel L, Keary K, Deuffic P, Weiss G, Thorsen B, Winkel G, Avdibegovic M, Dobšinská Z, Feliciano D, Gatto P, Górriz ME, Hoogstra-Klein M, Hrib M, Hujala T, Jager L, Jarský V, Jodłowski K, Lawrence A, Lukmine D, Bouriaud L (2018) How private are Europe’s private forests? A comparative property rights analysis. Land Use Policy doi:

  • Packalén P, Pitkänen J, Maltamo M (2008) Comparison of individual tree detection and canopy height distribution approaches: a case study in Finland. Proceedings of SilviLaser 2008, 8th International Conference on LiDAR applications in Forest Assessment and Inventory, Heriot-Watt University, Edinburgh, UK, 17-19 September, 2008, pp 22-29

  • Pasalodos-Tato M (2010) Optimising forest stand management in Galicia, North-Western Spain. Dissertationes Forestales. Doi:

  • Pont D, Watt M, Adams T, Marshall H, Lee J, Crawley D, Pete W (2012) Modelling variation in Pinus radiata stem velocity from area and crown-based LiDAR metrics. N Z J Forest Sci 43:1.

  • R Core Team (2016) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna Accessed 20 July 2018

  • Redmond J, Gschwantner T, Riedel T, Alberdi I, Vidal C, Bosela M, Fischer C, Hernández L, Kučera M, Kuliešis A, Tomter S, Vestman M, Lanz A (2016) Comparison of wood resource assessment in national forest inventories. In: Claude V, Iciar AA, Laura HM, John JR (eds) National Forest Inventories: assessment of wood availability and use. Springer, Cham

  • Roussel JR, Auty D, De Boissieu F, Meador AS (2018) Package lidR - Airborne LiDAR data manipulation and visualization for forestry applications. Accessed 20 July 2018

  • Ruiz LA, Hermosilla T, Mauro F, Godino M (2014) Analysis of the influence of plot size and LiDAR density on forest structure attribute estimates. Forests 5(5):936–951.

  • Saarela S, Schnell S, Grafström A, Tuominen S, Nordkvist K, Hyyppä J, Kangas A, Ståhl G (2015) Effects of sample size and model form on the accuracy of model-based estimators of growing stock volume. Can J Forest Res 45:1524–1534.

  • Siipilehto J (2000) A comparison of two parameter prediction methods for stand structure in Finland. Silv Fenn 34(4):617.

  • Smreček R, Danihelová Z (2013) Forest stand height determination from low point density airborne laser scanning data in Roznava Forest enterprise zone (Slovakia). iForest - Biogeosci Forest 6:48–54.

  • Ståhl G, Saarela S, Schnell S, Holm S, Breidenbach J, Healey S, Patterson P, Magnussen S, Næsset E, Mcroberts R, Gregoire T (2016) Use of models in large-area forest surveys: comparing model-assisted, model-based and hybrid estimation. Forest Ecosyst 3:5.

  • Stereńczak K (2010) Airborne laser scanner technology as a source of data for semi-automatic forest inventory. Sylwan 154:88–99 (in Polish)

  • Stereńczak K, Lisańczuk M, Parkitna K, Mitelsztedt K, Mroczek P, Miścicki S (2018) The influence of number and size of sample plots on modelling growing stock volume based on airborne laser scanning. Drewno 61(201).

  • The Forests Act (1991) Official journal of laws 05.45.435. Accessed 20 July 2018 (in Polish)

  • Tompalski P, Coops NC, White JC, Wulder MA (2015) Enriching ALS-derived area-based estimates of volume through tree-level downscaling. Forests 6:2608–2630

  • Tomppo E (1991) Satellite image-based national forest inventory of Finland. Int Arch Photogr Remote Sensing 28:419–424. Proceedings of the Symposium on Global and Environmental Monitoring, Techniques and Impacts, 17–21 Sept 1990, Victoria, British Columbia, Canada

  • Tonolli S, Dalponte M, Vescovo L, Rodeghiero M, Bruzzone L, Gianelle D (2010) Mapping and modeling forest tree volume using forest inventory and airborne laser scanning. Eur J Forest Res 130:569–577.

  • Turner R, Goodwin N, Friend J, Mannes D, Rombouts J, Haywood A (2011) A national overview of airborne Lidar application in Australian forest agencies. SilviLaser 2011, Oct 16–19. Hobart, TAS, AU

  • Vauhkonen J, Ørka H, Holmgren J, Dalponte M, Heinzel J, Koch B (2014) Tree species recognition based on airborne laser scanning and complementary data sources. In: Matti M, Erik N, Jari V (eds) Forestry applications of airborne laser scanning. Springer, Dordrecht.

  • Vidal C, Alberdi I, Hernández L, Redmond JJ (2016) National forest inventories, assessment of wood availability and use. Springer International Publishing, Switzerland.

  • Voorhis C, Morgan B (2007) Understanding power and rules of thumb for determining sample size. Quant Method Psychol.

  • Watt M, Adams T, Gonzalez AS, Marshall H, Watt P (2013) The influence of LiDAR pulse density and plot size on the accuracy of New Zealand plantation stand volume equations. N Z J Forest Sci 43:15.

  • White J, Wulder M, Buckmaster G (2014) Validating estimates of merchantable volume from airborne laser scanning (ALS) data using weight scale data. Forest Chron 90:378–385.

  • White J, Wulder M, Whitehead R (2013) A best practices guide for generating forest inventory attributes from airborne laser scanning data using an area based approach. BC Forest Profess 20(6):20–21

  • White JC, Nicholas CC, Michael AW, Mikko V, Thomas H, Piotr T (2016) Remote sensing technologies for enhancing forest inventories: a review. Can J Remote Sens 42(5):619–641.

  • White JC, Piotr T, Mikko V, Michael AW, Ninni S, Christoph S, Nicholas CC (2017) A model development and application guide for generating an enhanced forest inventory using airborne laser scanning data and an area-based approach. Canadian Forest Service, Canadian Wood Fibre Centre, Natural Resources, Canada. Information report FI-X-018

  • Wilson E (1920) The use of seaplanes in forest mapping. J Forest 18(1):1–5.

  • Woods M, Pitt D, Penner M, Lim K, Nesbitt D, Etheridge D, Treitz P (2011) Operational implementation of a LiDAR inventory in boreal Ontario. Forest Chron 87:512–528.

  • Wulder M (1998) Optical remote-sensing techniques for the assessment of forest inventory and biophysical parameters. Prog Phys Geogr 22:449.

  • Wulder MA, Bater CW, Coops NC, Hilker T, White JC (2008) The role of LiDAR in sustainable forest management. For Chron 84(6):807–826.

  • Yang TR, Kershaw JA, Weiskittel AR, Lam TY, McGarrigle E (2019) Influence of sample selection method and estimation technique on sample size requirements for wall-to-wall estimation of volume using airborne LiDAR. Forestry 92(3):311–323.

  • Zygmunt R, Banaś J, Bujoczek L, Zięba S (2017) Monetary value tariff of timber calculated using databases of forests. Sylwan. 161(2):91–100


The authors of this study would like to thank all people involved in the assignments within the scope of the project, related to the issues presented in this article. The authors would also like to express their gratitude to Miriam O’Regan, who edited the manuscript.


This study was performed under the research project entitled “Remote sensing-based assessment of woody biomass and carbon storage in forests”, which was financially supported by the National Centre for Research and Development (Poland), under the BIOSTRATEG programme (Agreement No. BIOSTRATEG1/267755/4/NCBR/2015). Financial support was also received from the project entitled “Rozbudowa metody inwentaryzacji urządzeniowej stanu lasu z wykorzystaniem efektów projektu REMBIOFOR” (Project No. 500463, agreement No. EO. with the Polish State Forests National Forest Holding, signed on 14.10.2019), which constitutes a continuation of the former project.

Author information

Authors and Affiliations



ML: literature review, goal and scope determination, method development, data preparation, data processing, data analysis, graph generation, result interpretation, critical revision, conclusion formulation, and manuscript drafting. KM: literature review, data preparation, processing output approval, result interpretation, table generation, critical manuscript revision, conclusion formulation, and consulting. KP: literature review, data preparation, result and manuscript revision, and consulting. GK: literature review, data preparation, result and manuscript revision, and consulting. KS: literature review, data acquisition, concept delineation, goal and scope determination, supervision, consulting, manuscript revision, and fundraising. EWF: consulting. SM: statistical consulting and critical manuscript revision. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Marek Lisańczuk.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

About this article

Cite this article

Lisańczuk, M., Mitelsztedt, K., Parkitna, K. et al. Influence of sampling intensity on performance of two-phase forest inventory using airborne laser scanning. For. Ecosyst. 7, 65 (2020).
