- Open Access
Does spatial auto-correlation call for a revision of latest heavy metal and nitrogen deposition maps?
Environmental Sciences Europe volume 24, Article number: 20 (2012)
Within the framework of the Convention on Long-range Transboundary Air Pollution atmospheric depositions of heavy metals and nitrogen as well as critical loads/levels exceedances are mapped yearly with a spatial resolution of 50 km by 50 km. The maps rely on emission data and are calculated by use of atmospheric modelling techniques. For validation, EMEP monitoring data collected at up to 70 sites across Europe are used. This spatially sparse coverage gave reason to test if the chemical and physical relations between atmospheric depositions and their accumulation in mosses collected at up to 7000 sites throughout Europe can be quantified in terms of statistical correlations which, if proven, could be used to calculate deposition maps with a higher spatial resolution. Indeed, combining EMEP maps on atmospheric depositions of cadmium, lead and nitrogen and the related maps of their concentrations in mosses by use of a Regression Kriging approach yielded deposition maps with a spatial resolution of 5 km by 5 km. Since spatial auto-correlation can make testing of statistical inference too liberal, the investigation at hand was to validate the 5 km by 5 km deposition maps by analysing if spatial auto-correlation of both EMEP deposition data and moss data impacted on the significance of their statistical correlation and, thus, the validity of the deposition maps. To this end, two hypotheses were tested: 1. The data on deposition and concentrations in mosses of heavy metals and nitrogen are not spatially auto-correlated significantly. 2. The correlations between the deposition and moss data lack statistical significance due to spatial autocorrelation.
As already published, the regression models corroborated significant correlations between the concentrations of heavy metals and nitrogen in atmospheric depositions on the one hand and respective concentrations in mosses on the other hand. This investigation proved that atmospheric deposition and bioaccumulation data are spatially auto-correlated significantly in terms of Moran’s I values and, thus, hypothesis 1 could be rejected. Accordingly, the degrees of freedom were reduced. Nevertheless, the results of the calculations regarding the reduced degrees of freedom indicate that the statistical relations between atmospheric depositions and bioaccumulations remained statistically significant so that hypothesis 2 could be rejected, too.
The positive auto-correlation in data on atmospheric deposition and bioaccumulation does not call for a revision of the 5 km by 5 km deposition maps published in recent papers. Therefore we can conclude that the European moss monitoring yields data that support the validation of modelling and mapping of atmospheric depositions of heavy metals and nitrogen at a high spatial resolution compared to the 50 km x 50 km EMEP maps.
Measurements of atmospheric depositions are needed as a basis to evaluate environmental quality. To this end, deposition data are, amongst others, used to calculate exceedance maps for critical loads. Critical loads are defined as quantitative estimates of an exposure to one or more pollutants below which significant harmful effects on specified ecosystem functions are not expected to occur according to present knowledge . In Europe, the control of heavy metals and reactive nitrogen emissions to air is regulated under several directives of the European Union and protocols of the Long-range Transboundary Air Pollution (LRTAP) Convention. Under the LRTAP Convention, the European Monitoring and Evaluation Programme (EMEP) collects emission data from European countries in order to model atmospheric transport and depositions of air pollutants. Amongst others, depositions of cadmium (Cd), lead (Pb) and nitrogen (N) are calculated using chemical transport models yielding deposition maps with a grid size of 50 km by 50 km. The modelling results are validated by use of deposition data collected at EMEP monitoring sites. However, the number of EMEP measurement stations is rather limited across Europe and EMEP stations are generally under-represented in Southern and Eastern Europe. In 2005, 53 EMEP stations measured the concentration of nitrogen compounds in precipitation and wet deposition, whereas up to 41 stations reported air concentrations of nitrogen compounds . In case of heavy metals, the number of EMEP measurement stations accounts for up to 70 throughout Europe .
For ecosystem-specific evaluations of exposure in terms of atmospheric depositions or critical loads information with high spatial resolution is crucial [4–10]. To enhance the spatial resolution of the deposition maps data on phenomena that are physically and statistically related with depositions and collected at higher spatial density could be utilised. Once substances emitted to air have been deposited, they can accumulate in plant biomass, as for instance in mosses. The European moss biomonitoring network encompassing up to 7000 sites was established in 1990 and has been repeated every five years since then . Carpet-forming, ectohydric mosses obtain most trace elements and nutrients directly from precipitation, occult deposition and dry deposition. Therefore, the moss technique has been shown to provide a complementary, time-integrated measure of element deposition from the atmosphere to terrestrial systems quantifying the potential availability of potentially harmful substances such as heavy metals  or nutrients such as nitrogen . With the moss technique a much higher sampling density can be achieved than with conventional deposition analysis. The national moss surveys across Europe are coordinated by the ICP Vegetation and follow recommendations regarding sampling, preparation and chemical analyses of the mosses put down in an experimental protocol. Figure 1 shows the distribution of moss species sampled in the 2005 survey together with the EMEP 50 km x 50 km raster.
The European moss monitoring produces datasets at high spatial resolution which was used to evaluate the performance of the EMEP model  and to calculate deposition maps with a spatial resolution of 5 km by 5 km through modelling the statistical relations between atmospheric deposition and bioaccumulation of of Cd, Pb and N by use of Regression Kriging [12, 13]. The corresponding methodology and results can be summarised as follows: The EMEP deposition maps were intersected within a GIS with Kriging maps on N, Cd and Pb accumulations in mosses. The maps were calculated by Ordinary Kriging on basis of the variograms presented in the ‘Results’ section of this paper. Next medians were calculated for all moss estimations within each EMEP grid cell. Both moss data and corresponding modelled deposition values were ln-transformed and their relationship investigated and modelled by linear regression analysis. The regression models corroborate that the Cd concentration in mosses is correlated with the EMEP modelled total Cd deposition across Europe (regression coefficient according to Pearson, rp = 0.67; regression coefficient according to Spearman, rs = 0.69). The coefficient of determination is R2 = 0.44. The same is true for Pb with rp = 0.76 and rs = 0.77 and R2 = 0.58 . The regression analysis of the estimated N concentrations in mosses and the modelled EMEP depositions, too, resulted in clear linear regression patterns with coefficients of determination of R2 = 0.62 and Pearson correlations of rp = 0.79 and Spearman correlations of rs = 0.70, respectively . The regression equations were applied on the moss kriging estimates of the element concentration in mosses. The respective residuals were projected onto the centres of the EMEP grid cells and were mapped using variogram analysis and ordinary kriging. Finally, the residual and the regression map were summed up to the map of total N, Cd, and Pb deposition in terrestrial ecosystems throughout Europe. This was done for a 5 km by 5 km raster which was chose due to the results of nearest neighbourhood statistics: All nearest neighbour distances of all moss sites were calculated in ArcGIS 10.0 and summarised in terms of quantile statistics. The 10th quantile was chosen in order to adjust the interpolation raster to the high density of the moss monitoring net approximating ca. 5000 m (exact value: 5468.5 m).
By application of this environmental mapping methodology the EMEP maps could be improved in both spatial resolution and, by adding more empirical data, in terms of validation aspects. Due to the use of moss data the maps furthermore depict direct impacts of atmospheric pollution to terrestrial ecosystem functions since the uptake of pollutants by plants can be seen as the first step towards an effect.
Auto-correlation is a widespread phenomenon in environmental systems [14, 15]. In statistics, the auto-correlation of a random process is defined as the similarity of, or correlation between, values of a process at neighbouring points in time or space. Auto-correlation describes the similarity between observations as a function of the separation of time and space intervals between them. Positive auto-correlation means that the individual observations contain information which is part of other, timely or spatial neighbouring, observations. Subsequently, the effective sample size will be lower than the number of realized observations. Negative auto-correlation can have the opposite effect, thus, making the effective sample larger than the realized sample . Therefore, autocorrelation can have several implications for calculating statistics of measurement data in terms of statistical inference testing [17, 18]. Initially, investigations of statistical implications of auto-correlation concentrated mainly on time series analysis and were followed by investigations of the impacts of spatial auto-correlation on inference testing methods. For instance, it could be shown, that positive spatial auto-correlation enhances type I errors, so that parametric statistics such as Pearson correlation coefficients, are declared significant when they should not be . These findings gave reason for the investigation at hand aiming at validating recently published deposition maps which were derived by a Regression Kriging approach [12, 13].
The results are presented in Figures 2, 3, 4 and Tables 1 and 2. Variogram analyses (Figures 2, 3, 4) reveal that the concentrations of Cd, Pb and N in mosses measured at 5731 (Cd, Pb) and 2781(N) sites, respectively, exhibit positive spatial auto-correlation. The measurement values were transformed log normally due to the highly skewed data distributions of the elements investigated (Skewness Cd = 8.1; skewness Pb = 11; skewness N=1). With variogram analysis experimental semi-variances are calculated in terms of half of the average squared differences of all pairs of measurement values within each distance interval. The mean nearest neighbour distances were chosen as a starting point for the distance intervals resulting in 15.6 km for Cd, 15.8 km for Pb and 16.5 km for N. The width of the variogram window was set so that both the increase and the flattening of the semi-variance values with the separation distance could be clearly observed. Then, semi-variogram models were fitted to the experimental semi-variograms by a least squared regression line. The variogram model can be described by three parameters: range, sill and nugget-effect. The range equals the maximum separation distance for which a distinct increase of semi-variogram values, and therefore spatial autocorrelation, can be observed. The sill corresponds to the semi-variance assigned to the range. High spatial variability within the first distance interval can be caused by measurement errors and other confounding factors resulting in nugget-effects. Accordingly, the variogram model will tend to cut the ordinate of the variogram plot above the origin. Even though such a high nugget effect can be observed for Cd, Pb and N a distinct increase of experimental semi-variances with separation distance proves that spatial autocorrelation exists in all three cases.
Table 1 corroborates by means of calculated Moran’s I values for the same distance intervals that this positive spatial auto-correlation is also statistically significant. Moran’s I values range from approximately +1 to −1, where positive Moran’I values represent positive and negative Moran’s I values represent negative spatial autocorrelation . P-values may be calculated for each of the derived Moran’s I values and therefore the statistical significance of spatial autocorrelation can be assessed.
Consequently, this positive spatial auto-correlation was accounted for in the calculation of statistical correlation between Cd, Pb and N (for N: dry, wet and total) medians for EMEP cells and the corresponding EMEP according to . Table 2 contains some descriptive statistical measures for all variables investigated. The results of the correlation analysis show that the auto-correlation considerably reduces the degrees of freedom. Despite this, the correlations remained statistically significant (p < 0.01 for Cd and Pb; p < 0.05 for N) (Table 3). As a result, both hypotheses which have been tested were to be falsified. Thus, the 5 km by 5 km deposition maps which have been calculated based on the correlations between atmospheric depositions and bioaccumulations, and by means of Regression Kriging [12, 13] are statistically valid and could be used for ecosystem-specific exposure evaluations or calculations of critical loads.
Neighbouring measurement values along time series or across geographic space that are more similar or less similar than expected for randomly associated pairs of measurements are positively auto-correlated or negatively auto-correlated, respectively. Temporal and spatial auto-correlation is a widespread property of environmental variables and as such the result of abiotic and biotic processes and their interrelations. Thus, spatial patterns existing across the whole spectrum of spatial scales are functional in ecosystems and not the result of pure random effects. This fact conflicts with the assumptions of statistics such as, e.g., the independence of observations. The problem with auto-correlated data is that an observation at a certain point in time or space does not bring 100 % additional information and, hence, cannot be accounted for one full degree of freedom due to its similarity with neighbouring measurements [22, 23]. Taken the computation of a Pearson or Spearman correlation coefficient as an example, positive spatial auto-correlation of the two variables, e.g. atmospheric deposition and concentrations in mosses, provoke that the coefficient is declared too often significant. The fact that ecological reality in terms of auto-correlation often violates the assumption of inference statistical methods is of crucial importance for ecological sampling design, analysis and evaluation of field experiments and surveys [22, 24, 25]. The same holds true for spatial analysis of landscapes , including for instance testing the significance of the relation between spatially auto-correlated data at the landscape level . The latter case was examined in this investigation by example of data on atmospheric deposition and physically related concentrations of heavy metals and nitrogen in mosses. Even when accounting for spatial auto-correlation and applying the method proposed by  the relation between deposition and bioaccumulation remained statistically significant.
The positive auto-correlation in data on atmospheric deposition and concentrations in mosses does not call for revision of the 5 km by 5 km deposition maps published recently [12, 13]. Therefore, the European moss monitoring yields data that support the validation of modelling and mapping of atmospheric depositions of heavy metals and nitrogen at a high spatial resolution. The validation of the 5 km by 5 km deposition maps in terms of the auto-correlation tests presented in this investigation allows for the maps to be used to calculate critical loads exceedances complementing the ecotoxicological endpoint ‘accumulation’. Thus, the complementary use of data derived from two internationally harmonized monitoring networks, the EMEP deposition measurement and the ICP Vegetation moss monitoring, allows for synergies enhancing the spatial validity of deposition maps and subsequent products.
The EMEP deposition data for the year 2005 and the moss concentration data collected within the International Cooperative Programme on Effects of Air Pollution on Natural Vegetation and Crops (ICP Vegetation, http://icpvegetation.ceh.ac.uk) were analysed in a two step procedure: Firstly, the deposition and moss data were mapped by use of Regression Kriging (see ‘Introduction’) [12, 13]. Secondly, in this investigation we analysed how spatial auto-correlation in the modelled deposition data and the moss data influences the testing of statistical inference. To this end, two hypotheses were tested: 1. The data on deposition and concentrations in mosses of Cd, Pb and N are not spatially auto-correlated significantly. 2. The correlations between the deposition and moss data lack statistical significance due to spatial auto-correlation. Both hypotheses were tested through calculation of:
Experimental and modelled semi-variograms of ln transformed moss data for Cd, Pb and N;
Amount and significance of spatial auto-correlation for the first ten distance classes of the semi-variograms by use of Moran’s I ;
Significance of correlations between data on atmospheric deposition and concentrations in mosses with regard to the potential reduction of degrees of freedom due to positive spatial auto-correlation according to .
The extension Geostatistical analyst from ESRI ArcGIS 10.0 was used for calculation of semi-variograms. The software SAM v4.0 (Spatial Analysis in Macroecology) was applied in order to calculate Moran’s I values and to account for spatial auto-correlation when testing the correlation between EMEP values and moss data for statistical significance .
Nilsson J, Grennfelt P (Eds): Critical loads for sulphur and nitrogen. UNECE /Nordic Council workshop report, Skokloster, Sweden. March 1988. Copenhagen: Nordic Council of Ministers; 1988.
Harmens H, Norris DA, Cooper DM, Mills G, Steinnes E, Kubin E, Thöni L, Aboal JR, Alber R, Carballeira A, Coskun M, De Temmerman L, Frolova M, González-Miqueo L, Jeran Z, Leblond S, Liiv S, Mankovská B, Pesch R, Poikolainen J, Rühling Å, Santamaria JM, Simonèiè P, Schröder W, Suchara I, Yurukova L, Zechmeister HG: Nitrogen concentrations in mosses indicate the spatial distribution of atmospheric nitrogen deposition in Europe. Environ Pollut 2011, 159: 2852–2860. 10.1016/j.envpol.2011.04.041
Harmens H, Norris DA, Steinnes E, Kubin E, Piispanen J, Alber R, Aleksiayenak Y, Blum O, Coskun M, Dam M, De Temmerman L, Fernandez JA, Frolova M, Frontasyeva M, González-Miqueo L, Grodzinska K, Jeran Z, Korzekwa S, Krmar M, Kvietkus K, Leblond S, Liiv S, Magnusson SH, Mankovska B, Pesch R, Rühling Å, Santamaria JM, Schröder W, Spiric Z, Suchara I, Thöni L, Urumov V, Yurukova L, Zechmeister HG: Mosses as biomonitors of atmospheric heavy metal deposition: spatial and temporal trends in Europe. Env Pollut 2010, 158: 3144–3156. 10.1016/j.envpol.2010.06.039
Bertino L, Wackernagel H: Case studies of change-of-support problems. Technical report N–21/02/G, ENSMP—ARMINES. France: Centre de Géostatistique, Fontainebleau; 2002.
Genikhovich E, Filatova E, Ziv A: A method for mapping the air pollution in cities with the combined use of measured and calculated concentrations. Int J Environ Pollut 2002, 18: 56–63. 10.1504/IJEP.2002.000694
Goovaerts P: Geostatistical approaches for incorporating elevation into the spatial interpolation of rainfall. J Hydrol 2000, 228: 113–129. 10.1016/S0022-1694(00)00144-X
Pauly M, Drueke M: Mesoscale spatial modelling of ozone immissions. An application of geostatistical methods using a digital elevation model. Gefahrstoffe - Reinhalt Luft 1996, 56: 225–230.
Spranger T, Kunze F, Gauger T, Nagel D, Bleeker A, Draaijers G: Critical loads exceedances in Germany and their dependence on the scale of input data. Water Air Soil Pollut 2001, (Focus 1):335–351.
Van de Kassteele J, Stein A, Dekkers ALM, Velders GJM: External drift kriging of NOx concentrations with dispersion model output in a reduced air quality monitoring network. Environ Ecol Stat 2009, 16: 321–339. 10.1007/s10651-007-0052-x
Wuyts K, De Schrijver A, Verheyen K: The importance of forest type when incorporating forest edge deposition in the evaluation of critical load excedance. iForest 2009, 2: 43–45. 10.3832/ifor0486-002
Schröder W, Holy M, Pesch R, Harmens H, Fagerli H, Alber R, Coskun M, De Temmerman L, Frolova M, González-Miqueo L, Jeran Z, Kubin E, Leblond S, Liiv S, Mankovská B, Piispanen J, Santamaría JM, Simonèiè P, Suchara I, Yurukova L, Thöni L, Zechmeister HG: First Europe-wide correlation analysis identifying factors best explaining the total nitrogen concentration in mosses. Atmos Environ 2010, 44: 3485–3491. 10.1016/j.atmosenv.2010.06.024
Schröder W, Holy M, Pesch R, Harmens H, Fagerli H: Mapping background values of atmospheric nitrogen total depositions in Germany based on EMEP deposition modelling and the European Moss Survey 2005. Environ Sci Europe 2011, 23: 18. dx.doi.org/10.1186/2190-4715-23-18
Schröder W, Holy M, Pesch R, Zechmeister GH, Harmens H, Ilyin I: Mapping atmospheric depositions of cadmium and lead in Germany based on EMEP deposition data and the European Moss Survey 2005. Environ Sci Europe 2011, 23: 19. dx.doi.org/10.1186/2190-4715-23-19
Brown DG, Aspinall T, Bennett DA: Landscape models and explanation in landscape ecology – a space for generative landscape science? Prof Geograph 2006, 58: 369–382. 10.1111/j.1467-9272.2006.00575.x
Legendre P: Spatial autocorrelation: Trouble or new paradigm? Ecology 1993, 74: 1659–1673. 10.2307/1939924
Dale MRT, Fortin M-J: Spatial autocorrelation and statistical tests: Some solutions. J Agr Biol Environ Stat 2009, 14: 188–206. 10.1198/jabes.2009.0012
Cliff A, Ord J: The problem of spatial autocorrelation. In London Papers of Regional Science. Edited by: Scott A. London: Pion; 1969:25–55.
Fortin MJ, Dale MRT: Spatial autocorrelation in ecological studies: a legacy of solutions and myths. Geographical Analysis 2009, 41: 392–397. 10.1111/j.1538-4632.2009.00766.x
Fortin J-M, Payette S: How to test the significance of the relation between spatially autocorrelated data at the landcape scale: A case study using fire and forest maps. Ecosci 2001, 9: 213–218.
Moran PAP: Notes on continuous stochastic phenomena. Biometrika 1950, 37: 17–23.
Dutilleul P: Modifying the t-test for assessing the correlation between two spatial processes. Biometrics 1993, 49: 305–314. 10.2307/2532625
Legendre P, Dale MRT, Fortin M-J, Gurevitch J, Hohn M, Myers D: The consequences of spatial structure for the design and analysis of ecological field surveys. Ecography 2002, 25: 601–615. 10.1034/j.1600-0587.2002.250508.x
Legendre P, Fortin M-J: Spatial pattern and ecological analysis. Vegetation 1989, 80: 107–138. 10.1007/BF00048036
Fortin M-J, Drapeau P, Legendre P: Spatial autocorrelation and sampling design in plant ecology. Vegetation 1989, 83: 209–222. 10.1007/BF00031693
Legendre P, Dale MRT, Fortin M-J, Casgrain P, Gurevitch J: Effects of spatial structures on the results of field experiments. Ecology 2004, 85: 3202–3214. 10.1890/03-0677
Wagner HH, Fortin M-J: Spatial analysis of landscapes: Concepts and statistics. Ecology 2004, 86: 1975–1987.
Fortin J-M, Payette S: How to test the significance of the relation between spatially autocorrelated data at the landscape scale: A case study using fire and forest maps. Ecosci 2002, 9: 213–218.
We thank the United Kingdom Department for Environment, Food and Rural Affairs (Defra; contract AQ0810 and AQ0816), the UNECE (Trust Fund) and the Natural Environment Research Council (NERC) for funding the ICP Vegetation Programme Coordination Centre at CEH Bangor, UK. The contributions of many more scientists in 2005/6 and all the funding bodies in each country are gratefully acknowledged (see [2, 3], for details).
No competing interests do exist.
WS wrote the text. RP conducted the computations. HH, HF and II supported the work by dealing with the validity of experimental and modelling data. All authors read and approved the final manuscript.
Winfried Schröder, Roland Pesch contributed equally to this work.
About this article
Cite this article
Schröder, W., Pesch, R., Harmens, H. et al. Does spatial auto-correlation call for a revision of latest heavy metal and nitrogen deposition maps?. Environ Sci Eur 24, 20 (2012). https://doi.org/10.1186/2190-4715-24-20
- Concentrations of Cd
- Pb and N in mosses
- Atmospheric depositions of Cd
- Pb and N
- EMEP deposition network and modelling
- ICP Vegetation