Creating jigsaw puzzles in ArcMap

From AWF-Wiki
(Difference between revisions)
Jump to: navigation, search
(jigsaw puzzles for k-tree sampling)
 
(32 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==The concept of cluster inclusionzones for fixed area sample plots==
+
==The concept of cluster selection zones for fixed area sample plots==
  
The so called '''jigsaw puzzle view''' (Roesch et al. 1993<ref name="roesch1993">Roesch, F.A.; Green, E.J.; Scott, C.T. 1993. An Alternative View of Forest Sampling. Survey Methodology 19 (2), 199-204.</ref>) is a decomopsition of the total domain of interest in non-overlapping inclusion zones for exclusive clusters of trees. In the [[infinit population approach]] sample elements are dimensionless points drawn from an infinit [[Lecturenotes:Population|sample frame]]. Observations are derived as generalization over a number of nearest neighbours (trees) to a certain sample point. In fixed area sampling neighbours are considered up to a fixed distance (the radius of a circular sample plot). This "inclusion distance" can be assigned to every tree in the domain of interest, as a sample point falling in that distance would lead to a selection of the respective tree. This single tree inclusion zones are overlapping (as in most cases more than only one tree is selected by a sample point) whereas the resulting pattern is a decomposition of the total area in non-overlapping polygons, that would lead to a selection of a particular cluster of trees. As the number of possible observations (derived from clusters) is much smaller than the invinit number of possible sample points, the jigsaw puzzle is a classification of the infinit sample frame in areas of possible realizations. The size of the polygons is an expression of the [[selection probability]] for a certain observation.
+
The so called '''jigsaw puzzle view''' (Roesch et al. 1993<ref name="roesch1993">Roesch, F.A.; Green, E.J.; Scott, C.T. 1993. An Alternative View of Forest Sampling. Survey Methodology 19 (2), 199-204.</ref>) is a tessellation of the total domain of interest in non-overlapping selection zones for exclusive clusters of trees. In the [[infinite population approach]] sample elements are dimensionless points drawn from an infinite [[Lecturenotes:Population|sample frame]]. On the other side we can only imagine a certain discrete number of possible observations derived over a number of nearest neighbors (trees) to a certain sample point. With fixed area [[sample plot]]s neighbors are included up to a fixed distance (the plot radius). This "inclusion distance" can be assigned to every tree in the domain of interest. Whenever a sample point falls in that distance it would lead to an inclusion of the respective tree in a sample. This single tree [[inclusion probability|inclusion zone]]s are overlapping (as in most cases more than only one tree is selected by a sample point) whereas the resulting pattern is a tessellation of the total area in non-overlapping polygons, that would lead to a selection of a particular cluster of trees. As the number of possible observations (derived from clusters of conjointly included trees) is much smaller than the infinite number of possible sample points, the jigsaw puzzle is a classification of the infinite sample frame in a finite number of areas of possible observations. This number of observations that can be obtained by sampling is the virtual [[Lecturenotes:population|population]] in which sampling is conducted. The size of the polygons is an expression of the [[inclusion probability|selection probability]] for a certain observation.  
This article describes briefly, how jigsaw puzzles can be created from a map with tree locations in ArcGIS.
+
This article describes, how jigsaw puzzles can be created from a map with tree locations in ArcGIS.
  
 
===Workflow of creating jigsaw puzzles===
 
===Workflow of creating jigsaw puzzles===
 +
A tessellation of the area of interest can e.g. be obtained by means of GIS software. The following description refers to a simple model that can be implemented in ArcGIS and is able to derive a tesselation for circular sample plots. Basis for the calculation is the information about the [[inclusion probability]] of each spatially dispersed element that is derived based on the relation between a tree's inclusion zone and the total area of interest. Therefore trees are buffered with the radius of sample plots.
  
*Basis for this decomposition is a map with tree locations (x and y coordinates) that might be generated or the result of a full assessment of a forest stand as shapefile or layer.
+
[[file:jigsawmodel.jpg|center]]
[[Image:Tree_coordinates.png]]
+
  
 +
For each tree it is possible to calculate the number of trees/ha that it represents once it is included in a sample under the stipulated plot design. By intersecting all buffers one can tessellate the area in single polygons that are related to a certain combination of included trees and/or to a certain observation that can be made on this mutually exclusive set. By populating all single tree observations that are included by sample points in such a polygon, it is possible to calculate the resulting estimate of the number of trees/ha that would result at this point.
  
====Creating buffers====
+
[[file:jigsaw.png|700px|center]]
*All trees must be buffered with the radius of a cirlular sample plot (their [[inclusion zone]]). Therefor create a new empty shapefile for polygons (with ArcCatalog) and load it to ArcMap. Start editing the tree-shape and select the new empty polygon shapefile as target for creating new features. Choose Buffer from the edit menu.
+
 
+
[[Image:create buffer.jpg]]
+
 
+
 
+
*After buffering all points your output looks like this:
+
 
+
[[Image:buffers.png]]
+
 
+
===="Union"====
+
*The next step is to identify all overlapping polygons and the resulting number of selected inclusion zones for the particular area. Therefore one has to add a new column to the data table (of the buffer shape) for the number of stems that are related to the specific inclusion zones. As every inclusion zone is exclusive for only one tree, this column contains the value 1 for all buffer polygons (use the field calculator and set the new field =1 for all).
+
 
+
*Now we create a new layer containing all resulting polygons by '''Union''' all buffers. Therefore select '''Analysis Tools - Overlay - Union''' from the ArcToolbox and choose the buffer layer as input. The output will be a new layer (_union) that contains all polygons that are resulting from the intersection of the input buffers. In this case one has to account, that an intersection between two buffer polygons A and B will lead to an output of 4 intersection polygons (the part of A and B that is not intersecting, and two idetical polygons for the intersection area). The task is to reduce this number to only three exclusive polygons. Therefor add three new columns in the datatable of the _union layer that helps to later identify identical polygons. One feature that is exclusive for these polygons are the centroids (x and y) and the area. If you add these columns you can calculate the respective values by calculate goemetry:
+
 
+
[[Image:calculategeometry.jpg]]
+
 
+
===="Dissolve"====
+
 
+
*To reduce the number of polygons choose Data Management Tools - Dissolve from the ArcToolbox and select the _union shape as input. Now select the new fields (centroid_x, centroid_y, area) as dissolve fields (to identify the identical polygons). As we are interested in the number of overlapping inclusion zones per polygon (in case that number of stems is the target), choose the field you added for the stem number as statistics field and calculate the sum:
+
 
+
[[Image:dissolve.jpg]]
+
 
+
*The resulting output (_union_dissolve) should look like this if you choose a color ramp (number of classes=number of possible realizations) for the symbology:
+
 
+
[[Image:jigsaw.png]]
+
 
+
The number of polygons in this view is the number of possible clusters of trees (and in consequence the number of different possible realizations that are derived from these clusters). In the given example a number of 200 trees with the given spatial distribution leads to a number of ~9100 possible clusters. As the target value in this example is the number of stems per ha, polygons that would lead to identical estimates if a sample point falls inside, can be classified. The number of classes is the number of possible realizations (here 1-32).
+
The differnt colors show the inclusion area (if related to the total area this is the selection probability of a certain realization) for the different classes.
+
  
 
===jigsaw puzzles for k-tree sampling===
 
===jigsaw puzzles for k-tree sampling===
In case of k-tree sampling single trees are selected as most nearest neighbours to a sample point. In contrast to fixed area plots the neighbourhood is not defined by a maximum inclusion distance (a certain bandwith) but is restricted to a fixed number of neighbours. Also in this case it is possible to create jigsaw puzzles. Same like in the example above they are a decomposition of the total domain of an infinit number of possible sample point locations in exclusive areas (polygons) for each particular realization that is possible based on the given spatial distribution of trees. Each polygon is composed by the infinit number of points that would lead to the selection of a particular cluster of k trees. The decomposition of the domain is a so called Voronoi diagramm in case of k=1 and a higher order Voronoi diagramm (kth order) in case that k neighbours are selected per sample point.
+
In case of [[Distance based plots|k-tree sampling]] single trees are selected as most nearest neighbors to a sample point. In contrast to fixed area plots the neighborhood is not defined by a maximum inclusion distance (a certain fixed bandwidth) but is restricted to a fixed number of neighbors. Also in this case it is possible to create jigsaw puzzles. Same like in the example above they are a decomposition of the total domain of an infinite number of possible sample point locations in exclusive areas (polygons) for each particular realization that is possible based on the given spatial distribution of trees. Each polygon is composed by the infinite number of points that would lead to the selection of a particular cluster of k trees. The decomposition of the domain is a so called Voronoi diagram in case of k=1 and a higher order Voronoi diagram (kth order) in case that k neighbors are selected per sample point.
The following example shows a Voronoi diagramm for 6-tree sampling.
+
The following example shows a Voronoi diagram for 6-tree sampling.
  
 
[[Image:k-treejigsaw.jpg|center]]
 
[[Image:k-treejigsaw.jpg|center]]
  
The number of polygons is equal to the number of all possible clusters with 6 nearest trees (all points inside one polygon would select the same cluster) and as result also the number of all possible realizations.
+
The number of polygons is equal to the number of all possible clusters with 6 nearest trees (all points inside one polygon would select the same cluster) and as result also the number of all possible realizations of a certain target variable. The area of single polygons is, if related to the total area, an expression of the selection probability of the observation derived from the related cluster.
 +
 
 +
Another point in this context is the determination of unbiased expansion factors for the selected trees. Therefore the inclusion probability of each single tree has to be derived. The inclusion area related to each tree is the number of all sample points, that would lead to a selection of the particular tree as cluster element. The tree is a cluster element in case that it is one of the k nearest neighbors to a sample point.  
  
 
[[Category:HowTo]]
 
[[Category:HowTo]]

Latest revision as of 14:00, 25 October 2015

Contents

[edit] The concept of cluster selection zones for fixed area sample plots

The so called jigsaw puzzle view (Roesch et al. 1993[1]) is a tessellation of the total domain of interest in non-overlapping selection zones for exclusive clusters of trees. In the infinite population approach sample elements are dimensionless points drawn from an infinite sample frame. On the other side we can only imagine a certain discrete number of possible observations derived over a number of nearest neighbors (trees) to a certain sample point. With fixed area sample plots neighbors are included up to a fixed distance (the plot radius). This "inclusion distance" can be assigned to every tree in the domain of interest. Whenever a sample point falls in that distance it would lead to an inclusion of the respective tree in a sample. This single tree inclusion zones are overlapping (as in most cases more than only one tree is selected by a sample point) whereas the resulting pattern is a tessellation of the total area in non-overlapping polygons, that would lead to a selection of a particular cluster of trees. As the number of possible observations (derived from clusters of conjointly included trees) is much smaller than the infinite number of possible sample points, the jigsaw puzzle is a classification of the infinite sample frame in a finite number of areas of possible observations. This number of observations that can be obtained by sampling is the virtual population in which sampling is conducted. The size of the polygons is an expression of the selection probability for a certain observation. This article describes, how jigsaw puzzles can be created from a map with tree locations in ArcGIS.

[edit] Workflow of creating jigsaw puzzles

A tessellation of the area of interest can e.g. be obtained by means of GIS software. The following description refers to a simple model that can be implemented in ArcGIS and is able to derive a tesselation for circular sample plots. Basis for the calculation is the information about the inclusion probability of each spatially dispersed element that is derived based on the relation between a tree's inclusion zone and the total area of interest. Therefore trees are buffered with the radius of sample plots.

Jigsawmodel.jpg

For each tree it is possible to calculate the number of trees/ha that it represents once it is included in a sample under the stipulated plot design. By intersecting all buffers one can tessellate the area in single polygons that are related to a certain combination of included trees and/or to a certain observation that can be made on this mutually exclusive set. By populating all single tree observations that are included by sample points in such a polygon, it is possible to calculate the resulting estimate of the number of trees/ha that would result at this point.

Jigsaw.png

[edit] jigsaw puzzles for k-tree sampling

In case of k-tree sampling single trees are selected as most nearest neighbors to a sample point. In contrast to fixed area plots the neighborhood is not defined by a maximum inclusion distance (a certain fixed bandwidth) but is restricted to a fixed number of neighbors. Also in this case it is possible to create jigsaw puzzles. Same like in the example above they are a decomposition of the total domain of an infinite number of possible sample point locations in exclusive areas (polygons) for each particular realization that is possible based on the given spatial distribution of trees. Each polygon is composed by the infinite number of points that would lead to the selection of a particular cluster of k trees. The decomposition of the domain is a so called Voronoi diagram in case of k=1 and a higher order Voronoi diagram (kth order) in case that k neighbors are selected per sample point. The following example shows a Voronoi diagram for 6-tree sampling.

K-treejigsaw.jpg

The number of polygons is equal to the number of all possible clusters with 6 nearest trees (all points inside one polygon would select the same cluster) and as result also the number of all possible realizations of a certain target variable. The area of single polygons is, if related to the total area, an expression of the selection probability of the observation derived from the related cluster.

Another point in this context is the determination of unbiased expansion factors for the selected trees. Therefore the inclusion probability of each single tree has to be derived. The inclusion area related to each tree is the number of all sample points, that would lead to a selection of the particular tree as cluster element. The tree is a cluster element in case that it is one of the k nearest neighbors to a sample point.

[edit] References

  1. Roesch, F.A.; Green, E.J.; Scott, C.T. 1993. An Alternative View of Forest Sampling. Survey Methodology 19 (2), 199-204.
Personal tools
Namespaces

Variants
Actions
Navigation
Development
Toolbox
Print/export