Transformations create new objects and data sets from existing objects and data sets
buffering takes points, lines, or areas and creates areasTwo versionsevery location within the resulting area is either:in/on the original objectwithin the defined buffer width of the original object
discrete object:Applicationsfor every object, result is a new polygon objectfield (objects cannot overlap):new objects may overlapevery location on the map has one of two values:inside buffer distance
outside buffer distance
find all households within 1 mile of a proposed new freewayVariantsand send them notification of proposalfind all areas of Los Padres National Forest beyond 1 mile from a roadfind all liquor stores within 1 mile of a school
and notify them of a proposed change in the lawfind households within a fixed service radiusCSISS cookbook and UCI's medical center
raster and vector versionsvary the object's buffer width according to an attribute value
e.g. noise buffers depending on road traffic volumevary the rate of spread according to a friction fieldonly in rasterThiessen polygons for point objects
e.g. travel speed variesthe area closest to each point forms a polygon
Determine whether a given point lies inside or outside a given polygon
assign a set of points to a set of polygonsAlgorithme.g. count numbers of accidents in counties
e.g. whose property does this phone pole lie in?
draw a line from the point to infinityField casecount intersections with the polygon boundary
inside if the count is odd
outside if the count is even
point must lie in exactly one polygonDiscrete object case
point can lie in any number of polygons, including zeroIssues
algorithm for a coveragewhat if the point lies on the boundary?
special cases
Create polygons by overlaying existing polygons
how many polygons are created when two polygons are overlaid?Discrete object caseexample
find overlaps between two polygonsField casee.g. a property and an easementcreates a collection of polygons
overlay two complete coveragesApplicationcreates a new coverage
e.g. find all areas that are owned by the Forest Service and classified as wetlandin vector or rasterin raster the values in each cell are combined, e.g. added
areal interpolationIssuessource zones with known data
target zones with unknown data
estimates based on areas of overlap
spatially extensive or spatially intensive
major computing workloadindexingswamped by sliverstolerance
What is interpolation?
intelligent guessworkTwo methods commonly used in GISan interval/ratio variable conceived as a field
temperaturesampled at observation points
soil pH
population densityneeded:
values at other points
a complete surfacea contour map
a TIN
a raster of point values
inverse-distance weighting (IDW)Moving average/distance weighted average/inverse distance weightingKriging (geostatistics)
estimates are averages of the values at n known pointsExampleknown values z1,z2,...,znis the most widely used methodunknown value z = Sum over i (wizi) / Sum over i (wi)
where w is some function of distance, such as:
w = 1/dkan almost infinite variety of algorithms may be used, variations include:w = e-kd
the nature of the distance function
varying the number of points used
the direction from which they are selectedobjections to this method arise from the fact that the range of interpolated values is limited by the range of the data
no interpolated value will be outside the observed range of z valuesother problems include:peaks and pits will be missed if they are not sampled
outside the area sampled the surface must flatten to the average value
how many points should be included in the averaging?summary: IDW is popular, easy, but full of problemswhat to do about irregularly spaced points?
how to deal with edge effects?
ozone concentrations at CA measurement stationsKrigingobjectives:
1. estimate a complete field, make a mapdata sets:
2. estimate ozone concentrations at other locationse.g. citiesmeasuring stations and concentrations (point shapefile)IDW wizard in Geostatistical Analyst
CA outline (polygon shapefile)
DEM (raster)
CA cities (point shapefile)opening screen defines data sourcethings to noticenext screen defines interpolation method
which power of distance? (2)next screen gives results of cross-validation
how many sectors? (4)
how many neighbors in each sector? (10-15)amount of detail where there is no datagenerally smooth surface
highs in LA, S central valley
developed by Georges Matheron, as the "theory of regionalized variables", and D.G. Krige as an optimal method of interpolation for use in the mining industryVariogramsthe basis of this technique is the rate at which the variance between points changes over space
this is expressed in the variogram which shows how the average difference between values at points changes with distance between pointsKriging is based on an analysis of the data, then an application of the results of this analysis to interpolation
vertical axis is E(zi - zj)2, i.e. "expectation" of the differenceDeriving the variogrami.e. the average difference in elevation of any two points distance d apartmost variograms show behavior like the diagramd (horizontal axis) is distance between i and j
the upper limit (asymptote) is called the sillin developing the variogram it is necessary to make some assumptions about the nature of the observed variation on the surface:the distance at which this limit is reached is called the range
the intersection with the y axis is called the nugget
a non-zero nugget indicates that repeated measurements at the same point yield different valuessimple Kriging assumes that the surface has a constant mean, no underlying trend and that all variation is statisticaluniversal Kriging assumes that there is a deterministic trend in the surface that underlies the statistical variation
in either case, once trends have been accounted for (or assumed not to exist), all other variation is assumed to be a function of distance
the input data for Kriging is usually an irregularly spaced sample of pointsComputing the estimatesto compute a variogram we need to determine how variance increases with distance
begin by dividing the range of distance into a set of discrete intervals, e.g. 10 intervals between distance 0 and the maximum distance in the study area
for every pair of points, compute distance and the squared difference in z valuesassign each pair to one of the distance ranges, and accumulate total variance in each range
after every pair has been used (or a sample of pairs in a large dataset) compute the average variance in each distance range
plot this value at the midpoint distance of each range
fit one of a standard set of curve shapes to the points
"model" the variogram
once the variogram has been developed, it is used to estimate distance weights for interpolationinterpolated values are the sum of the weighted values of some number of known points where weights depend on the distance between the interpolated and known points
weights are selected so that the estimates are:
unbiased (if used repeatedly, Kriging would give the correct result on average)problems with this method:minimum variance (variation between repeated estimates is minimum)
when the number of data points is large this technique is computationally very intensivesimple Kriging routines are available in the Surface II package (Kansas Geological Survey) and Surfer (Golden Software), in the GEOEAS package for the PC developed by the US Environmental Protection Agency, and in ArcInfo 8 as an add-on Geostatistical Analystthe estimation of the variogram is not simple, no one technique is best
since there are several crucial assumptions that must be made about the statistical nature of the variation, results from this technique can never be absolute
example
selection of methodsimple Krigingthings to noticeordinary Kriging allows for a trendanalysis of the variogram
co-Kriging includes a correlated variable
indicator Kriging is for binary datafitting a modelhow many neighbors?
directional effectssimilar patternless detail in remote areasrebounds to the mean at the edge
smoother
Suppose you had a map of discrete objects and wanted to calculate their density
density of populationMethodsdensity of cases of a disease
density of roads in an area
density would form a field
density estimation is one way of creating a field from a set of discrete objects
count the number of points in every cell of a rasterDensity estimation using kernelsmeasure the length of lines, e.g. roadsresult depends on cell sizeresult is very noisy, erratic
think of each point being replaced by a pile of sand of constant shapeDensity estimation and spatial interpolation applied to the same dataadd the piles to create a surface
example kernel
width of the kernel determines the smoothness of the surface
density of ozone measuring stationsusing Spatial Analyst
kernel is too small (radius of 16 km)what's the difference?