What is geographic information?
information that is geographically referencedWhat is information?every set of geographic information must contain some form of coordinate referenceinformation that links a point on the Earth's surface to some general property, and perhaps a time intervalsome agreed method for identifying location on the Earth's surface
agreed in the sense that the method is known to all users of the informationthe atom of geographic information <x, t, z><120W, 34N; 10am 4/8/01; air temperature 10C>if you had enough such atoms you could build up a complete picture of the Earthcompression methodsan infinity of atoms would be needed
spatial aggregation"Mt Everest is 8848m high"general property applies to a regionsamplingregional geography
area class maps
properties between samples can be interpolated
is it physical stuff or something else?Shannon-Weaver information theoryhow do I know when I have it?
can I measure how much I have?
the information content of a messageShannon-Weaver is a syntactic theoryinformation content associated with a code is determined by the number of options
26 options for a single letter in the Roman alphabetone letter resolves uncertainty by specifying one of 26 optionssome letters occur more often than othersvalue in resolving uncertainty varies in inverse proportion to rate of occurrenceif there are n symbols in a code, and the proportion of each symbol i is denoted by pi, then a measure of the information content of the code is its information statistic:e is less valuable than z
- (sum over i) pi log2 pithis is maximum (the code is most efficient) when all symbols are used equallyin this case the information statistic is log2 nif there are 60,000 words in the English language, the most efficient code would have 16 bits (2 bytes)this is the number of binary digits needed to represent the code
English could be written in a language of pairs of 8-bit ASCII codesinstead of combinations of Roman letters ranging from 1 letter to ~16
concerned with the syntax of communicationIssues not resolved by Shannon-Weaversender and receiver both understand the codeno concern for semanticswhat does the code mean?
what if the receiver does not understand the code?Systemwhat if the receiver already knew the content?
no measure of the impact of the message on the receivertheory is dependent on mediaa message in codewhat about paper maps?
a container of informationA system contains a given item of information if it is capable of resolving a query to which that information is the answerhuman being, computer, map
I know the temperature in SB if I can tell you the temperature in SBInformation links already-familiar conceptsa map or GIS "knows" the temperature in SB if it can tell you....definition is medium-independent
a map requires a trained usersuppose a system was capable of resolving all geographic queries
Digital Earthare there exceptions to this idea?it would contain all of the information necessary to create a complete geographic Earth
is any query capable of determining whether such a system is Digital Earth or the real Earth?
compare the Turing testcan a query resolve whether a computer is a machine or a human being?
facts about the Earth that cannot be derived from any representation of the Earth?naive geography
Mt Everest is 8848m highExample:meaningless to someone who does not understand the concept Mt Everest or the concept 8848m higha concept is understood or already familiar if it is already linked to other conceptsgeographic information is meaningless to someone who does not understand the Earth's space-time frame
Mt Everest to images, facts, coordinatesinformation's semantic content is related to the number of links that already exist in the receiving system for the concepts at both ends of the link8848m to systems of measurement, definition of m, definition of high
and is zero if the link it contains already exists in the receiving systemthe semantic value of an atom of information is determined by the receiving systemnot by the sender, the channel, or the message
20 coordinate pairs representing the outline of CaliforniaShannon-Weaver perspectiveeach resolved to 1m (Mercator projection)
each x coordinate specifies one of 40,000,000 optionsHow many distinct and new geographic queries does this information allow the receiving system to resolve?
each y coordinate specifies one of 20,000,000 options51 bits in total
8 bytes160 bytes for the whole message
how many queries of the form "is x,y in California"?1. Infinityhow many queries of the form "is z in California" given knowledge of the location of z
but coordinates are only to 1m accuracy2. The number of sq m in Californiaanswers should be also
411,000,000,000Conclusions:the system appears to be extremely knowledgeable
because it possesses a simple rule that allows a very short message to resolve an enormous number of apparently independent queriesthe rule:the rule (or theory, generalization) compresses information at a very high ratecompare the rule for generating the infinite number of digits of pi
allows a small amount of information to be expandedthe sequence of ordered coordinate pairs defines a polygon
all points within this polygon have the property specified in the message
geographic information consists of atoms that link propertiesinformation is possessed when queries can be resolved
information is not physical stuffShannon-Weaver information theory deals with the efficiency of codesthe semantic value of information is determined by the receiving system alone
by the number of new queries that can be answered using the informationgeographic information has properties that allow answers to queries to be imputed, predicted, and otherwise obtained from rulesby the number of links already existing for the pairs of concepts that the information links
Tobler's lawuniform regions