LECTURE 2: REPRESENTATION
1. WHY ARE REPRESENTATIONS NEEDED?
How do we know what we know?
The human senses
sightEverything else we know about the world we know through communicationvisible spectrumsound
line of sighthorizon at ~10km
visibility 100kmaudible spectrumtaste50Hz to 15KHzrange 100m
touch
smell
textKnowledge of the surface of the Earth
speech
maps
photographs
radio, TV
Internet
databases
500,000,000 sq kmWe rely on communicated information to:on average 100 sq m is sensed directly at any time5 billion years
p=100/500,000,000,000,000=.0000000000002extend that through migration, travel
we live through 70we know almost nothing about the surface of the Earth via our senses
p=70/5,000,000,000=.000000014
decide where to go as tourists, shoppersAll such information must use a representation
run large corporations
manage agriculture, forestry
choose where to live
what is communicated is a representation of the real thing
Locations in time and space reduced to a few straight lines
what would more detail show?Representations occur:
representations always simplify the real worldthe real world is infinitely complex
representations reduce information to manageable volume
in the human mind, in memory and reasoningWe must have them:
in speech
in written text
in photographs
in digital databases
in GIS
to communicate
to go beyond the space-time limits of our senses
to deal with an infinitely complex world
Much human communication is now digital
sent through a "pipe" that can transmit only 0s and 1s
stored on devices that can store only 0s and 1s
processed as 0s and 1s
text in email, word processorsWhen two humans communicate at a distance, chances are the content is expressed at some point in digital form
voice in telephone
music on CD
DVD, digital TV
FAX
Digital
from "digit" meaning fingerDigital vs Analog
a character in a counting system
how many symbols?0 thru 9to all intents and purposes "digital"="binary"
0 and 1
the binary system
how to express knowledge exclusively in 0s and 1s?how to describe what we know about the world in 0s and 1s?
the fundamental question of data modeling for GIS
analogWhy binary?information expressed by scaling quantitiesdigital
good for quantitative information
a paper map is analogthe world is scaled to a miniature representation
representative fraction is key, e.g. 1:24,000information expressed by symbols
there must be a coding scheme to express the representation in symbols
sender and receiver must agree on the schemewhat does scale mean?
same for all kinds of informationSome digital coding schemessame technology whatever the content
"a bag of bits"
massive economies of scale
miniaturization
textDigital coding schemes important in GISASCII codeimages
one code per characterA = 65, B = 66, etc.26 letters plus common symbols
originally 128, extended to 256
8 binary digits (one byte) per characterJPEG, TIFF, GIF, BMP, ...musicMIDI, MP3FAXCCITTmaps, geographic informationGIS data models and structures
ASCIIWhat if you received this message:eight bits per characterinteger
names, text annotation3 bits per decimal digitfloat (single precision)
n bits give 2n optionsit takes about 3 times as many binary digits to express a number16 or 32 bits per whole number (short, long integer)-32767 to +32767
-2 billion to +2 billion1 sign bitdouble precision
7 exponent bits (-63 to +63)
24 mantissa bits (8 significant digits)1 sign bitwhat might you need to know to 18 significant digits?
7 exponent bits
56 mantissa bits (18 significant digits)BLOB
binary large object
e.g. image
01000001010101000101010001000001
01000011010010110101000001000101
01000001010100100100110001001000
01000001010100100100001001001111
01010010
What if you had to send a message to outer space?
the Voyager messageCommunication of information via a channel
How efficient is the channel of communication?
is there information that can't be expressedThe Tower of Babel allegorytext omits gesture, pronunciation, voice inflectionwhat are the limits of a GIS as a communication channel?can "duh" be written in text?what information about a place can't be expressed in GIS?what if the sender and receiver can't understand each other?different language
different alphabet
different GIS
interoperability
Geographic information
information about some place on the surface of the Earthor near the surfaceone of the earliest forms of shared information
at some point in timehunters and gatherers reporting back to the bandstoring on paper
"there's good hunting near the old tree"
early stick maps for navigation in the Pacific
drawings on cave wallsthe printing press in the 15th Century
information accessible to all
shared knowledge as a human community asset
Prince Henry the Navigator, 1394-1460
The Internet
massive new capability for sharing, communicating geographic informationin digital form
The atom of geographic information
<location, time, attribute>it's cold today in Ottawageneral methods for describing location
at 45 North, 75 East at 12 noon EST the temperature was -10 Celsiuseveryone around the world understands latitude and longitude
similarly for timeattributes must also be generally understood
"cold" is subjective and relative
-10 Celsius is generally understood
did Hugh Grant climb a hill or a mountain?
Suppose we could capture it all
complete representation of the planetAl Gore's dream of a Digital Earth
past, present, and future
a "mirror world"
“Imagine, for example, a young child going to a Digital
Earth exhibit at a local museum. After donning a head-mounted display,
she sees Earth as it appears from space. Using a data glove, she zooms
in, using higher and higher levels of resolution, to see continents, then
regions, countries, cities, and finally individual houses, trees, and other
natural and man-made objects. Having found an area of the planet she is
interested in exploring, she takes the equivalent of a ‘magic carpet ride’
through a 3-D visualization of the terrain. Of course, terrain is only
one of the numerous kinds of data with which she can interact. Using the
system’s voice recognition capabilities, she is able to request information
on land cover, distribution of plant and animal species, real-time weather,
roads, political boundaries, and population. She can also visualize the
environmental information that she and other students all over the world
have collected as part of the GLOBE project. This information can be seamlessly
fused with the digital map or terrain data. She can get more information
on many of the objects she sees by using her data glove to click on a hyperlink.
To prepare for her family’s vacation to Yellowstone National Park, for
example, she plans the perfect hike to the geysers, bison, and bighorn
sheep that she has just read about. In fact, she can follow the trail visually
from start to finish before she ever leaves the museum in her hometown.
She is not limited to moving through space, but can also
travel through time. After taking a virtual field-trip to Paris to visit
the Louvre, she moves backward in time to learn about French history, perusing
digitized maps overlaid on the surface of the Digital Earth, newsreel footage,
oral history, newspapers and other primary sources. She sends some of this
information to her personal e-mail address to study later. The time-line,
which stretches off in the distance, can be set for days, years, centuries,
or even geological epochs, for those occasions when she wants to learn
more about dinosaurs.” (U.S. Vice President Al Gore, in a speech written
for presentation at the California Science Museum, Los Angeles, January
1998)
How many atoms are there?
an infinite numberReduce the level of detail, aggregate, generalize, approximate
to make a two-word description of every sq km on the planet would require 10 Gigabytes
to store one number for every sq m on the planet would require 1 Petabytethat's too many for any system
how to limit?
ignore the waterThe problemthat's 2/3 of the planetone temperature for all of Ottawaone number for an entire polygonsample the spaceonly measure at weather stationsall geographic data miss detail
because temperature varies slowlyall are uncertain to some degree
there are many ways of doing this
a GIS user must make choices
GIS designers must allow for many options
geographic description is complex
The most important of the options
how we think about the worldDiscrete objects
points, lines, areas (or volumes) having known propertiesFieldslittering an otherwise empty space
can be manipulated/edited
can be found in the real world
may overlap
can be counted
how many people living in Santa Barbara?represent as shapefiles
how many vehicles in California?
how many trees growing on the UCSB campus?
how many lakes are there in Minnesota?
how many mountains in Scotland over 3000 ft?
how many clouds in the sky?
how many cities over 1 million population?
how many atmospheric lows in the northern hemisphere today?
things it's worth measuring at every location on the planettemperature
soil pH
soil type
land cover type
elevation
rainfall
ownership
each of these variables has one value everywherevariable is a function of locationfield = a way of conceiving of geography as a set of variables each having one value at every location on the planetz = f(x,y,z,t)represent one variable as:polygons
raster
TIN
sample points
contours
number of such objects is property of the representation, not the phenomenonLakes in Minnesotacannot be manipulated
cannot overlap
how many are there?Weather forecasting
fronts, highs, lows, or pressure surfaces?Objects are intuitive, part of everyday life
fields are more associated with scienceBoth objects and fields can be represented either in raster or in vector form
Ontology: the study of the basic elements of description
"what we tell about"
discrete objects and fields are two different ontologies