[DDI-ADG] initial use case ideas: user searching
Katherine McNeill-Harman
mcneillh at MIT.EDU
Thu Jun 9 17:23:17 EDT 2005
Following is my initial brainstorming on use cases for geography. I'm
starting from the perspective of ways in which users would want to search
by geography, giving examples of types of data, rather than specific files
(maybe we also want latter). So this is coming from the perspective of how
we want systems to be designed to be useful to users (as well as to be able
to accommodate various file types). Not sure if this is what the group had
in mind, but hopefully it's a helpful start.
This is based on my experience in helping users find and work with data
files, and many are actual questions I've received; this is not based on
experience actually implementing 2.0, so I'm sure others who have will have
other helpful ideas. I've started to try to group them into categories,
but even describing those categories proved to be a challenge.
Ways in which users want to use geography to search for data:
1. Searching for variable information within a specific geographic coverage
area at a specific level of geographic unit (yet still aggregated), e.g.
- data on voter registration by precinct within the United States
- unemployment rate by county within the United States
- population and housing data for Eastern European cities (in this case the
coverage and unit of analysis are more closely related than in the above
examples)
2. Similar to "a" above, but for microdata, in which case the information
is not aggregated to a certain level, but where the user wants a
micro-level identifier at a certain geographic level, e.g.
- want public opinion poll on a certain topic and to be able to
differentiate among user responses by state (i.e. where state of residence
was a question asked of the respondent)
3. Situations in which a time series has a quasi-geographic aspect, but
geography isn't exactly a variable, e.g.
- historical prices for municipal bonds within the United States (i.e. the
geographic level of a municipality is a quality of the bond (i.e., as
opposed to federal bonds), so might users think to search by geographic level?)
4. User looking to compare information about similar-level (yet
differently-defined) geographic areas (either aggregate or microdata), e.g.
- comparing literacy rates at the sub-national level among several
countries (i.e. some countries call those areas "states," others
"provinces;" how will they readily identify both?)
(Relating to some of this week's discussion about labels on variables)
5. Issues of geographic correspondence (this may seem to be more of a data
management issue, so I'm not sure exactly how it applies to the DDI, but
there are some data files designed to provide correspondence)
- users needing to match up two data files by a common geographic unit of
analysis (e.g. a data file on education variables to be merged with one on
economic variables, both with a common county code).
- user needing to expand their geographic understanding of a data file,
either for its own sake or to match two data files with different
geographic units, e.g. one user requesting a particular data product that
matches zip codes to telephone area codes (something one might not think of
as a geographic ID, but they can function as such in the U.S.)
Hope this is a helpful start.
Kate
___________________________________________
Katherine McNeill-Harman
Data Services Librarian
Dewey Library for Management and Social Sciences
Massachusetts Institute of Technology
77 Massachusetts Avenue, E53-100
Cambridge, MA 02139
mcneillh at mit.edu
617-253-0787
More information about the DDI-ADG
mailing list