[DDI-SRG] Updated 2.1 to 3.0 mapping UPDATE
Wendy Thomas
wlt at pop.umn.edu
Tue Jan 13 15:26:19 EST 2009
Sanda,
This is the intended mapping based on the definitions in DDI 2.1 and 3.0.
There are some things in DDI 3.0 that are simply not supported by the
content of 2.1. Your example of Concept and Universe are a case in point.
In terms of mechanical translation either the software has to remove the
duplication during the process automatically or with human intervention OR
cleanup and modification have to take place after the intial conversion.
If you look at any of the Concept entries you will note that the content
goes to an entry in the ConceptScheme and the variable or question
references that entry. If you look at the output of the SPSS and SAS
coversions to DDI 3.0 you will note that they do not take into
consideration replication of categories. I've got two options. Use it as
is and live with a DDI 3.0 complient but not optimized file OR run a post
translation cleanup. Well a third option is not to use it all.
Clearly translation tools will need to deal with these problems. The point
of the spreadsheet is to indicate into which object in DDI 3.0 the
information would go. The biggest problem I was finding was "felxibility"
of 2.1 in terms of how information was entered in any given field and what
this means for the translation system.
What I need to know from reviewers is errors in Xpaths, misdirection of
information to the wrong location and instances of locations where you
have consistantly interpreted the content of a 2.1 field in a different
way.
Not sure if this clarifies things for you but keep asking the questions
and I'll try to answer.
Wendy
On Tue, 13 Jan 2009, Sanda Ionescu wrote:
> Hi, Wendy.
>
> As I look at your 2->3 mapping, I have a "philosophical" question:
> Is this an "ideal world" mapping, meaning - this is how we would map the
> 2.1 fields if we could (or if we did the conversion manually, or
> semi-manually?); or, is it supposed to have a practical application
> (i.e. specs for a conversion tool)?
>
> And this is the reason I'm asking: as I'm sure you know, I attempted a
> mapping myself this past summer, with the practical intention of
> building a conversion stylesheet. So I had to figure out what the
> stylesheet would do. In some instances I found out that even though a
> theoretical mapping was possible, it was not feasible in practice.
> Examples would be "concept" and "universe" at variable level - if we
> created an entry for each concept and universe statement found under
> variables, we would end up with identical entries repeated at nauseam in
> the schemes, with the same content but different IDs! We would also be
> unable to use the in-built hierarchy in the universe scheme, since a
> machine would not be able to establish whether a certain universe
> (expressed in 2.1 as a string) is a child of another, or else. The same
> thing applies to some of the responsibility statements in DDI 2.1 - we
> may have the same organization as author, producer, distributor, and if
> we created organization entries for each time it appears, we would end
> up with a lot of redundant entries in the Organizations Scheme.
>
> So, to me it is important at this point to establish whether we plan to
> release an "ideal" mapping, or a "practical" mapping, because this
> knowledge would inform my evaluation of the current mapping.
>
> Thank you so much.
> Sanda.
>
> Sanda Ionescu
> ICPSR
> University of Michigan
> P.O. Box 1248
> Ann Arbor, MI 48106
>
> Phone, Fax: 734-615-7890
>
> -----Original Message-----
> From: Wendy Thomas [mailto:wlt at pop.umn.edu]
> Sent: Monday, January 12, 2009 5:59 PM
> To: TIC list
> Cc: Sanda Ionescu; tassoukis at iza.org; askitas at iza.org; Mary Vardigan
> Subject: Re: Updated 2.1 to 3.0 mapping UPDATE
>
> In working with the mapping I have already found and corrected the
> following errors:
>
> VersionResponsibiliyt corrected spelling to Responsibility
> subTitl should be ../r:SubTitle not r:Title
> altTitl should be ../r:AlternateTitle not r:Title
> pPhysicalDataStructure should be p:PhysicalDataStructure
> r:FundingInformation/r:Role should be r:FundingInformation at role
> r:GrantNumber/r:Role should be r:GrantNumber at role
> COMPLETED CONTENT FOR 1.2 and 1.3 as follows
> 1.2 guide? PCDATA
> s:StudyUnit/l:LogicalProduct/l:DataRelationship/r:Description
> r:Description r:StructuredStringType Initial home will need to be
> parsed
> 1.3 docStatus? PCDATA s:StudyUnit/r:VersionRationale
> r:VersionRational r:InternationalStringType
>
>
>
> Wendy L. Thomas Phone: +1 612.624.4389
> Data Access Core Director Fax: +1 612.626.8375
> Minnesota Population Center Email: wlt at pop.umn.edu
> University of Minnesota
> 50 Willey Hall
> 225 19th Avenue South
> Minneapolis, MN 55455
>
Wendy L. Thomas Phone: +1 612.624.4389
Data Access Core Director Fax: +1 612.626.8375
Minnesota Population Center Email: wlt at pop.umn.edu
University of Minnesota
50 Willey Hall
225 19th Avenue South
Minneapolis, MN 55455
More information about the DDI-SRG
mailing list