[DDI-SRG] tomorrows agenda

Joachim Wackerow joachim.wackerow at gesis.org
Thu May 14 05:03:49 EDT 2009


some topics for 1
- HTML documentation: how should it be done for 3.1? With docflex? If 
yes, the DDI Alliance should get a license of the software.
- genericode: clarification needed for fixed field names in genericode 
file like "Code" and "CodeDescription" (documentation issue). Also I 
will provide a better genericode example for the documentation.

other topics
- review the allowed characters in a DDI URN
Background 1
DNS-based URN resolution can affect the allowed character set for DDI 
URN's. The dot character "." has the meaning of an hierarchical 
separator in DNS. It is in DNS the only character with a special 
meaning. It is used in DDI URN as separator between 
maintainable-object/id-of-maintained-object and 
contained-object/id-of-contained-object. Allowed alternatives to "." 
would be:
"+"  "," "-" "=" "@" ";" "$" "_" "!" "*" "'"
These characters are allowed in both the URN syntax and the query string 
of an URL. I think "+" would be a good alternate candidate.

Background 2
When we really want to be open for DNS-based URN resolution and want to 
support smooth transition between both worlds it can make sense to use 
the agency id (DDI domain name) as a hierarchical top level field in 
DNS. In the hierarchical DNS sense the following DNS name would make 
sense: OtherPartsOfDDIURN.3_1.gesis.de.ddi.urn.arpa.
It reads:
arpa is top level domain
urn is sub domain of arpa
ddi is sub domain of urn
...
This structure would support the hierarchical resolution system of DNS.

Background 3
BaseIDType in reusable
BaseIDType is a xs:string restricted by the regular expression
([A-Z]|[a-z]|\*|@|[0-9]|_|$|\-)*
The field documentation says: Valid characters include A-Z, a-z, @, 0-9,
_, -, $

Anyway, I'm wondering why are special characters ("*", "$", "@") 
necessary in an id. These characters have sometimes a special meaning in 
another environment (like URN resolution) and therefore can cause 
problems. Then they must be rewritten in another form and escaped. Are 
they really required?

I would suggest as allowed characters just A-Z, a-z, 0-9,
_, - with a starting letter in the ID.
as regexp: ([A-Z]|[a-z])+([A-Z]|[a-z]|[0-9]|_|\-)*

Achim


Wendy Thomas wrote:
> Two major items on the agenda:
> 
> 1: List of what corrections are being made for 3.1 final publication (if 
> approved), Other steps in that production. We will need to provide a 
> REALISTIC timeline for its publication during the meeting at IASSIST
> 
> 2: DDI 2.1 maintenance
> 
> Also, please let me know who will be at IASSIST and who will be at the 
> Expert Meeting on Monday May 25.
> 
> Wendy
> 
> Wendy L. Thomas                          Phone: +1 612.624.4389
> Data Access Core Director		 Fax:   +1 612.626.8375
> Minnesota Population Center              Email: wlt at pop.umn.edu
> University of Minnesota
> 50 Willey Hall
> 225 19th Avenue South
> Minneapolis, MN 55455
> _______________________________________________
> DDI-SRG mailing list
> DDI-SRG at icpsr.umich.edu
> http://www.icpsr.umich.edu/mailman/listinfo/ddi-srg


-- 
GESIS - Leibniz Institute for the Social Sciences
Postal address: P.O. Box 122155, 68072 Mannheim, Germany
Visiting address: B2 1, 68159 Mannheim, Germany
Phone: +49 (0)621 1246 262
Fax: +49 (0)621 1246 100
E-mail: joachim.wackerow at gesis.org
www.gesis.org/en/institute/


More information about the DDI-SRG mailing list