[DDI-SRG] iIn-line data / defined range of values / data display format

Joachim Wackerow joachim.wackerow at gesis.org
Thu Aug 16 10:15:17 EDT 2007


Some additional thoughts on defined ranges/virtual recode below.

Wendy Thomas wrote:
> On Wed, 15 Aug 2007, Joachim Wackerow wrote:
> 
>> In the discussion with Larry Hoyle about a SAS converter I noticed
>> several things. Now I have three questions:
>>
>>
...

>> Defined range of values, CodeScheme / CategoryScheme
>>
>> Variables with interval or ratio measurement can have ranges of data
>> with different code values but same category labels.
>>
>> Example:
>> BMI
>>     low-<18.5 =  "Underweight"
>>     18.5-24.9 =  "Normal weight"
>>     25-29.9 =  "Overweight"
>>     30-high =  "Obesity"
>>
>> For such a variable it would make sense to define ranges of values
>> associated with the same category. A derived variable with a related
>> recode would be not necessary.
>>
>> For my understanding we have no possibility to represent this approach.
>>
>> A solution would be in Code of CodeScheme to have Range as a choice for
>> Value.
> 
> I am uncomfortable using "ranges" as a code scheme. This is, in effect, 
> an unrealized recode, simply providing the definitions of these terms in 
> relation to the ranges. In analysing the data it is designed to use, in 
> this case, the actual BMI. If I were using it as a category "obese" etc 
> I would recode this information either on the fly or by creating a new 
> variable (or overwriting the BMI variable). I would make a separate 
> scheme of virtual recodes and create a variable without a physical 
> representation.

As I said in the other email I agree on the unrealized recode. 
Additionally I have following thoughts:

A virtual recode realized by a defined range in the code scheme seems to 
be a comfortable approach for variables with grouped numbers or codes. 
It is not necessary to define codes for the groups nor to define a 
derivation command in the variable container. The information defined by 
Range would be enough for an application. What could be the disadvantage 
of allowing Range in CodeScheme/Code as a choice to Value?

A indicator for virtual variables would be in general helpful. So it is 
clear that this variable does not exist in the data set, only as a 
virtual derivation. When the virtual state is explicitly indicated, the 
variable can't be misunderstood as a variable just without a reference 
to a physical representation.

Achim


More information about the DDI-SRG mailing list