[DDI-SRG] [Fwd: Coded missing values]

Joachim Wackerow joachim.wackerow at gesis.org
Thu Aug 30 02:25:55 EDT 2007


Pascal and others,

It is possible to define a category without a label, just stating the fact
that the category is missing.

A missing value seems to be an attribute of a category, not of a code.
The term 'value' in 'missing value' is a bit misleading here: it is not
the 'Value' of 'Code'. A code is just a code. A category gives the code
a meaning.

1.
A range of missing values seems to be convenient, but to my knowledge
this kind of definition is only possible in SPSS. Nevertheless it is
worth thinking about.
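
A minimal sketch of what a range definition would have to express, using
the 10-20 example from Pascal's message. The names MissingRange and
is_missing are hypothetical and only illustrate the idea; they are not
part of DDI or SPSS.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MissingRange:
    """Inclusive range of codes treated as missing (hypothetical helper)."""
    low: float
    high: float

    def contains(self, code: float) -> bool:
        return self.low <= code <= self.high


def is_missing(code, discrete=(), ranges=()):
    """Return True if code is a listed missing code or falls in a missing range."""
    return code in discrete or any(r.contains(code) for r in ranges)


# Codes 10 through 20 are missing, plus the single discrete code -1.
missing_ranges = [MissingRange(10, 20)]
print(is_missing(15, discrete=(-1,), ranges=missing_ranges))  # True
print(is_missing(21, discrete=(-1,), ranges=missing_ranges))  # False
```

A range cannot be expressed as a single category with a single code, so
it would probably need its own construct next to the discrete codes.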


Here are additional thoughts on missing values.

2.
I was wondering if it makes sense to define a missing value for a whole
set of variables or for a whole study, which is common practice. I'm not
sure if 'PhysicalDataProduct/MissingData' covers this sufficiently.
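
One way to read this idea is as a study-level default that a variable may
override. The sketch below assumes exactly that; the function name, the
variable names, and the codes are invented for illustration and say
nothing about how 'PhysicalDataProduct/MissingData' actually behaves.

```python
def effective_missing_codes(study_defaults, variable_overrides, variable):
    """Missing codes for a variable: its own definition if present,
    otherwise the study-wide default (assumed inheritance rule)."""
    return variable_overrides.get(variable, study_defaults)


# Hypothetical study: -1 and -9 are missing everywhere,
# except that 'income' additionally uses 999999.
study_defaults = frozenset({-1, -9})
variable_overrides = {"income": frozenset({-1, -9, 999999})}

print(sorted(effective_missing_codes(study_defaults, variable_overrides, "age")))
# [-9, -1]
print(sorted(effective_missing_codes(study_defaults, variable_overrides, "income")))
# [-9, -1, 999999]
```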

3.
It is also common practice to define a blank as the missing value for
numeric variables. SPSS, for example, would assign a system missing
value. Currently it is possible to define a blank as a string code in
Code/Value like:
'<Value> </Value>'. This seems to be error-prone. A better approach
would be to have an explicit definition for special codes like a blank.
For example:
<Value blank="true"/>
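
The following sketch, using Python's standard xml.etree.ElementTree,
illustrates why the whitespace form is fragile: once any tool trims the
element content, the blank code can no longer be told apart from an empty
element, while the explicit attribute survives. The blank attribute is
only the proposal from this message, and is_blank_code is a hypothetical
helper.

```python
import xml.etree.ElementTree as ET


def is_blank_code(value_elem):
    """True if a Value element denotes the blank code.

    Accepts the proposed (hypothetical) blank="true" attribute and falls
    back to non-empty, whitespace-only text content.
    """
    if value_elem.get("blank") == "true":
        return True
    text = value_elem.text
    return text is not None and text != "" and text.strip() == ""


implicit = ET.fromstring("<Value> </Value>")
explicit = ET.fromstring('<Value blank="true"/>')
# What remains of the implicit form after a tool trims whitespace:
trimmed = ET.fromstring("<Value></Value>")

print(is_blank_code(implicit))  # True
print(is_blank_code(explicit))  # True
print(is_blank_code(trimmed))   # False: the blank code has silently disappeared
```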

4.
System missing values cannot be described yet. But this seems to be
important if a study is only available in SPSS or SAS. Internally, the
statistical packages represent these values as numeric values outside
the range of allowed real numbers. Currently the only option would be to
recode the system missing value to a valid numeric code described
as a missing category. This doesn't seem to be a good approach.

In SPSS one system missing value code is available; in SAS several are
available. Again, an explicit definition of these system missing value
codes would be desirable. The name of the system missing value is
important, and in addition the related scheme, such as SAS. For example:
<Value missingValue="N" missingScheme="SAS"/>
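
As a rough sketch of how such an element could be read and checked, the
code below parses the proposed attributes and validates the name against
the scheme. The attributes missingValue and missingScheme are the proposal
from this message, not existing DDI. The name tables are assumptions: SAS
special missing values are written here without the leading period
(letters A-Z and the underscore, plus "." for the ordinary missing
value), and the single SPSS value is labelled "SYSMIS".

```python
import string
import xml.etree.ElementTree as ET

# Assumed encoding of system missing value names per scheme.
SYSTEM_MISSING_NAMES = {
    "SAS": {".", "_"} | set(string.ascii_uppercase),
    "SPSS": {"SYSMIS"},
}


def parse_system_missing(xml_text):
    """Return (scheme, name) for a proposed system-missing Value element.

    Raises ValueError if the scheme is unknown or the name is not a
    system missing value of that scheme.
    """
    elem = ET.fromstring(xml_text)
    scheme = elem.get("missingScheme")
    name = elem.get("missingValue")
    if scheme not in SYSTEM_MISSING_NAMES:
        raise ValueError(f"unknown missing scheme: {scheme!r}")
    if name not in SYSTEM_MISSING_NAMES[scheme]:
        raise ValueError(f"{name!r} is not a system missing value in {scheme}")
    return scheme, name


print(parse_system_missing('<Value missingValue="N" missingScheme="SAS"/>'))
# ('SAS', 'N')
```

Keeping the scheme next to the name matters because the same name can
mean different things, or nothing at all, in another package.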

I have the impression that we should discuss the subject of missing
values on a conference call (topics 1-4, perhaps other ideas). The
existing possibility to define a missing value does not seem to be
sufficient.

Achim

Pascal Heus wrote:
> Further to the message below: in SPSS, I can also define a range of 
> coded missing values (like 10-20). Is this something we'll support in 3.0?
> thanks
> *P
> 
> -------- Original Message --------
> Subject: 	Coded missing values
> Date: 	Wed, 29 Aug 2007 11:47:23 -0400
> From: 	Pascal Heus <pascal.heus at gmail.com>
> To: 	DDI Structural Reform Working Group. <ddi-srg at icpsr.umich.edu>
> 
> 
> 
> All:
> I have a couple of questions on coded missing values (like -1=missing, 
> not system missing) :
> - In DDI 2.0, missing values are flagged in the catgry element of the 
> variable. However, in SPSS, I can define missing value codes without 
> defining categories. In such a case, is there a way to capture this in 
> DDI 2.0?
> - How do we capture missing value codes in DDI 3.0? According to Wendy's 
> mapping, it goes into the l:category element. As in the SPSS example 
> above, I could however define a discrete code value that is not 
> associated with a l:category but represents a missing value. Should I 
> file this issue in Mantis?
> thanks
> *P
> 


-- 
GESIS - German Social Science Infrastructure Services
http://www.gesis.org/en/

