Numerical Meanings of Probabilistic Expressions (ICPSR 6046)
Principal Investigator(s): Mosteller, Frederick; Youtz, Cleo
Summary: These data were collected to obtain a clearer understanding of the quantitative meanings that people perceive in common words used to describe probabilistic outcomes. For example, in everyday language, people apply the expressions "always" and "certain" to events that occur in fewer than 100 percent of their opportunities. In this study, science writers were surveyed and asked to quantify, as a percentage, their understanding of each of 52 expressions.
Access Notes
These data are available only to users at ICPSR member institutions.
Study Description
Citation
Mosteller, Frederick, and Cleo Youtz. NUMERICAL MEANINGS OF PROBABILISTIC EXPRESSIONS. Cambridge, MA: Frederick Mosteller and Cleo Youtz [producers], 1993. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 1994. doi:10.3886/ICPSR06046.v1
Persistent URL: http://dx.doi.org/10.3886/ICPSR06046.v1
Funding
This survey was funded by:
- National Science Foundation (SES 8401422)
Scope of Study
Summary: These data were collected to obtain a clearer understanding of the quantitative meanings that people perceive in common words used to describe probabilistic outcomes. For example, in everyday language, people apply the expressions "always" and "certain" to events that occur in fewer than 100 percent of their opportunities. In this study, science writers were surveyed and asked to quantify, as a percentage, their understanding of each of 52 expressions. They were also asked to indicate how they thought their readers would quantify each term, giving both the upper and lower limits they thought their readers would set for each expression. One group of expressions included the word "probability", and ranged from "very high probability" to "very low probability". Another used various forms of the word "probable", such as "very probable" and "improbable". Other expressions were centered around the word "chance", ranging from "better than even chance" to "less than even chance". The survey also included words like "always", "often", "frequently", "never", and "sometimes". Also tested were expressions with regularly used modifiers such as "very", or negation ("not", "un-", "im-", "in-"), so that the effect of such modifiers could be evaluated. The sample of respondents was split to permit assessment of the effects of order of presentation: half received a form that ranked the expressions within 15 groups from high probability to low, while the other half received a form ordering the expressions from low probability to high.
Subject Terms: language, language study, perceptions
Geographic Coverage: United States
Time Period:
- 1987
Universe: The universe consisted of 637 members of the National Association of Science Writers in the United States and Canada.
Data Types: survey data
Data Collection Notes:
The data are provided, as received from the producer, in 104 discrete small files, each corresponding to one of the 52 probabilistic expressions and to one of the two survey forms. The files have been edited by ICPSR for easier handling by statistical software. Specifically, two lines of comment, which identified the expression to which the file's data referred, have been removed from each file. In their place, two variables have been added to each record: one identifying the expression and a second identifying the form code. In addition, the respondent identification code was edited to remove blanks. Users should note that the data are not arranged in these files in fixed columns but in a free-format list, with one record per line and each variable delimited by a comma.
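The free-format layout described above (one record per line, variables separated by commas) can be read with a general-purpose CSV parser rather than a fixed-column reader. The following is a minimal sketch in Python, not a setup supplied by ICPSR: the function name read_records is hypothetical, the sample lines are made up for illustration, and no field order or field meaning is assumed. The codebook should be consulted for the actual variable layout.

```python
import csv
import io


def read_records(lines):
    """Parse free-format records: one record per line, comma-delimited.

    `lines` is any iterable of text lines, such as an open file.
    Returns a list of records, each a list of whitespace-trimmed strings.
    """
    reader = csv.reader(lines, skipinitialspace=True)
    records = []
    for row in reader:
        fields = [field.strip() for field in row]
        if any(fields):  # skip blank lines
            records.append(fields)
    return records


# Made-up lines for illustration only; these are NOT actual study data,
# and the number and order of fields here are assumptions.
sample = io.StringIO("A001, 1, 2, 95, 90, 99\nA002,1,2,80,70,90\n")
records = read_records(sample)
print(len(records), records[0])
```

Because each record carries its own expression and form variables, records parsed this way from several files can be appended to one list without losing track of which expression and which form they belong to.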
Methodology
Sample: A total of 238 members of the population responded (37 percent response rate).
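The stated response rate can be checked against the universe size of 637 given under Scope of Study. A small sketch of that arithmetic in Python follows; the variable names are illustrative only.

```python
universe_size = 637  # members of the National Association of Science Writers
respondents = 238    # respondents reported for the sample

# Response rate as a percentage of the universe.
response_rate = 100 * respondents / universe_size
print(f"{response_rate:.1f} percent")  # prints "37.4 percent"
```

Rounded to a whole number, this agrees with the 37 percent reported for the sample.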
Data Source:
self-enumerated questionnaires
Extent of Processing: ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Performed recodes and/or calculated derived variables.
- Checked for undocumented or out-of-range codes.
Version(s)
Original ICPSR Release: 1994-10-19
Version History:
- 2006-01-12 All files were removed from dataset 106 and flagged as study-level files, so that they will accompany all downloads.
- 2006-01-12 All files were removed from dataset 105 and flagged as study-level files, so that they will accompany all downloads.
