How to Choose Which Statistical Test to Use

If you have questions and the data to address them but aren’t sure where to go from there, this guide can help. Find the type of question you want to answer – for example, “how many” or “are these two things related” – in the first column to see the recommended statistical analysis. The data requirements for each test are included.

Based on Question and Variable Types

Use the first table below to find the question that most closely matches what you want to know, and look for the appropriate statistical test on the right! The second table provides more information specific to each method of analysis. Note: This represents many of the most used statistical tests but is not meant to be an exhaustive list.

Reminders

Some statistical tests require variables to be measured at a specific level of measurement. Depending how a question is asked, variables can be at one of four levels:

nominal or categorical: answers are unordered categories, like a list of cities.
ordinal: answer categories have an order but the categories are of unequal sizes or are “squishy,” such as Likert scales. In some disciplines, it is common to use these variables as if they were continuous.
interval: answers are actual numbers but there is no true zero, IQ scores or temperature in Fahrenheit are examples.
ratio: answers are actual numbers with a true zero, such as dollar amounts or number of days.

Note: Interval and ratio are often grouped together (I/R) and called continuous.

Table 1. Statistical Test by Question

Question	Statistical Test	Notes
How many people answered a question this way? Or How did people answer this question?	Frequency table	If there are a lot of unique values, a frequency table may be inefficient and summary statistics (range, mean, mode, etc.) should be used instead.
What answer was the most common?	Mode (or frequency table)	Applicable for variables at all levels. There may be multiple modes for a single question.
What is the “average” value of this variable?	Median or Mean	Use median with ordinal-level variables or those whose distributions have significant outliers. Use mean with continuous variables.
How large is the spread of respondents’ answers and how much do they differ from one another?	Range and Standard Deviation	Both require at least ordinal-level data, standard deviation technically requires continuous.
Is a score one point in time related to a later score on the same item?	Paired-samples T-test	–
Does group one differ from group two on some continuous variable? (e.g., Do men and women differ on the number of meals they eat out each week?)	Independent-samples T-test	–
Do members of three or more groups differ from each other on some continuous measure? (e.g., Do individuals with a high school degree watch more tv than those with some college or those with a college degree?)	Oneway ANOVA	–
Are these two concepts related?	Chi-square or Correlation	Chi-square can be used with nominal or ordinal data, correlation requires two continuous variables. Neither require identifying an independent and a dependent variable.
Is my independent variable related to my dependent variable? (e.g., Is the amount of sleep I get at night related to how many minutes I exercise the next day?)	Bivariate Regression	Need to specify which variable is independent and which is dependent. Both are typically at least ordinal, or “dummy” (dichotomous yes/no) variables.
Does this group of variables predict my outcome of interest?	Multiple Regression	The type of regression depends on what you are trying to predict or explain. Two common types are ordinary least squares (OLS, when the outcome is continuous) and logistic (“yes/no” type outcome).

Table 2. Tests of Significant Differences

Test	Variable Types Needed	Null Hypothesis	Alternative Hypothesis	Test Statistic	Effect Size Statistic
One-Sample T-test	I/R (sample mean, population mean)	The means are the same	The means are different (two-tailed)	T-test	Cohen’s D
Independent-Samples T-test	Independent: Nominal (2 groups) Dependent: Ordinal, I/R	The two samples are drawn from the same population	The populations from which they are drawn are different (two-tailed)	Levine’s test (equal variances vs unequal variances) T-test	Cohen’s D
Dependent (Paired)-Samples T-test	Independent: Person or pair Dependent: Ordinal, I/R	No change from T1 to T2	Change from T1 to T2	T-test	Cohen’s D
Oneway ANOVA	Independent: Nominal Dependent: Ordinal, I/R	The means for all categories are equal	The mean for at least one category is different	F-ratio Tukey’s HSD (post-hoc tests – which pairs are different)	Eta-squared
Chi-Square	Var1: Nominal, Ordinal (few categories) Var2: Nominal, Ordinal (few categories)	Variables are unrelated in population	The variables are related	Chi-sq.	Eta
Correlation	Var1: Ordinal, I/R Var2: Ordinal, I/R	No relationship between the variables	The variables are related	Pearson r	R-squared
Bivariate Regression	Independent: Ordinal, I/R Dependent: Ordinal, I/R	IV is not related to DV (b = 0)	IV is related to DV (b≠ 0)	T-test	R-squared
Multiple Regression	Independent(s): Ordinal, I/R, dummy vars Dependent: Ordinal, I/R	Independent Variables are not related to the dependent variables (all slopes = 0)	At least one slope is not 0 (IV[s] are related to DVs)	F-ratio (combination) T-test	R-squared