Unistat Statistics Software | Nonparametric Tests-Kruskal-Wallis One-Way ANOVA

6.5.1. Kruskal-Wallis One-Way ANOVA

Data entry is in multisample format (see 6.0.4. Multisample Tests). Each sample can be entered in a separate column (not necessarily of equal length), or they can be stacked in one or more columns and subsamples defined by an unlimited number of factor columns. Missing values are omitted by case.

Nonparametric Tests-Kruskal-Wallis One-Way ANOVA

6.5.1.1. Kruskal-Wallis ANOVA Test Results

This test is used to evaluate the degree of association between samples. It is assumed that the samples have similar distributions and that they are independent. All cases in all samples are ranked together and then the rank sum of each sample is found. The test statistic is calculated as follows:

where N is the total number of cases in all samples, M is the number of variables and R is the total of the squared sum of ranks for each sample divided by the respective sample size.

The test statistic corrected for ties is:

Nonparametric Tests-Kruskal-Wallis One-Way ANOVA

where K is sum of k3 – k and k is the number of tied cases for a particular rank.

The one-tail probability is reported from the chi-square distribution.

6.5.1.2. Kruskal-Wallis ANOVA Multiple Comparisons

Eight nonparametric Multiple Comparisons can be performed as part of this procedure. The last two are comparisons against a control group (which require further inputs) and the rest are comparisons between all possible pairs.

Multiple comparisons with rank sums (Tukey-HSD)

Nonparametric Multiple Comparisons are performed in a way similar to the Tukey-HSD test using rank sums. The standard error is computed as:

This test requires equal group sizes.

Multiple comparisons with mean ranks (Tukey-HSD)

Nonparametric Multiple Comparisons are performed in a way similar to the Tukey-HSD test using mean ranks. In this case the standard error is computed as:

This test requires equal group sizes.

Multiple comparisons with rank sums (S-N-K)

Nonparametric Multiple Comparisons are performed in a way similar to the Student-Newman-Keuls test using mean ranks. In this case the standard error is computed as:

This test requires equal group sizes.

Multiple comparisons with mean ranks (S-N-K)

Nonparametric Multiple Comparisons can also be performed in a way similar to the Student-Newman-Keuls test using rank sums. The standard error is computed as follows:

This test requires equal group sizes.

Multiple comparisons with t-distribution

If group sizes are not equal and all possible pairs are to be compared then this option can be selected. Nonparametric Multiple Comparisons are performed in a way similar to the Tukey-HSD test using mean ranks. In this case the standard error is computed as:

Nonparametric Tests-Kruskal-Wallis One-Way ANOVA

Multiple comparisons (Dunn)

If group sizes are not equal and all possible pairs are to be compared, then this option can be selected.

The standard error, which has a correction term for tied ranks, is computed as follows:

Nonparametric Tests-Kruskal-Wallis One-Way ANOVA

where N is the total number of cases, K is the sum of k3 – k and k is the number of tied cases for a particular rank (as in Kruskal-Wallis One-Way ANOVA). In comparisons group mean ranks are used.

Comparisons against a control group (Dunnett)

If each group of data is to be tested against a control group and all groups are of the same size then select this option. If the group sizes are not equal then the next option (Dunn’s test) should be used.

The standard error is computed as follows:

The only other difference between the Dunnett test introduced here and the Dunnett test per se is that here the group rank sums are used while the latter uses group mean ranks.

Comparisons against a control group (Dunn)

If each group of data is to be tested against a control group and all groups are not of the same size then select this option. If the group sizes are equal then the previous option (Dunnett test) may also be employed.

The standard error, which has a correction term for tied ranks, is computed as in the Dunn’s test above.

6.5.1.3. Kruskal-Wallis ANOVA Examples

Example 1

Example 10.6 on p. 287 from Armitage & Berry (2002). Counts of adult worms in four groups of rats are given. The null hypothesis “there is no significant difference between the rats” is tested.

Open NONPARM1 and select Statistics 1 → Nonparametric Tests (Multisample) → Kruskal-Wallis ANOVA. Select Group 1, Group 2, Group 3 and Group 4 (C1 to C4) as [Variable]s and then select only Test Results to obtain the following results:

Kruskal-Wallis One-Way ANOVA

	Cases	Rank Sum	Mean Rank
Group 1	5	42.0000	8.4000
Group 2	5	53.0000	10.6000
Group 3	5	36.0000	7.2000
Group 4	5	79.0000	15.8000
Total	20	210.0000	10.5000

Correction for Ties =	0.0008
Chi-Square Statistic =	6.2047
Degrees of Freedom =	3
Right-Tail Probability =	0.10207

This result is not significant at the 10% level. Hence do not reject the null hypothesis.

Example 2

Examples 10.10 on p. 216 and 11.7 on p. 241 from Zar, J. H. (2010). A researcher wants to test the null hypothesis “the abundance of the flies is the same in all three vegetation layers” at a 95% significance level. If they were found to be different, then the researcher would also like to know which ones.

Open NONPARM1, select Statistics 1 → Nonparametric Tests (Multisample) → Kruskal-Wallis ANOVA and include Herbs (C5), Shrubs (C6) and Trees (C7) in the analysis by clicking [Variable]. Check only the Test Results and the Multiple Comparisons with Rank Sums (Tukey-HSD) boxes to obtain the following results:

Kruskal-Wallis One-Way ANOVA

	Cases	Rank Sum	Mean Rank
Herbs	5	64.0000	12.8000
Shrubs	5	30.0000	6.0000
Trees	5	26.0000	5.2000
Total	15	120.0000	8.0000

Correction for Ties =	0.0000
Chi-Square Statistic =	8.7200
Degrees of Freedom =	2
Right-Tail Probability =	0.0128

Since the right tail probability is less than 5%, the null hypothesis is rejected. Next the researcher would like to find which vegetation layers have different abundance of the flies.

Multiple Comparisons with Rank Sums (Tukey-HSD)

Method: 95% Tukey-HSD interval.

** denotes significantly different pairs. Vertical bars show homogeneous subsets.

A pairwise test result is significant if its q stat value is greater than the table q.

Group	Cases	Rank Sum	Trees	Shrubs	Herbs
Trees	5	26.0000			**	\|
Shrubs	5	30.0000			**	\|
Herbs	5	64.0000	**	**		\|

Comparison	Difference	Standard Error	q Stat	Table q	Probability
Herbs – Trees	38.0000	10.0000	3.8000	3.3145	0.0197
Shrubs – Trees	4.0000	10.0000	0.4000	3.3145	0.9569
Herbs – Shrubs	34.0000	10.0000	3.4000	3.3145	0.0428

Comparison	Lower 95%	Upper 95%	Result
Herbs – Trees	4.8551	71.1449	**
Shrubs – Trees	-29.1449	37.1449
Herbs – Shrubs	0.8551	67.1449	**

Homogeneous Subsets:
Group 1:	Trees Shrubs
Group 2:	Herbs

The overall conclusion is that fly abundance is the same for Trees and Shrubs but it is different for Herbs.

Example 3

Examples 10.11 on p. 217 and 11.8 on p. 242 from Zar, J. H. (2010). The null hypothesis that “pH is the same in all four ponds” is tested at a 95% significance level. If they were found to be different, then we would also like to know which ones. The data has unequal column lengths.

Open NONPARM1, select Statistics 1 → Nonparametric Tests (Multisample) → Kruskal-Wallis ANOVA and include Pond 1, Pond 2, Pond 3 and Pond 4 (C8 to C11) in the analysis by clicking [Variable]. Check only the Test Results and the Multiple Comparisons (Dunn) boxes to obtain the following results:

Kruskal-Wallis One-Way ANOVA

	Cases	Rank Sum	Mean Rank
Pond 1	8	55.0000	6.8750
Pond 2	8	132.5000	16.5625
Pond 3	7	145.0000	20.7143
Pond 4	8	163.5000	20.4375
Total	31	496.0000	16.0000

Correction for Ties =	0.0056
Chi-Square Statistic =	11.9435
Degrees of Freedom =	3
Right-Tail Probability =	0.0076

Since the right tail probability is less than 5%, the null hypothesis is rejected. Next we would like to find which ponds have a different pH.

Multiple Comparisons (Dunn)

Method: 95% Dunn interval.

** denotes significantly different pairs. Vertical bars show homogeneous subsets.

A pairwise test result is significant if its q stat value is greater than the table q.

Group	Cases	Mean Rank	Pond 1	Pond 2	Pond 4	Pond 3
Pond 1	8	6.8750			**	**	\|
Pond 2	8	16.5625					\|\|
Pond 4	8	20.4375	**				\|
Pond 3	7	20.7143	**				\|

Comparison	Difference	Standard Error	q Stat	Table q	Probability
Pond 3 – Pond 1	13.8393	4.6923	2.9493	2.6383	0.0191
Pond 4 – Pond 1	13.5625	4.5332	2.9918	2.6383	0.0166
Pond 2 – Pond 1	9.6875	4.5332	2.1370	2.6383	0.1956
Pond 3 – Pond 2	4.1518	4.6923	0.8848	2.6383	1.0000
Pond 4 – Pond 2	3.8750	4.5332	0.8548	2.6383	1.0000
Pond 3 – Pond 4	0.2768	4.6923	0.0590	2.6383	1.0000

Comparison	Lower 95%	Upper 95%	Result
Pond 3 – Pond 1	1.4597	26.2188	**
Pond 4 – Pond 1	1.6027	25.5223	**
Pond 2 – Pond 1	-2.2723	21.6473
Pond 3 – Pond 2	-8.2278	16.5313
Pond 4 – Pond 2	-8.0848	15.8348
Pond 3 – Pond 4	-12.1028	12.6563

Homogeneous Subsets:
Group 1:	Pond 1 Pond 2
Group 2:	Pond 2 Pond 4 Pond 3

The overall conclusion is that water pH is the same in Pond 2, Pond 4 and Pond 3 but is different in Pond 1.

Example 4

Example 1, p. 291, Conover, W. J. (1999). The null hypothesis that “the four methods (i.e. columns) are equivalent” is tested at a 95% confidence level.

Open NONPARM1, select Statistics 1 → Nonparametric Tests (Multisample) → Kruskal-Wallis ANOVA and include Method 1, Method 2, Method 3, Method 4 (C12 to C15) in the analysis by clicking [Variable]. Check only the Test Results and the Multiple Comparisons with t-Distribution boxes to obtain the following results:

Kruskal-Wallis One-Way ANOVA

	Cases	Rank Sum	Mean Rank
Method 1	9	196.5000	21.8333
Method 2	10	153.0000	15.3000
Method 3	7	207.0000	29.5714
Method 4	8	38.5000	4.8125
Total	34	595.0000	17.5000

Correction for Ties =	0.0064
Chi-Square Statistic =	25.6288
Degrees of Freedom =	3
Right-Tail Probability =	0.0000

Since the right tail probability is less than 5%, the null hypothesis is rejected. Therefore, we can now ask the question which methods are different.

Multiple Comparisons with t Distribution

Method: 95% t interval.

** denotes significantly different pairs. Vertical bars show homogeneous subsets.

A pairwise test result is significant if its q stat value is greater than the table q.

Group	Cases	Mean	Method 4	Method 2	Method 1	Method 3
Method 4	8	4.8125		**	**	**	\|
Method 2	10	15.3000	**		**	**	\|
Method 1	9	21.8333	**	**		**	\|
Method 3	7	29.5714	**	**	**		\|

Comparison	Difference	Standard Error	q Stat	Table q	Probability
Method 3 – Method 4	24.7589	2.5465	9.7227	2.0423	0.0000
Method 1 – Method 4	17.0208	2.3908	7.1192	2.0423	0.0000
Method 2 – Method 4	10.4875	2.3339	4.4935	2.0423	0.0001
Method 3 – Method 2	14.2714	2.4248	5.8857	2.0423	0.0000
Method 1 – Method 2	6.5333	2.2607	2.8899	2.0423	0.0071
Method 3 – Method 1	7.7381	2.4796	3.1207	2.0423	0.0040

Comparison	Lower 95%	Upper 95%	Result
Method 3 – Method 4	19.5583	29.9596	**
Method 1 – Method 4	12.1381	21.9036	**
Method 2 – Method 4	5.7210	15.2540	**
Method 3 – Method 2	9.3194	19.2234	**
Method 1 – Method 2	1.9163	11.1504	**
Method 3 – Method 1	2.6741	12.8021	**

Homogeneous Subsets:
Group 1:	Method 4
Group 2:	Method 2
Group 3:	Method 1
Group 4:	Method 3

The overall conclusion is that all methods are different.

Previous topic | Next topic