kruskalTest(x,...)## Default S3 method:kruskalTest(x, g, dist = c("Chisquare","KruskalWallis","FDist"),...)## S3 method for class 'formula'kruskalTest( formula, data, subset, na.action, dist = c("Chisquare","KruskalWallis","FDist"),...)
Arguments
x: a numeric vector of data values, or a list of numeric data vectors.
...: further arguments to be passed to or from methods.
g: a vector or factor object giving the group for the corresponding elements of "x". Ignored with a warning if "x" is a list.
dist: the test distribution. Defaults's to "Chisquare".
formula: a formula of the form response ~ group where response gives the data values and group a vector or factor of the corresponding groups.
data: an optional matrix or data frame (or similar: see model.frame) containing the variables in the formula formula. By default the variables are taken from environment(formula).
subset: an optional vector specifying a subset of observations to be used.
na.action: a function which indicates what should happen when the data contain NAs. Defaults to getOption("na.action").
Returns
A list with class "htest" containing the following components:
method: a character string indicating what type of test was performed.
data.name: a character string giving the name(s) of the data.
statistic: the estimated quantile of the test statistic.
p.value: the p-value for the test.
parameter: the parameters of the test statistic, if any.
alternative: a character string describing the alternative hypothesis.
estimates: the estimates, if any.
null.value: the estimate under the null hypothesis, if any.
Details
For one-factorial designs with non-normally distributed residuals the Kruskal-Wallis rank sum test can be performed to test the H0:F1(x)=F2(x)=…=Fk(x) against the HA:Fi(x)=Fj(x)(i=j) with at least one strict inequality.
Let Rij be the joint rank of Xij, with R(1)(1)=1,…,R(n)(n)=N,N=∑i=1kni, The test statistic is calculated as
H=i=1∑kni(Rˉi−Rˉ)/σR,
with the mean rank of the i-th group
Rˉi=j=1∑niRij/ni,
the expected value
Rˉ=(N+1)/2
and the expected variance as
σR2=N(N+1)/12.
In case of ties the statistic H is divided by (1−∑i=1rti3−ti)/(N3−N)
According to Conover and Imam (1981), the statistic H is related to the F-quantile as
F=(N−1−H)/(N−k)H/(k−1)
which is equivalent to a one-way ANOVA F-test using rank transformed data (see examples).
The function provides three different dist for p-value estimation:
Chisquare: p-values are computed from the Chisquare
distribution with $v = k - 1$ degree of freedom.
KruskalWallis: p-values are computed from the pKruskalWallis of the package SuppDists.
FDist: p-values are computed from the FDist distribution with v1=k−1,v2=N−k degree of freedom.
Examples
## Hollander & Wolfe (1973), 116.## Mucociliary efficiency from the rate of removal of dust in normal## subjects, subjects with obstructive airway disease, and subjects## with asbestosis.x <- c(2.9,3.0,2.5,2.6,3.2)# normal subjectsy <- c(3.8,2.7,4.0,2.4)# with obstructive airway diseasez <- c(2.8,3.4,3.7,2.2,2.0)# with asbestosisg <- factor(x = c(rep(1, length(x)), rep(2, length(y)), rep(3, length(z))), labels = c("ns","oad","a"))dat <- data.frame( g = g, x = c(x, y, z))## AD-TestadKSampleTest(x ~ g, data = dat)## BWS-TestbwsKSampleTest(x ~ g, data = dat)## Kruskal-Test## Using incomplete beta approximationkruskalTest(x ~ g, dat, dist="KruskalWallis")## Using chisquare distributionkruskalTest(x ~ g, dat, dist="Chisquare")## Not run:## Check with kruskal.test from R statskruskal.test(x ~ g, dat)## End(Not run)## Using Conover's FkruskalTest(x ~ g, dat, dist="FDist")## Not run:## Check with aov on ranksanova(aov(rank(x)~ g, dat))## Check with oneway.testoneway.test(rank(x)~ g, dat, var.equal =TRUE)## End(Not run)## Median Test asymptoticmedianTest(x ~ g, dat)## Median Test with simulated p-valuesset.seed(112)medianTest(x ~ g, dat, simulate.p.value =TRUE)
References
Conover, W.J., Iman, R.L. (1981) Rank Transformations as a Bridge Between Parametric and Nonparametric Statistics. Am Stat 35 , 124--129.
Kruskal, W.H., Wallis, W.A. (1952) Use of Ranks in One-Criterion Variance Analysis. J Am Stat Assoc 47 , 583--621.
Sachs, L. (1997) Angewandte Statistik. Berlin: Springer.