Performance of Wilcoxon-Mann-Whitney Test and t-test

This study compares the Type I error rate and power of the two-sample t-test and the Wilcoxon-Mann-Whitney (WMW) test. The two-sample t-test requires either that both population distributions be normal or that the sample sizes be large enough for the sampling distribution of the difference in sample means to be approximately normal. The WMW test is a nonparametric test that requires the two population distributions to have the same shape. When the two populations have the same mean, the null hypothesis is true and the Type I error rate is of interest. In contrast, when the two populations have different means, power is of interest.

Different scenarios are analyzed in this study: comparing two Normal distributions, a Normal to a Gamma distribution, and two Gamma distributions, each with small and large sample sizes. The better test is the one with the lower Type I error rate (when the population means are equal) or the higher power (when they differ).
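The comparison described above can be sketched as a small Monte Carlo simulation. The following is a minimal illustration, not the app's own code (which is written in R): it assumes a pooled-variance t-test, a WMW test using the large-sample normal approximation without continuity correction, and illustrative settings (sample size 20, 2000 simulations, α = 0.05). All function names are hypothetical.

```python
import math
import random
from statistics import NormalDist, mean, variance


def t_cdf(t, df, steps=200):
    """CDF of Student's t, via Simpson's rule on the density from 0 to |t|."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))

    def pdf(x):
        return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

    b = abs(t)
    h = b / steps  # steps must be even for Simpson's rule
    area = pdf(0) + pdf(b)
    for i in range(1, steps):
        area += (4 if i % 2 else 2) * pdf(i * h)
    area *= h / 3
    return 0.5 + area if t >= 0 else 0.5 - area


def t_test_p(x, y):
    """Two-sided p-value of the pooled-variance two-sample t-test (assumed variant)."""
    n1, n2 = len(x), len(y)
    sp2 = ((n1 - 1) * variance(x) + (n2 - 1) * variance(y)) / (n1 + n2 - 2)
    t = (mean(x) - mean(y)) / math.sqrt(sp2 * (1 / n1 + 1 / n2))
    return 2 * (1 - t_cdf(abs(t), n1 + n2 - 2))


def wmw_p(x, y):
    """Two-sided WMW p-value, normal approximation, assuming no ties."""
    n1, n2 = len(x), len(y)
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    r1 = sum(rank for rank, (_, grp) in enumerate(pooled, start=1) if grp == 0)
    u = r1 - n1 * (n1 + 1) / 2
    z = (u - n1 * n2 / 2) / math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return 2 * (1 - NormalDist().cdf(abs(z)))


def rejection_rates(draw1, draw2, alpha=0.05, n_sims=2000, seed=1):
    """Fraction of simulated data sets in which each test rejects H0."""
    rng = random.Random(seed)
    t_rej = w_rej = 0
    for _ in range(n_sims):
        x, y = draw1(rng), draw2(rng)
        t_rej += t_test_p(x, y) < alpha
        w_rej += wmw_p(x, y) < alpha
    return t_rej / n_sims, w_rej / n_sims


def normal(n, mu):
    """Sampler for n draws from a Normal with mean mu and variance 1."""
    return lambda rng: [rng.gauss(mu, 1) for _ in range(n)]


# Equal means: the rejection rate estimates the Type I error rate.
type1 = rejection_rates(normal(20, 3), normal(20, 3))
# Different means: the rejection rate estimates power.
power = rejection_rates(normal(20, 3), normal(20, 4))
print("Type I error rate (t, WMW):", type1)
print("Power (t, WMW):", power)
```

With equal means, both estimated rates should land near α; with different means, the test with the larger rate is the more powerful one for that scenario.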

Shiny app by Jimmy Wong

Base R code by Jimmy Wong

Shiny source files: GitHub Gist

Cal Poly Statistics Dept Shiny Series

It is time to Guess the Population! This game demonstrates how difficult it is to identify which pairs of samples come from the same population. Below are 4 histograms of randomly generated samples, each of size 20: 2 are from N(3,1) (Normal distribution) and 2 are from Gamma(6,.5) (Gamma distribution).

Can you determine which pair came from the Normal distribution and which pair from the Gamma distribution?
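To get a feel for why the game is hard, the four samples can be generated and summarized numerically. This is a small sketch under assumptions: it treats .5 as the Gamma scale parameter (so the Gamma mean is 6 × 0.5 = 3, matching the Normal mean), and the seed and sample labels are arbitrary.

```python
import random
from statistics import mean, stdev

rng = random.Random(2024)  # arbitrary seed, for reproducibility only
n = 20                     # sample size used in the game

# Two samples from N(3, 1) and two from Gamma(shape=6, scale=0.5).
# random.gammavariate takes (shape, scale); the scale reading of ".5" is an assumption.
samples = {
    "Normal A": [rng.gauss(3, 1) for _ in range(n)],
    "Normal B": [rng.gauss(3, 1) for _ in range(n)],
    "Gamma A": [rng.gammavariate(6, 0.5) for _ in range(n)],
    "Gamma B": [rng.gammavariate(6, 0.5) for _ in range(n)],
}

for name, data in samples.items():
    print(f"{name}: mean={mean(data):.2f}, sd={stdev(data):.2f}, "
          f"min={min(data):.2f}, max={max(data):.2f}")
```

With only 20 observations per sample, the summaries of the Normal and Gamma samples often overlap, which is the point of the game.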

Normal distribution 1: sample size, mean
Normal distribution 2: sample size, mean
Significance level α
Number of simulations

Population Distributions

Note:

Variances are fixed at 1
P(rejecting H₀ | μ₁=μ₂) = Type I error rate
P(rejecting H₀ | μ₁≠μ₂) = Power

When the two Normal distributions have the same mean, focus on the Type I error rate.

When the two Normal distributions have different means, focus on power.

Conditions

Simulation Results

Type I error rate:

Power:

Normal distribution 1: sample size, range of means
Normal distribution 2: sample size, mean
Significance level α
Number of simulations

Simulation Information

In this scenario, the mean of the 1st Normal distribution varies over the specified range, while the mean of the 2nd Normal distribution remains constant. The Type I error rate and power are compared between the t-test and the WMW test.

Simulation Results

In the generated graph, each point is either a Type I error rate or a power. At most one point is a Type I error rate (the one where the two population means are equal); every other point is a power.
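The sweep behind such a graph can be sketched as follows. This is a hedged illustration rather than the app's implementation: it shows only the WMW test (normal approximation, no ties assumed), and the grid of means, sample sizes, and number of simulations are made-up values. The t-test would be swept the same way.

```python
import math
import random
from statistics import NormalDist


def wmw_rejects(x, y, alpha):
    """True if the WMW test (normal approximation, no ties) rejects H0."""
    n1, n2 = len(x), len(y)
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    r1 = sum(rank for rank, (_, grp) in enumerate(pooled, start=1) if grp == 0)
    u = r1 - n1 * (n1 + 1) / 2
    z = (u - n1 * n2 / 2) / math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return 2 * (1 - NormalDist().cdf(abs(z))) < alpha


def sweep(means1, mean2, n1=15, n2=15, alpha=0.05, n_sims=1000, seed=7):
    """Estimate the rejection rate at each mean of distribution 1."""
    rng = random.Random(seed)
    points = []
    for m1 in means1:
        rejections = sum(
            wmw_rejects([rng.gauss(m1, 1) for _ in range(n1)],
                        [rng.gauss(mean2, 1) for _ in range(n2)], alpha)
            for _ in range(n_sims)
        )
        # Only the point with equal means is a Type I error rate.
        label = "Type I error rate" if m1 == mean2 else "power"
        points.append((m1, label, rejections / n_sims))
    return points


points = sweep(means1=[2.0, 2.5, 3.0, 3.5, 4.0], mean2=3.0)
for m1, label, rate in points:
    print(f"mean 1 = {m1}: {label} = {rate:.3f}")
```

If the fixed mean of the second distribution is not on the grid, no point is a Type I error rate, which is why the graph shows at most one.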

Normal distribution: sample size, mean
Gamma distribution: sample size, shape, scale
Significance level α
Number of simulations

Population Distributions

Note:

Variance of Normal is fixed at 1
Gamma mean is the product of shape and scale
P(rejecting H₀ | μ₁=μ₂) = Type I error rate
P(rejecting H₀ | μ₁≠μ₂) = Power
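As a quick check of the notes above, the Gamma mean can be computed from shape and scale and compared with the Normal mean to decide which quantity a given setting estimates. The parameter values below are illustrative only, and the helper names are hypothetical.

```python
import math
import random
from statistics import mean


def gamma_mean(shape, scale):
    """Mean of a Gamma distribution: the product of shape and scale."""
    return shape * scale


def quantity_of_interest(normal_mean, shape, scale):
    """Type I error rate if the two population means are equal, else power."""
    if math.isclose(normal_mean, gamma_mean(shape, scale)):
        return "Type I error rate"
    return "power"


# Illustrative values: shape 6 and scale 0.5 give a Gamma mean of 3.
rng = random.Random(0)
draws = [rng.gammavariate(6, 0.5) for _ in range(100_000)]
print(gamma_mean(6, 0.5))                 # 3.0
print(round(mean(draws), 2))              # empirical mean, close to 3
print(quantity_of_interest(3, 6, 0.5))    # Type I error rate
print(quantity_of_interest(4, 6, 0.5))    # power
```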

Conditions

Simulation Results

Normal distribution: sample size, range of means
Gamma distribution: sample size, shape, scale
Significance level α
Number of simulations

Simulation Information

In this scenario, the mean of the Normal distribution varies over the specified range, while the mean of the Gamma distribution remains constant. The Type I error rate and power are compared between the t-test and the WMW test.

Simulation Results

In the generated graph, each point is either a Type I error rate or a power. At most one point is a Type I error rate (the one where the two population means are equal); every other point is a power.

Gamma distribution 1: sample size, shape, scale
Gamma distribution 2: sample size, distance from first Gamma
Significance level α
Number of simulations

Population Distributions

Note:

Gamma mean is the product of shape and scale
P(rejecting H₀ | μ₁=μ₂) = Type I error rate
P(rejecting H₀ | μ₁≠μ₂) = Power

Conditions

Simulation Results

Gamma distribution 1: sample size, shape, scale
Gamma distribution 2: sample size, range of distance from first Gamma
Significance level α
Number of simulations

Simulation Information

In this scenario, the mean of the 1st Gamma distribution remains constant, while the mean of the 2nd Gamma distribution varies over the specified range of distance. The Type I error rate and power are compared between the t-test and the WMW test.

Simulation Results

In the generated graph, each point is either a Type I error rate or a power. At most one point is a Type I error rate (the one where the two population means are equal); every other point is a power.