Hypothesis Testing and Prerequisite Knowledge

A hypothesis is a statement or assumption about a population parameter (e.g., mean, proportion) that can be tested using statistical methods. It is the foundation of hypothesis testing, which determines whether there is enough statistical evidence in a sample to infer a conclusion about the entire population.

Karl Popper’s Philosophy

Whenever there is a conjecture, which is a statement that is not yet proved, according to Karl Popper, it is easier to disprove it by showing empirical evidence. The conjecture is called the null hypothesis (), and the opposite of it is called the alternative hypothesis ().

Attributes

  • Population: In hypothesis testing, the number of subjects that are measured is called the population.
  • Sampling: The reduced amount of data.
    • It is used largely by government, industries, etc., when the data is too hard to collect, and we collect a sample of the data.
  • : Null Hypothesis
  • : Alternative Hypothesis
graph LR
    A(Formulate Hypothesis) --> B(Collect Sample Data)
    B --> C(Analyze Data)
    C --> D{Is there sufficient evidence?}
    D -- Yes --> E(Reject Null Hypothesis)
    D -- No --> F(Fail to Reject Null Hypothesis)
    E --> G(Make an Inference About the Population)
    F --> G

Surveys

  • Surveys are used to collect data from a population.
  • The data is collected from a sample of the population.
  • The data is then analyzed to make a statement about the population.

Parameters & Statistics

We use Greek letters for population parameters and English letters for sample statistics.

  • Population Measures:
    • Mean ()
    • Variance ()
  • Sample Measures:
    • Mean ()
    • Variance ()

Statistical Hypothesis

Hypothesis

  • A new drug significantly reduces blood pressure.

Null Hypothesis

  • Definition: A definite statement about a population parameter which is tested for possible rejection under the assumption that it is true. It is usually a hypothesis of no difference. Represented by .
  • Example: The new drug does not reduce blood pressure compared to a placebo.

Testing Process

  • Researchers would conduct a clinical trial, and if the data shows a statistically significant decrease in blood pressure in the drug group, then the hypothesis could be accepted. Otherwise, it is rejected.

Alternative Hypothesis

  • Any hypothesis that is complementary to the null hypothesis is called an alternative hypothesis and is denoted by .

Types of Errors

  • Type 1 Error: Rejecting a true null hypothesis. The probability of making a Type 1 error is denoted by .

    • Examples:
      • Convicting an innocent person.
      • 100 phones produced, 10 phones sampled, 1 defective phone found (Type 1 error).
  • Type 2 Error: Accepting a false null hypothesis. The probability of making a Type 2 error is denoted by .

    • Examples:
      • Acquitting a guilty person.
      • 100 phones produced, 10 phones sampled, no defective phones found (Type 2 error).
  • and are referred to as Producer’s Risk and Consumer’s Risk, respectively.

Example Problems

Example 1

Average marks of boys are not the same as average marks of girls.

  • Let average marks for boys be and average marks of girls be .

Example 2

Average height of boys is more than average height of girls.

  • Let average height for boys be and average height of girls be .

One-Tailed & Two-Tailed Tests

Given a sample size with average , and another sample size with average , we can perform the following tests:

  • Right-Tailed Test:

  • Left-Tailed Test:

  • Two-Tailed Test:

Level of Significance

The probability, let’s say , of rejecting a true null hypothesis is called the level of significance.

The level of significance is the probability of rejecting a true null hypothesis. It is denoted by .

If we know the probability , we can calculate the value. The value is the number of standard deviations a data point is from the mean.

Example

If , we look closely at the Z table and find that the value of is 1.96.

Confidence Interval

The confidence interval is the range of values within which the true value of the parameter is expected to lie with a certain level of confidence. The confidence interval is denoted by where is the level of significance.

Tests of Significance Problems

Question 1

Test of significance between population mean and sample mean. A sample size is considered larger if the sample size is greater than 30. If the sample size is less than 30, then the sample size is considered small.

Problem Statement

  • Sample size = 100

  • Standard Deviation = 10 cm

  • Sample Mean = 160 cm

  • Mean Height = 165 cm

  • (Two-Tailed Test)

Solution

Solving test statistics:

Since , we reject .

Conclusion: Reject

Question 2

A random sample of 200 measurements from a large population has a mean of 50 and a standard deviation of 10. Test the hypothesis that the population mean is 52 against the alternative hypothesis that the population mean is not 52. Use a level of significance of 0.05.

Problem Statement

  • Sample size = 200

  • Sample Mean = 50

  • Standard Deviation = 10

  • Population Mean = 52

  • (Two-Tailed Test)

Solution

Solving test statistics:

Since , we reject .

Conclusion: Reject

Small Sample Tests

If the sample size is less than 30, then the sample size is considered small. The test statistic is calculated using the t-distribution.

  • Degrees of freedom =
  • is the t value for the level of significance and degrees of freedom .

The sample mean is calculated as:

When is known, we can ignore only one value, thus degree of freedom is .

The t-statistic is given by:

Properties of t-distribution

  • The t-distribution is symmetric about the mean.
  • The t-distribution has a mean of 0.
  • The t-distribution is more spread out than the standard normal distribution.

Formula for Comparing Two Small Samples

Problems with Small Datasets

Question 1

A machine solves a problem in 1.75 seconds. A new machine is introduced and the time taken to solve the problem is 1.85 seconds. The standard deviation is 0.1. Test the hypothesis that the new machine is inferior to the old machine. Use a level of significance of 0.05.

Problem Statement

  • (machine is not inferior)
  • (Two-Tailed Test, machine is inferior)
  • Degrees of freedom =

Solution

Since , we reject .

Conclusion: Reject

Question 2

A certain injection is administered. Will it always…

Problem Statement

  • (There is a significant difference)

Test of Significance of Difference Between Two Large Sample Means

Testing of significance of difference between two large samples means. We will now have two values of and two values of and two values of . We will also calculate the value for the two samples.

If student 1 is asked to get a sample of college students with marks and student 2 is asked to get another sample. The standard deviation will remain the same. This is because student 1 and student 2…

When the samples are too large it will follow standard normal distribution. When the samples are too small it will follow the t-distribution. Assumption will be made on the basis of sample size.

Cases

  • Case 1: and known
  • Case 2: and unknown
  • Case 3: and known
  • Case 4: and unknown

Formulas

Case 4 Formula

Case 3 Formula

Case 1 Formula

Case 2 Formula

Test of Significance of Difference Between Two Sample Means (Small Samples)

Questions

Samples of two types of electric bulbs are given:

SizeMeanStandard Deviation
Sample18121436
Sample271036

Questions Based on Test of Significance

The average marks scored by 32 boys is 72 with an SD of 8, while that for 36 girls is 70 with an SD of 6. Test at 1% LOS whether the boys perform better than girls.

Paired Testing

When there are two different instances of the same sample, we can use paired testing. In the case of the example where students’ first exam and second exam…

Let be the marks of the first exam and be the marks of the second exam. is the difference between the two exams.

or

The test statistic is given by:

F Test

We move from comparing means to comparing variances. Proportions cannot be compared for very small samples. So for samples of large size, we use the F test that is variance.

References

Information
  • Date: 2025.03.12
  • Time: 14:05