8.4.3.5.1 Introduction
A hypothesis test is a statistical test that determines whether the data derived from a sample is sufficient to make claims about the population from which the sample was drawn.
A hypothesis test is based on probability theory and works by comparing two alternative hypotheses. Essentially, the process involves stating a 'no effect' or 'no difference' hypothesis (called a null hypothesis) and seeing whether the sample data is able, at a stated level of probability, to reject this null hypothesis. This Section explores the process of hypothesis testing in some detail.
The analysis section explains how to derive the confidence interval (Section 8.4.3.3) around a population mean or proportion on the assumption that the distribution of sample means approximates a normal distribution.
Note: when a sample of a given size is taken from a population, it is one of all the possible samples of that size. Each of these possible samples will have a mean. Some sample means will be larger than the population mean (where the sample, at random, selects more large values than small) and some will be smaller (where the sample, at random, selects more small values than large). These different means form a (theoretical) distribution, known as the sampling distribution (or distribution of sample means). Most of the sample means will be fairly close to the actual population mean, with a few further away: this approximates what is called a normal distribution. The average value of all the possible sample means will be the same as the population mean.
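To make this idea concrete, the following minimal Python sketch (illustrative only; the skewed population and the sample size of 100 are arbitrary assumptions, not taken from the text) draws many samples from a population and shows that the means of those samples cluster around the population mean.

```python
# Illustrative sketch of a sampling distribution of the mean.
# The population shape and sample sizes here are arbitrary choices.
import random
import statistics

random.seed(1)
population = [random.expovariate(1 / 50) for _ in range(100_000)]   # a skewed population
sample_means = [statistics.mean(random.sample(population, 100))     # mean of one sample (n = 100)
                for _ in range(2_000)]                               # 2,000 possible samples

print("population mean:        ", round(statistics.mean(population), 2))
print("average of sample means:", round(statistics.mean(sample_means), 2))
print("spread of sample means: ", round(statistics.stdev(sample_means), 2))
# The average of the sample means is very close to the population mean,
# and the sample means are roughly normally distributed around it.
```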
The confidence interval gives the limits within which we expect to find the population parameter with a given probability (the confidence level).
Having defined this interval, how can we make use of it? One of the fundamental uses of confidence intervals is to see whether a claimed value of a population is or is not a possibility.
Suppose a claim is made as to the value of a population mean. This could be checked by measuring the entire population but it is usually impractical to collect the relevant data of every member of the population and so a (representative) sample is taken. The sample mean is calculated and this gives the best unbiased estimate of the population mean.
The claimed population mean is then compared with the sample mean to see if the claim is possible or not. However, it is not sufficient merely to see whether the claimed mean and the sample mean coincide and then reject the claim if they do not.
Why? Because the sample mean does not necessarily equal the population mean. However, we are 95% sure that the true population mean will lie within the 95% confidence limits derived from an unbiased sample. Therefore, if the claimed value of the mean does not lie within those confidence limits, we can be 95% sure that the claimed mean is not the same as the population mean.
Similarly, if we compute the 99% limits and see that the claimed population mean does not lie within the interval we can be 99% sure that the claimed value of the population mean is different from the true value of the mean.
Thus, it is possible to see if a claimed value of a population mean is likely to be false by computing the confidence interval on the basis of the sample data and then rejecting the claim if the claimed value of the population mean lies outside the confidence limits of the true population mean.
This procedure is the basis for hypothesis testing in general. In the comments above and in the subsequent development of the concept of hypothesis testing, we test the hypothesis that the claimed value of the population mean is equal to the true population mean. In shorthand the hypothesis we are testing is that:
Hypothesis: Population mean (µ) = hypothesised population mean (µ0)
where µ is the true value of the population mean
and µ0 is the hypothesised (or claimed) value of the population mean.
8.4.3.5.2 The null hypothesis
The hypothesis we are testing is called the null hypothesis and is represented by the convenient shorthand H0.
Note the subscript '0': in sampling theory, wherever a parameter has a '0' subscript it means that the parameter has the value attributed to it by the null hypothesis.
Consequently the shorthand for 'we are testing the hypothesis that the claimed value of the population mean is the same as the true value of the population mean' is:
H0: µ0=µ
This is one of many possible null hypotheses.
Another one is:
H0: ∏0=∏
(i.e. the assumed value of the population proportion (∏) is tested to see whether it equals the actual value of the population proportion).
8.4.3.5.3 The alternative hypothesis
Having specified the null hypothesis that is to be tested we must also state the hypothesis we will accept if we reject the null hypothesis. The hypothesis that we are testing the null hypothesis against is known as the alternative hypothesis and is represented by HA.
8.4.3.5.4 Example of hypothesis testing (H0: µ0=µ) by calculating confidence intervals
Let us consider an example to illustrate what has been covered so far in this section. A firm claims that it has produced a machine for making bars of soap and that it will output 1000 bars an hour. A soap manufacturer installs one of the machines and discovers that on average over a working week of 50 hours the machine only produces 970 bars per hour with a standard deviation of 91 bars. Is there a significant difference between the claimed output and actual output of the machine?
Superficially, we could say that there is a difference between the actual and claimed output, because the sample mean is not as large as the claimed mean. However, such a hasty decision is problematic because the sample is prone to sampling error. How do we allow for sampling error? By calculating the confidence interval for the mean of the population on the basis of the sample data. What we need to compare is the claimed and true population means. However, the actual population mean is unknown and must therefore be estimated from the sample data by calculating the confidence limits.
We are, therefore, testing the hypothesis that the true population mean is the same as the claimed population mean:
H0: µ0=µ
Against the alternative hypothesis that they are not the same:
HA: µ0≠µ
Let's test at a 95% level of confidence. Now we calculate the confidence limits, not forgetting that if the value µ0 lies outside the limits we reject H0 in favour of HA.
The 95% confidence limits are:
Sample mean – z(standard error) ≤ population mean ≤ sample mean + z(standard error)
Sample mean = 970 Sample standard deviation = 91
z= 1.96
Standard error = sample standard deviation/√(n − 1) = 91/√(50 − 1) = 91/√49 = 91/7 = 13
(The divisor n − 1 is used rather than n because dividing by n gives a biased estimate of the population standard deviation; see CASE STUDY Standard error.)
The limits are therefore 970 − 1.96(13) = 944.52 and 970 + 1.96(13) = 995.48. Consequently we are 95% sure the population mean (µ) lies within the range 944.52 to 995.48. The hypothesised value of the mean (µ0) is 1000 and is not within that range. Therefore the true population mean (µ) is different from the claimed value (µ0) and we reject the null hypothesis at a 95% level of confidence. We can say that the machine maker's claim is significantly different from what was found in practice.
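The arithmetic above can be checked with a short Python sketch (the function below is a convenience written for this example, not a standard routine; it uses the same n − 1 divisor as the text):

```python
# Sketch: confidence interval for a population mean from sample statistics.
from math import sqrt

def mean_confidence_interval(sample_mean, sample_sd, n, z):
    """Return (lower, upper) limits: sample mean +/- z * standard error."""
    standard_error = sample_sd / sqrt(n - 1)   # 91 / 7 = 13 in this example
    return sample_mean - z * standard_error, sample_mean + z * standard_error

lower, upper = mean_confidence_interval(sample_mean=970, sample_sd=91, n=50, z=1.96)
print(f"95% limits: {lower:.2f} to {upper:.2f}")                 # 944.52 to 995.48
print("reject H0 (mu0 = 1000)?", not (lower <= 1000 <= upper))   # True
```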
When we say a population mean is significantly different from a hypothesised value of the population mean, this indicates that the sample mean, which is the estimate of the population mean, differs from the assumed value of the population mean by a margin greater than could be accounted for by random sampling error.
Hypothesis testing is the name given to the statistical analysis that attempts to see whether the discrepancy between the assumed and actual parameter is significant.
8.4.3.5.5 Deciding on the confidence level
In the calculation of the example in Section 8.4.3.5.4 we rejected the null hypothesis at a 95% level of confidence. This means that we are 95% certain that the claim is unjustified, which implies we are 5% unsure! In effect we are running a 5% risk of wrongly rejecting the null hypothesis (which is called a Type I error).
In many circumstances we would be prepared to accept a 5% risk of wrongly rejecting our null hypothesis. However, it is possible that we would prefer a reduced risk. One way of reducing the risk is to increase the confidence level, which widens the confidence interval.
For example, if we calculate the 99% confidence interval we are 99% sure that the population mean lies within the range and the risk that it does not is only 1%. Therefore, if the hypothesised value of the mean lies outside the confidence limits there is only a 1% chance that the population mean is the same as the hypothesised mean, and we run only a 1% risk of wrongly rejecting the null hypothesis. However, by raising the confidence level we widen the interval and so lose precision in our estimate of the population mean.
Let's calculate the 99% confidence interval for our example in Section 8.4.3.5.4. The only change is that z equals 2.58, so the limits become 970 − 2.58(13) = 936.46 and 970 + 2.58(13) = 1003.54.
We can now see that, with a wider interval and a higher level of confidence, the hypothesised value of the mean (i.e. 1000) is included in the interval and we cannot reject H0.
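Re-using the mean_confidence_interval sketch from Section 8.4.3.5.4 with z = 2.58 reproduces this result:

```python
lower, upper = mean_confidence_interval(sample_mean=970, sample_sd=91, n=50, z=2.58)
print(f"99% limits: {lower:.2f} to {upper:.2f}")                 # 936.46 to 1003.54
print("reject H0 (mu0 = 1000)?", not (lower <= 1000 <= upper))   # False: 1000 is inside
```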
In deciding upon the confidence level we face a dilemma. Do we reduce the risk of wrongly rejecting the null hypothesis or do we retain the precision of the estimate of the population mean? The answer depends upon how important it is not to make a mistake in rejecting the null hypothesis. This 'importance' is frequently a matter of personal choice. If a legal decision is going to be based upon the findings, or if it is a case of 'life and death', then it is normal to use a high level of confidence (99% or more); otherwise a 95% level is usually sufficient for social, business and economic research situations.
One cannot be categorical about the confidence level to be used; it is up to the tester to decide. By convention the choice is usually between a 95% level and a 99% level. The choice of one or the other will only affect the decision when the hypothesised value falls between the 95% and 99% confidence limits. When we reject at a 99% level we will also reject at a 95% level, and if we cannot reject at a 95% level we cannot reject at a 99% level.
8.4.3.5.6 Rejecting or not rejecting the null hypothesis
In the preceding examples (Sections 8.4.3.5.4 and 8.4.3.5.5) we 'rejected the null hypothesis' or 'did not reject the null hypothesis'; nowhere in the preceding discussion have we ever accepted a null hypothesis. In fact, we rarely set up a hypothesis test to accept the null hypothesis. If the value of the hypothesised mean (µ0) falls within the confidence limits, all this allows us to say is that the null hypothesis is a possibility that cannot be rejected. However, just because it falls within the limits does not prove that the hypothesised value of the mean is true. It is merely a possibility; the hypothesised value would then be one of many values in the confidence limits.
Only when the hypothesised value is outside the limits is it possible to be categorical (at a given degree of confidence) and say that the null hypothesis can be rejected.
8.4.3.5.7 Testing H0: ∏0 = ∏ by calculating confidence intervals
Section 8.4.3.5.4 illustrated a test of a claimed population mean. The following illustrates a test of a claimed population proportion.
Suppose a claim is made that 80% of grammar school children in the United Kingdom are middle class. Taking a random sample and calculating the confidence limits we could see whether this claim is plausible or not. We take a sample because it would be prohibitively expensive to investigate the social class of every grammar school child in the country. Once the criteria for 'middle class' have been established, a random sample is selected by some mechanism (see Section 8.3.9 on sampling). A random sample of 1000 grammar school students comprised 750 middle-class students (a proportion of 0.75) and 250 non-middle-class students (0.25).
The sample proportion (p) of 0.75 is below the claimed population proportion (∏) of 0.8 (80%) but is this just due to sampling error?
We can test to see whether the difference between the sample proportion and the population proportion is due to sampling error or whether it is statistically significant. We are thus testing the claim that the population proportion is 0.8 against the alternative that it is not 0.8:
H0: ∏0 = ∏ = 0.8
HA: ∏0 ≠ ∏ (i.e. ∏ ≠ 0.8)
The basis for the test is a sample with p = 0.75. Calculating a confidence interval around p gives the range within which the population proportion will lie (to a given degree of confidence). Since the sample size is greater than 30, the binomial distribution approximates the normal distribution and we can use normal curve analysis to derive the confidence interval.
If the interval is not wide enough to embrace the hypothesised value of the population proportion we reject the null hypothesis.
Calculation of the confidence interval for our sample at a 95% level:
p – z(standard error of p) ≤ ∏ ≤ p + z(standard error of p)
where ∏ is the population proportion
The standard error of p is calculated using the claimed value of the population proportion because that is the value we are testing. We are testing on the basis of the sampling distribution that assumes that the value ∏0 is in fact the mean of the distribution. So the formula for the standard error of proportion in this case is:
√(∏0(1 − ∏0)/n) = √((0.8 × 0.2)/1000) = √0.00016 = 0.012649
The 95% confidence limits around the sample proportion are thus:
0.75 − 1.96(0.012649) ≤ ∏ ≤ 0.75 + 1.96(0.012649), i.e. 0.7252 ≤ ∏ ≤ 0.7748
Therefore the population proportion (∏) is estimated to lie between 72.52% and 77.48% and we can reject the null hypothesis, which maintains that 80% of grammar school children are middle class, at a 95% level of confidence.
Had we used the sample proportion (0.75) to compute the standard error of proportions, the standard error would have been √((0.75 × 0.25)/1000) = 0.013693 and the confidence limits 0.7232 ≤ ∏ ≤ 0.7768, which also leads to a rejection of the null hypothesis.
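The calculation can be sketched in Python as follows (illustrative only; the standard error is computed from the hypothesised proportion, as the text describes):

```python
# Sketch: 95% confidence interval test for a population proportion.
from math import sqrt

p = 750 / 1000      # sample proportion
pi0 = 0.8           # hypothesised population proportion
n = 1000
z = 1.96            # 95% level, two-tail

standard_error = sqrt(pi0 * (1 - pi0) / n)                    # 0.012649
lower, upper = p - z * standard_error, p + z * standard_error
print(f"95% limits around p: {lower:.4f} to {upper:.4f}")     # 0.7252 to 0.7748
print("reject H0 (proportion = 0.8)?", not (lower <= pi0 <= upper))  # True
```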
8.4.3.5.8 Testing hypotheses by computing a z value (on the basis of the assumed population parameter)
Another way of testing a hypothesis is to compute the difference between the sample statistic and the hypothesised population parameter, divide it by the standard error, and see whether the result exceeds (in absolute value) the appropriate critical z value (1.96 for a 95% confidence level or 2.58 for a 99% confidence level).
For the grammar school example, z = (p − ∏0)/standard error = (0.75 − 0.8)/0.012649 = −3.95. The absolute value of z is greater than 2.58 and so the null hypothesis is rejected at a 99% confidence level.
Put another way, the probability of obtaining a sample proportion this far from 0.8, if the population proportion really were 0.8, is less than 1%.
In fact, the exact probability can be looked up in normal distribution tables or would be provided by a computer program.
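A minimal sketch of this z-value approach, using only the Python standard library (NormalDist supplies the normal-curve probabilities that a printed table would otherwise provide):

```python
# Sketch: test the proportion claim by computing z and its two-tail probability.
from math import sqrt
from statistics import NormalDist

p, pi0, n = 0.75, 0.8, 1000
standard_error = sqrt(pi0 * (1 - pi0) / n)     # 0.012649
z = (p - pi0) / standard_error                 # about -3.95
p_value = 2 * NormalDist().cdf(-abs(z))        # probability of a result this extreme if H0 is true
print(f"z = {z:.2f}, two-tail probability = {p_value:.5f}")
print("reject H0 at the 99% level?", abs(z) > 2.58)   # True
```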
8.4.3.5.9 Testing statistic
In the above example the statistic upon which we base our test decision is z. In this case z is the testing statistic. The testing statistic will not always be z; it depends on the characteristics of the sample and the comparisons being made. It applies in the examples we have looked at so far because the sampling distribution is assumed to be normal.
(Note: the sampling distributions we have looked at have been either distributions of sample means or distributions of sample proportions, and these are approximately normally distributed when the sample size exceeds 30.)
When we use a z testing statistic to test a hypothesis we usually refer to our test as a z-test. Frequently hypothesis tests are named after the statistic used to test them, for example, F-test, t-test, U-test and so on.
8.4.3.5.10 Critical value
The calculated value of the testing statistic (from the sample data) is compared to the value that pertains for a given confidence level (see Section 8.4.3.5.8). This value of the testing statistic is known as the critical value.
For example, for z-tests, the critical z value that corresponds to the 95% confidence level is 1.96 (for a two-tail test see Section 8.4.3.5.13) because 95% of the area under the normal curve lies within 1.96 standard units either side of the mean of the distribution.
If the calculated value of the testing statistic exceeds the critical value (in absolute terms, for a two-tail test) the null hypothesis is rejected; this is then our decision rule.
8.4.3.5.11 Layout for testing hypotheses
Let's reconsider the steps we take when testing a hypothesis.
First, define the null hypothesis and the alternative hypothesis.
Second, state the level of confidence for the test.
Third, choose an appropriate testing statistic, which will depend on the null hypothesis and the sample data.
Fourth, derive the critical value of the testing statistic, which depends upon the hypotheses and the level of confidence.
Fifth, state the basis for rejecting, or not rejecting, the null hypothesis. This is the decision rule (based on the hypotheses, confidence level and testing statistic).
Sixth, compute the testing statistic.
Seventh, compare the computed and critical value of the testing statistic and, on the basis of the decision rule, draw some conclusions relating to the null hypothesis.
8.4.3.5.12 Example of testing hypotheses of the type H0: µ0=µ
A company sells its products via door-to-door commissioned salesmen and women. The firm decides to increase prices and consequently the sales personnel earn more commission on the same number of sales. However, as there are other firms of a similar nature, the increase in prices causes the number of sales per agent to decline. The commission earned in the past by all sales personnel was £1500 per month. The firm is concerned that the commission earned by sales personnel has changed, so it takes a random sample of 101 returns from the commission accounts and finds that the sample mean is £1450 with a standard deviation of £50. Has the average commission earned changed?
1. Hypotheses:
H0: µ0 = µ = £1500    HA: µ0 ≠ µ (i.e. µ ≠ £1500)
The null hypothesis is that there has been no change in the mean commission earned, i.e., that the mean of the population from which the sample has been taken is equal to the hypothesised value of the population mean (µ0), which in this case is equal to the average earned before the firm increased prices.
2. Level of confidence: 95%
3. Testing statistic:
z = (sample mean − µ0)/standard error of the mean
We are testing the population mean, the sample size is large, therefore we may assume the distribution of sample means is normal, hence we use a z-test.
4. Critical value: z = ± 1.96
95% of the area under the normal curve lies between ±1.96 standard units from the mean.
5. Decision rule: reject the null hypothesis if z < –1.96 or z > 1.96 (i.e., absolute value of z exceeds 1.96). Note that z could be a negative number if the sample mean is less than the hypothesised value of the population mean.
6. Computations:
Standard error of the mean = sample standard deviation/√(sample size-1)= 50/√(101-1)
Standard error of the mean = 50/√100 = 50/10 = 5
z= (sample mean - population mean)/standard error of means
z= (1450 – 1500)/5
z= –50/5 = –10
7. Decision: reject the null hypothesis, as the absolute value of the calculated z (10) exceeds the critical value of 1.96; therefore the commission earned by sales personnel has changed significantly at the 95% level of confidence.
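The seven steps can be written out as a short Python sketch (illustrative only; the variable names are our own, not part of the layout):

```python
# Sketch: the commission example laid out as the seven steps.
from math import sqrt

# 1. Hypotheses: H0: mu = mu0 = 1500;  HA: mu != mu0
mu0 = 1500
# 2. Level of confidence: 95%
# 3. Testing statistic: z   4. Critical value: +/-1.96   5. Decision rule: reject H0 if |z| > 1.96
critical_z = 1.96
# 6. Computations
sample_mean, sample_sd, n = 1450, 50, 101
standard_error = sample_sd / sqrt(n - 1)       # 50 / 10 = 5
z = (sample_mean - mu0) / standard_error       # -50 / 5 = -10
# 7. Decision
print(f"z = {z:.0f}; reject H0? {abs(z) > critical_z}")   # z = -10; reject H0? True
```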
8.4.3.5.13 One- or two-tailed tests
In all the examples so far the alternative hypothesis has been either HA: µ0 ≠ µ or HA: ∏0 ≠ ∏.
However, there are other possible alternative hypotheses. For example:
HA: µ0 > µ or HA: µ0 < µ.
How is the test affected by defining an alternative that specifies the direction of the change (i.e., the population mean is less than the claimed mean, or the population mean is greater than the claimed mean) rather than an alternative that simply specifies that a change has occurred without indicating whether it is up or down?
When testing H0: µ0 = µ against HA: µ0 ≠ µ at a 95% confidence level, the critical value of z is found by ascertaining the number of standard units either side of the mean of the normal curve that contains 95% of the area under the curve; i.e., 47.5% of the area on each side of the mean, so the area under the curve outside the confidence limits is 2.5% in each tail. When testing for a change in µ without specifying any direction, this is a two-tailed test and the significance level is split equally between the two tails of the distribution (see Figure 8.3.13.13.1).
If the alternative hypothesis is µ < µ0, then the direction of the change has been defined and we are concerned only with the lower tail of the sampling distribution; the upper limit is of no interest. The null hypothesis can only be rejected if the sample result falls into the lower tail of the distribution, so at a 95% level of confidence all of the 5% rejection region is at the lower end (see Figure 8.3.13.13.2).
This affects the critical value of z. The decision rule becomes unidirectional, so that, in this case, we would only be interested in negative z scores. The corollary is that the magnitude of the critical z value is smaller, because fewer standard units separate the mean from the point that cuts off 5% of the curve in the lower tail than from the point that cuts off 2.5%. In this case the critical value of z is −1.645.
Useful critical values of z:

                    Level of confidence
                    95%         99%
One-tail test*      1.645       2.33
Two-tail test       ±1.96       ±2.58

* plus or minus depending on the direction of the test.
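The critical values in the table can be recovered from the standard normal distribution; the following sketch (using Python's standard library) shows the calculation:

```python
# Sketch: derive critical z values for one- and two-tail tests.
from statistics import NormalDist

std_normal = NormalDist()
for confidence in (0.95, 0.99):
    alpha = 1 - confidence
    one_tail = std_normal.inv_cdf(1 - alpha)        # all of alpha in one tail
    two_tail = std_normal.inv_cdf(1 - alpha / 2)    # alpha split between two tails
    print(f"{confidence:.0%}: one-tail {one_tail:.3f}, two-tail {two_tail:.2f}")
# 95%: one-tail 1.645, two-tail 1.96
# 99%: one-tail 2.326, two-tail 2.58
```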
8.4.3.5.14 Summary of hypothesis testing
The general principles for hypothesis testing are:
1. Define the hypotheses: null hypothesis (H0) and alternative hypothesis (HA).
2. Decide upon an appropriate confidence level; usually 95% or 99%.
3. Derive the testing statistic; this is the basis for the computation and decision regarding the null hypothesis (H0) and depends upon the test used.
4. Derive the critical value for the testing statistic; this depends upon the hypotheses and the confidence level.
5. State the decision rule, always of the form 'reject the null hypothesis (H0) if …', specifying the condition on the testing statistic relative to the critical value.
6. Calculate the testing statistic on the basis of the sample data.
7. Decision: by comparing the calculated and critical value of the testing statistic on the basis of the stated decision rule, the null hypothesis is either rejected or not rejected at a given level of confidence (significance). Finally, explain what the decision means in relation to the data being tested.
Note that there is always a small possibility that the decision is incorrect. There are two types of errors that can result from a hypothesis test. These are known as Type I and Type II errors.
A Type I error occurs when the researcher rejects a null hypothesis when it should not have been rejected. The probability of committing a Type I error is the significance level (usually 5% or 1%). This probability is also referred to as 'alpha'.
A Type II error occurs when the researcher fails to reject a null hypothesis that is false. The probability of committing a Type II error is called 'beta'. A Type II error occurs when the test is not sufficiently sensitive or powerful enough to identify a difference when it actually exists. This may be, for example, because the sample is not big enough, the confidence level is set too high or the measured difference is small. The probability of not committing a Type II error is called the 'power' of the test.
The consequences of a Type I and a Type II error are not the same. Suppose two alternative headache pills are being compared to see which acts faster. Rejecting a null hypothesis that says they act equally fast (H0: µ1 = µ2) when in fact they are equally speedy is a Type I error. The consequence of wrongly rejecting the null hypothesis is not problematic, as the benefit to patients is the same for either pill. However, a Type II error, failing to reject the null hypothesis when it should be rejected, means that the faster-acting pill is not made available in preference to the slower-acting one. So, when conducting a hypothesis test, consider the consequences of making Type I and Type II errors and choose the testing circumstances that take account of those consequences.
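The meaning of a Type I error can also be illustrated with a small simulation (a sketch only; the population values are invented for the illustration): when the null hypothesis is true, a 95% two-tail z-test should wrongly reject it in roughly 5% of samples.

```python
# Sketch: simulate the Type I error rate of a 95% two-tail z-test when H0 is true.
import random
from math import sqrt

random.seed(2)
mu0, sigma, n, trials = 1500, 50, 100, 5_000
rejections = 0
for _ in range(trials):
    sample = [random.gauss(mu0, sigma) for _ in range(n)]   # H0 is true by construction
    sample_mean = sum(sample) / n
    z = (sample_mean - mu0) / (sigma / sqrt(n))
    if abs(z) > 1.96:                                       # decision rule
        rejections += 1
print(f"proportion of wrong rejections: {rejections / trials:.3f}")  # close to 0.05
```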
Activity 8.4.3.5.1
If you want to understand the principles of significance testing make sure you are able to answer the following.
What is a sample?
What is the population?
What is sampling error?
What causes sampling error?
What is standard error?
What is a sampling distribution?
How does the sampling distribution differ from the population distribution?
What are confidence limits?
What is meant by significance level?
What is the null hypothesis?
Why do we define an alternative hypothesis?
What does the critical value of the testing statistic show?
What defines the testing statistic?
What is the basis for the decision rule when testing a hypothesis?
Why do we never accept the null hypothesis?
In the end, the analysis is guided by the hypotheses (see Section 8.3.4) being explored and relies on the ingenuity and imagination of the researcher in relating the data to social scientific theory.