RESEARCHING THE REAL WORLD



© Lee Harvey 2012–2017

Page updated 13 January, 2017

Citation reference: Harvey, L., 2012–2017, Researching the Real World, available at qualityresearchinternational.com/methodology
All rights belong to author.


 

A Guide to Methodology

8. Surveys

8.1 Introduction to surveys
8.2 Methodological approaches
8.3 Doing survey research

8.3.1 Aims and purpose
8.3.2 Background to the research
8.3.3 Feasibility
8.3.4 Hypotheses
8.3.5 Operationalisation

8.3.5.1 Preliminary enquiry
8.3.5.2 Operationalisation and validity
8.3.5.3 Scaling

8.3.5.3.1 Thurstone scaling

8.3.5.3.1.1 The method of equal appearing intervals
8.3.5.3.1.2 The method of successive intervals and the method of paired comparisons

8.3.5.3.2 Likert or Summative Scales

8.3.5.3.2.1 Introduction to Likert scaling
8.3.5.3.2.2 Constructing a Likert scale
8.3.5.3.2.3 Issues in developing a Likert scale
8.3.5.3.2.4 Possible distortion from the use of Likert scales

8.3.5.3.3 Guttman or Cumulative Scales

8.3.5.4 Interchangeability of indicators

8.3.6 How will data be collected and what are the key relationships
8.3.7 Designing the research instrument
8.3.8 Pilot survey
8.3.9 Sampling
8.3.10 Questionnaire distribution and interviewing
8.3.11 Coding data
8.3.12 Analysis
8.3.13 Hypothesis testing
8.3.14 Report writing

8.4 Summary and conclusion

8.3.5.3 Scaling
As noted in Section 8.3.2, there is often a range of indicators for a variable, and the researcher is faced with four options.

First, treat the variable as unidimensional and select a single item to represent the concept. For example, an individual’s social class might be operationalised by selecting the social class category of their occupation.

Second, treat the variable as multi-dimensional and select a single indicator for each dimension. Then combine these into an index. Such a combination may, for example, involve simply adding the scores for each dimension together, or it may involve weighting the different dimensions according to their importance in the overall concept.

For example, attempts to provide a ranking of universities use several dimensions and identify a single variable to represent each: ranging from research outputs, through grants obtained, to Nobel Prize winners, expert opinion and student satisfaction ratings. Different ranking systems use different combinations of these and other dimensions and give different weights to each component based on how important they think they are in providing an overall assessment of universities. The final rank is the (weighted) average or total of the indicator for each dimension.
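The weighted-index idea can be sketched in a few lines of Python. The dimensions, indicator scores and weights below are purely illustrative and are not taken from any real ranking system.

```python
# Sketch of a weighted index for a hypothetical university ranking.
# Dimension names, scores and weights are illustrative assumptions.
indicators = {"research": 78.0, "grants": 65.0, "satisfaction": 82.0}
weights = {"research": 0.5, "grants": 0.3, "satisfaction": 0.2}

# Weighted average: multiply each dimension's score by its weight and sum.
index = sum(indicators[d] * weights[d] for d in indicators)
print(round(index, 1))  # 74.9
```

Changing the weights changes the final rank, which is why different ranking systems, using the same underlying indicators, can place the same institution quite differently.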

Third, treat the variable as unidimensional and select several indicators to represent the concept and from that compile a single index. The construction of an index (combining different indicators into a single quantitative measure) is known as scaling.

Scaling evolved from various attempts to measure complex concepts such as authoritarianism or self-esteem. Thus, scaling in social science is an attempt to measure abstract concepts.

There are various ways to construct a scale. The three most widely used unidimensional approaches (named after their creators) are:

  • Thurstone or equal-appearing interval scaling
  • Likert or summative scaling
  • Guttman or cumulative scaling

These are discussed below.

Fourth, treat the variable as multi-dimensional and select several indicators to represent each dimension of the concept and from that compile a single index. This process reduces a complex multidimensional concept into a single item represented by the index. This would involve applying a unidimensional approach (such as Guttman, Likert or Thurstone scaling) to each dimension and then combining each sub-index together to form an overall index.

Sometimes this reduction to a single index can be very unsatisfactory, as the combination process loses the integrity of the concept; in such circumstances multi-dimensional scaling needs to be undertaken, which is rather more complex.


8.3.5.3.1 Thurstone scaling
Louis Thurstone’s scale was the first formal attitude measuring technique and was developed in 1928 to measure attitudes towards religion. Thurstone developed three different versions of his scaling method: equal-appearing intervals; successive intervals; and paired comparisons.


8.3.5.3.1.1 The method of equal-appearing intervals
Having identified the aims and purpose, assessed feasibility, explored the background, drafted some preliminary hypotheses and identified the key concepts, it is time to address those key concepts systematically.

First, define them as specifically as possible. The Thurstone method only applies if the concept is unidimensional.

Second, generate a selection of statements that are indicative of the concept (the indicators). The statements need to be formulated in a similar manner, such as statements that one could agree or disagree with (rather than, for example, yes/no questions mixed in with agree/disagree questions).

Third, decide which items work best to illustrate aspects of the concept being operationalised. How do you do this?

One way to select a final set of statements is to ask a group of judges to rate each of the initial items on a scale from 'most favourable' to 'least favourable' indicator of the concept. Then compute the median and interquartile range for each item and rank the items in order. Finally, take a selection of items at regular intervals across the range of medians, choosing at each point the item with the smallest variability (smallest interquartile range).
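The median-and-interquartile-range step can be sketched with Python's standard library. The judges' ratings below are invented for illustration; in practice there would be many more items and judges.

```python
import statistics

# Hypothetical judges' ratings (1-11) for three candidate statements.
ratings = {
    "item_a": [2, 3, 3, 4, 3],
    "item_b": [1, 6, 9, 11, 5],
    "item_c": [7, 7, 8, 7, 8],
}

def summarise(scores):
    """Return (median, interquartile range) for one item's ratings."""
    q = statistics.quantiles(scores, n=4)  # q[0] = Q1, q[2] = Q3
    return statistics.median(scores), q[2] - q[0]

for item, scores in ratings.items():
    med, iqr = summarise(scores)
    # Prefer items with a small IQR: judges agree on the scale value.
    print(item, med, iqr)
```

An item like item_b, where judges disagree widely (large interquartile range), would be discarded in favour of items the judges place consistently.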

Note that the judges are rating items as least favourable through to most favourable indicator of the concept, not rating each item on a personal basis as though they were answering the questionnaire.

William Trochim (2006a) provided an example of a scale to measure attitudes that people might have towards persons with AIDS. A long list of possible statements was rated on a scale from 1 (least favourable attitude towards people with AIDS) to 11 (most favourable). The long list was reduced to the following (slightly adapted in this example); the values in parentheses are the scale points, and all retained items had a very small interquartile range. On the questionnaire these items would be listed in random order.

  • People with AIDS deserve what they got (1).
  • AIDS is good because it helps control the population (2).
  • AIDS will never happen to me (3).
  • I can’t get AIDS if I’m in a monogamous relationship (4).
  • Because AIDS is preventable, we should focus our resources on prevention instead of curing (5).
  • People with AIDS are like my parents (6).
  • If you have AIDS, you can still lead a normal life (8).
  • AIDS doesn’t have a preference, anyone can get it (9).
  • AIDS affects us all (10).
  • People with AIDS should be treated just like everybody else (11).

This is the scale; note that in this case no item came out with a median of 7.

The scale is included in a questionnaire and administered to the sample, and respondents indicate whether they agree or disagree with each item. A respondent's score is the average of the scale values of the items they agree with. If, for example, a respondent agreed with three items that had scale scores of 1, 3 and 5, the respondent's score would be 3. If another respondent agreed with six items that had scale scores of 4, 5, 6, 8, 9 and 10, that respondent's score would be 7.


8.3.5.3.1.2 The method of successive intervals and the method of paired comparisons
The method of successive intervals and the method of paired comparisons, also devised by Thurstone, produce the same final product for the sample: a set of statements, each with a scale rating, to agree or disagree with. They differ in the way the scale is constructed.

The method of successive intervals is very much like the method of equal-appearing intervals but does not assume that rating categories or intervals are of equal width.

The method of paired comparisons requires each judge to determine which of a pair of statements is the more favourable. This is done for all possible pairs and so is only feasible when the list of potential indicators is quite small. For example, a list of 20 statements would require 190 comparisons, which would be very time-consuming and tedious.
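The number of comparisons grows as n(n-1)/2, which is why the method becomes impractical quickly:

```python
from math import comb

# Number of pairwise comparisons for n statements: n(n-1)/2.
for n in (10, 20, 50):
    print(n, comb(n, 2))  # 10 -> 45, 20 -> 190, 50 -> 1225
```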


8.3.5.3.2 Likert or Summative Scales

8.3.5.3.2.1 Introduction to Likert scaling
Likert Scaling is a unidimensional scaling method developed by Rensis Likert, a psychologist, in 1932. A Likert scale is commonly used to measure attitudes, values, perceptions, knowledge and behavioral changes.

A Likert-type scale assumes that the strength or intensity of an experience lies on a linear continuum from, for example, strongly agree to strongly disagree. Respondents may be offered a choice of five, seven or even nine pre-coded responses, with the neutral point being neither agree nor disagree. In most cases, Likert scales use five points to allow the individual to express how much they agree or disagree with a particular statement (for example, strongly disagree, disagree, neither agree nor disagree, agree, strongly agree).

Likert-like scales have also been used to ask respondents to indicate importance, frequency, quality, likelihood and relevance, as well as agreement.

An example of a Likert item might be:

Ecological concerns are the most important issues facing humanity.
(1) strongly disagree (2) disagree (3) undecided (4) agree (5) strongly agree (0) don’t know/can’t answer/missing.

A respondent’s score for the whole Likert-like scale would usually be the sum of the scores for each item in the scale (or the average score for the items answered). So, in the example above, if there were 20 such items in the scale, a respondent who gave 20 ‘strongly agree’ responses would have a total of 100; at the other extreme, the lowest total score would be 20. Where there are missing values (0), it is probably best to take the mean score of the items answered.
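The scoring just described is a simple sum, falling back to the mean of answered items when responses are missing. The response vector below is invented for illustration.

```python
# Likert scoring sketch: sum the item scores, or average the answered
# items when some responses are missing (coded 0 here, as in the text).
responses = [5, 4, 0, 5, 3]  # hypothetical answers; 0 = missing

answered = [r for r in responses if r != 0]
total = sum(answered)                 # sum over answered items
mean_score = total / len(answered)    # mean of answered items
print(total, mean_score)  # 17 4.25
```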

In this example, the assumption is that each point on the scale is an equal distance apart, so the difference between strongly agree and agree is the same as that between agree and undecided, and so on. It also assumes that all 20 items have the same weight, because the scores on each item have simply been added together to make the index score.

It is possible to weight the individual items when constructing the index, and one could give the responses within each item scores that do not reflect equal intervals, such as: (1) strongly disagree (3) disagree (4) undecided (5) agree (7) strongly agree. However, this would not be a Likert scale, which presupposes equal intervals.

Other examples of Likert scales are:

1. Definitely not, 2. Undecided, 3. Definitely will

1. Not aware, 2. Somewhat aware, 3. Usually aware, 4. Very much aware

1. Hardly ever, 2. Occasionally, 3. Sometimes, 4. Frequently, 5. Almost always

1. Very slow, 2. Slow, 3. Average, 4. Fast, 5. Very fast

1. Exceptionally unfavorable, 2. Unfavorable, 3. Somewhat unfavorable, 4. Somewhat favorable, 5. Favorable, 6. Exceptionally favorable

1. Excellent, 2. Very good, 3. Good, 4. Satisfactory, 5. Poor. 6. Very poor, 7. Unacceptable


8.3.5.3.2.2 Constructing a Likert scale
The items for a Likert scale are derived in much the same way as in Thurstone scaling (Section 8.3.5.3.1) but, as shown above, the scale allows graded answers rather than just agree/disagree. The process works as follows.

First, define what it is that is being measured.

Second, create the set of potential scale items, either by using your own knowledge or by engaging others (experts or people familiar with the concept) to help. This may be done by brainstorming (as a group or through virtual means). A large pool of items should be generated at this stage.

Third, have judges rate the large pool of items on a 1-to-5 rating scale, ranging from (1) strongly unfavourable to the concept, through (2) unfavourable, (3) unsure and (4) favourable, to (5) strongly favourable to the concept. Then intercorrelate all pairs of items, based on the ratings of the judges. Discard items that have a low correlation with the total (summed) score across all items (the item-total correlation). There is no fixed discard rule, but a correlation below 0.6 would normally be a good starting point. Most statistics packages can easily compute item-total correlations: first create a new variable that is the sum of all the individual items for each respondent, then add this variable into the correlation matrix computation.

Fourth, identify which of the remaining items best discriminate between high and low scorers. The aim is to have items that correlate highly with the overall average rating and also discriminate well. For each item, compute the average rating given by the top quarter of judges and by the bottom quarter, then perform a t-test [a test of significance] on the difference between the two means. The higher the t-value, the bigger the difference and the better the item discriminates, so use these items. Judgement will be needed as to the best items to retain: keep between 10 and 20 items, preferably all with high item-total correlations and high discrimination (high t-values).
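The discrimination check can be sketched as a two-sample t statistic computed by hand (a Welch-style form, with the groups and ratings below invented for illustration); a statistics package would normally do this for you.

```python
import statistics

def t_value(high_group, low_group):
    """Welch-style t statistic comparing an item's mean rating for the
    top- and bottom-quarter judges; larger |t| = better discrimination."""
    m1, m2 = statistics.mean(high_group), statistics.mean(low_group)
    v1, v2 = statistics.variance(high_group), statistics.variance(low_group)
    n1, n2 = len(high_group), len(low_group)
    return (m1 - m2) / ((v1 / n1 + v2 / n2) ** 0.5)

# Hypothetical ratings of one item by the top and bottom quarter of judges.
top_quarter = [5, 5, 4, 5]
bottom_quarter = [2, 1, 2, 2]
print(round(t_value(top_quarter, bottom_quarter), 2))  # 8.49
```

A large t-value, as here, indicates the item separates high and low scorers cleanly and is worth retaining.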


8.3.5.3.2.3 Issues in developing a Likert scale
Likert scaling is a bipolar scaling method, measuring either positive or negative response to a statement. Sometimes an even-point scale is used, where the middle option of ‘Neither agree nor disagree’ is not available. This is sometimes called a ‘forced choice’ method, since the neutral option is removed (see Allen & Seaman (2007)).

The neutral option can be seen as an easy option to take when a respondent is unsure, and so whether it is a true neutral option is questionable. Robert Armstrong (1987) found negligible differences between the use of ‘undecided’ and ‘neutral’ as the middle option in a five-point Likert scale.

There is disagreement as to whether individual Likert items can be considered as interval-level data or as ordered-categorical data, see for example Susan Jamieson (2004) and Geoff Norman (2010).

There are two primary considerations in this disagreement. First, Likert scales are arbitrary: the value assigned to a Likert item is simply determined by the researcher designing the survey, who makes the decision based on the desired level of detail. However, by convention, Likert items tend to be assigned progressive positive integer values. Likert scales are typically 5- or 7-point scales, and the implication is that a higher response category indicates a ‘better’ response than the preceding value (or ‘worse’ if the scale is constructed in reverse, from better to worse).

The second, more important, issue is whether the difference between each successive point on the item is equivalent. For example, is the difference between ‘strongly agree’ and ‘agree’ the same as between ‘agree’ and ‘neutral’? Equidistant item responses are important, otherwise the analysis may be biased. For example, a four-point Likert item with categories ‘poor’, ‘average’, ‘good’ and ‘very good’ is unlikely to have equidistant categories, since there is only one category that can receive a below-average rating. This would arguably bias any result in favour of a positive outcome. However, even if researchers present what they consider to be equidistant categories, respondents may not interpret them as such.


8.3.5.3.2.4 Possible distortion from the use of Likert scales
Likert scales may be subject to distortion from several causes. Respondents may:

  • avoid using extreme response categories (central tendency bias), especially out of a desire to avoid being perceived as having extremist views; or may be restrained early on in the questionnaire but become more extreme later;
  • agree with statements as presented (acquiescence bias), with this effect especially strong among persons, such as children, developmentally disabled persons, and the elderly or infirm, who are subjected to a culture of institutionalisation that encourages compliance;
  • respond in a (neutral) way that would avoid perceived negative consequences should their answers be used against them;
  • provide answers that they believe will be evaluated as indicating strength or lack of dysfunction;
  • try to portray themselves or their organisation in a light that they consider might be taken more favorably than their true beliefs (social desirability bias).


8.3.5.3.3 Guttman or Cumulative Scales
Guttman scaling, named after Louis Guttman, is also sometimes known as cumulative scaling or scalogram analysis. The purpose of Guttman scaling is to establish a one-dimensional continuum for measuring a concept. This means a set of items or statements ordered so that a respondent who agrees with any specific item in the list will also agree with all the previous items.

For example, imagine a ten-item cumulative scale. If a respondent scores four, that should mean that the respondent agreed with the first four statements. Similarly, a score of 8 should mean the respondent agreed with the first eight items. The object is to find a set of items that perfectly matches this pattern. In practice, this is rarely possible.

Constructing a Guttman scale works as follows. First, define the concept being investigated, for example, attitudes to immigration. Be clear in the definition about what kinds of immigration are being investigated (legal, illegal, refugee, economic, and so on).

Second, develop a large set of items that reflect the concept (with the help of others if required).

Third, get a group of judges to rate the items, for example, as favourable or not favourable towards immigration. (The judges are not being asked for their views on immigration, just whether the item is favourable or not favourable to immigration.)

Fourth, construct a matrix or table that shows the responses of all the judges on all of the items. The rows of the table would be ordered with the judges who identify more items as favourable at the top and those who identify fewer at the bottom. The columns would order the items from most favourable responses on the left to least favourable on the right.

The matrix will show how cumulative the scale is. A perfect scale would look something like this (where + means the judge rated the item favourable and – unfavourable):

Judge   Item 5  Item 1  Item 4  Item 3  Item 7  Item 6  Item 2
  1       +       +       +       +       +       +       +
  4       +       +       +       +       +       +       -
  3       +       +       +       +       +       -       -
  6       +       +       +       -       -       -       -
  2       +       +       -       -       -       -       -
  7       +       -       -       -       -       -       -
  5       -       -       -       -       -       -       -

So, in the example, Judge 4 regards Item 6 as favourable and also all the items to its left (Items 5, 1, 4, 3 and 7). Judge 2 regards Item 1 as favourable, along with the item to its left (Item 5).

In practice, things will not work out perfectly and judgement is needed to identify the best cumulative scale. This may require the aid of statistical techniques, in particular scalogram analysis.
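One simple statistic from scalogram analysis is the coefficient of reproducibility: one minus the proportion of responses that depart from the ideal cumulative pattern. The sketch below uses a simple error convention (mismatches against the ideal pattern with the same number of agreements) and invented response patterns; a rule of thumb is that a scale with reproducibility of at least about 0.9 is acceptable.

```python
# Hypothetical response patterns over five items in scale order
# (1 = agree, 0 = disagree).
patterns = [
    [1, 1, 1, 0, 0],  # perfect cumulative pattern
    [1, 1, 0, 1, 0],  # one agreement appears after a disagreement
]

def guttman_errors(pattern):
    """Mismatches between a pattern and the ideal cumulative pattern
    with the same number of agreements (one simple convention)."""
    n_agree = sum(pattern)
    ideal = [1] * n_agree + [0] * (len(pattern) - n_agree)
    return sum(a != b for a, b in zip(pattern, ideal))

total_errors = sum(guttman_errors(p) for p in patterns)
total_responses = len(patterns) * len(patterns[0])
reproducibility = 1 - total_errors / total_responses
print(reproducibility)  # 0.8
```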

When administering the scale, each item has a scale score depending on how favourable it has been judged to be. A respondent’s score is the sum of the scale values of every item they agree with.
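The scoring rule is a straightforward sum; the item names and scale values below are illustrative only.

```python
# Hypothetical Guttman scale values in scale order.
scale_values = {"item_5": 1, "item_1": 2, "item_4": 3, "item_3": 4}

# A respondent agreed with the first three items in the scale order.
agreed = ["item_5", "item_1", "item_4"]

# Score = sum of the scale values of the agreed items.
score = sum(scale_values[i] for i in agreed)
print(score)  # 6
```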

