OCAIR

Overseas Chinese Association for Institutional Research
An AIR Affiliate That Supports IR Professionals Since 1996

Calculating Confidence Interval

Feb. 2002

Dongyang Li: Here is a quick question: If a survey is conducted among 500 students and 250 respond. Say for the question: "Are you satisfied with your freshman year?" 20% says "very satisfied". What is the confidence interval for this result? Or do we need other information to infer the confidence interval?


Beiling Xiao: Dongyang:

A 95% confidence interval of 20% of 250 said satisfied with freshman year is:
0.20 - 1.96*square root of {[0.20 * (1-0.20)]/250}, 0.20 + 1.96*square root of {[0.20*(1-0.20)]/250} that is: 0.20 - 1.96*0.0253, 0.20 + 1.96*0.0253 (0.20 - 0.0496, 0.20 + 0.0496) or (15.0% , 25.0%)
You do not need other info to infer the confidence onterval. Hope this helps. Beiling Xiao


Willard: Dongyang, et al.,

I agree in essence with the calculation of the confidence intervals that you have just seen.

However, the 50% response rate still imposes a serious threat from nonresponse bias. The confidence interval for the target population (which includes those who did not respond)could be much wider than the confidence interval BX and XF suggested.

[William G.] Cochran, Sampling Techniques,3rd Edition (1977) recommends calculating two additional confidence intervals as a means of exploring the sensitivity (or risk) that the analyst incurs when nonresponse is high. In a "best-case scenario"---although we realize this is the unlikely state of nature in reality---the analyst would calculate the confidence interval under the assumption that all the nonrespondents would have marked "highly satisfied" if they had really done so. In a "worst-case scenario"---somewhat unlikely in the real world but more likely than the best-case scenario---the analyst would calculate the confidence interval under the asumption that none of the nonrespondents would have marked "highly satisfied" if they had really done so.

The bottom line is that severe nonresponse creates major problems in the interpretation of "standard" confidence intervals. We often hope that consumers of statistical analysis will consider the response rate when a report displays the confidence interval, but they don't (as a rule) pay that much attention. Cochran's sensitivity analysis drives home the risk factor inherent in the potential for nonresponse bias in such a survey as yours---even though most analysts feel that mention of the response rate is sufficient "disclosure." Some federal reports include Cochran's method (albeit in their appendices), and I have included them in my past analyses of survey data. Unfortunately, most analysts have not adopted this particular technique, and it does not get reported often (for obvious reasons). But if the costs for erroneous estimation are high, calculation for "internal use" is very informative to the analyst(although not necessarily helpful to administrators).

A special implication is noteworthy. If Donyang were finding the confidence interval for the "very dissatisfied" category, then I would strongly urge calculating the "worst-case scenario"---where we assume that all nonrespondents are assumed to feel that very dissatisfied would have been their response. I tend to believe that many nonrespondents avoid answers that may jeopardize them or conflict with the social-desirability response set....It is less likely for nonrespondents to withhold a response because they feel very positive towards a subject.

A final, perhaps superfluous consideration, concerns the sampling design. Dongyang's survey question does not specify this, but it's relevant. If the data come from a sample design that is a stratified or multi-stage design, then the analyst should apply the appropriate sampling weights in calculating the confidence interval for the population total. The adjustment resulting from the sample design weighting may or may not be material, depending upon the design, the population, and the responses. Since many surveys of institutional populations use classes as a sampling unit (that is, they draw a sample of classes and then survey all students, or a further sample thereof, within those selected classes), the confidence interval should consider that these samples are essentially cluster samples (implying that weighting for a cluster sample design is needed...).

Well, that's enough overkill from me, but it is an intriguing topic.


Willard: A while back, the astute John Lin tried out my suggestion below and noted that something did not seem right. I got occupied with my many duties here, but I finally found my error after I dug up my Cochran book.

On page 362, Cochran says the following: "In calculating the lower limit, assume that all sample nonrespondents would have given a negative response. In calculating the upper limit, assume that all sample respondents would have given a positive response...The limits are a little wider..." Cochran's didactic example quoted above pertained to just a 20% nonresponse rate. As the nonresponse rate increases, so does the width of the interval suggested in the quote. The error in my posting (see below in my Jan.9 e-mail)was my prescription to calculate two separate confidence intervals. Cochran's method really produces one confidence interval. It just uses the worst-case scenario to find the improvised lower limit and the best-case scenario to find the improvised upper limit---making just one confidence interval.

I apologize to the listserve readers for missing this. I wrote the Jan.9 message without referring directly to the Cochran text. It has been about four years since my use of the Cochran method in an analysis. (Yes, I must be exhibiting some memory decay here...) We should thank (as I do now)John for his keen observation here...(FYI, Shuqin Guo also questioned my Jan.9 message, so she deserves credit here too.)