18 Confidence intervals

A point estimate computed from a single random sample will almost never equal the population parameter exactly. Interval estimation instead provides a range of values within which the parameter is expected to fall with a stated level of confidence. Such a range is called a confidence interval.

Learning objectives

Understand the concept of confidence intervals
Calculate and interpret confidence intervals for mean
Calculate and interpret confidence intervals for proportion

18.1 Confidence interval for mean

18.1.1 The logic behind constructing a confidence interval

We will base the construction of a confidence interval on two key concepts:

The interval is around the point estimate, which represents our best estimate of the population parameter.
The standard error is utilized to quantify the extent of variability around the point estimate.

According to the Central Limit Theorem, the sampling distribution of the mean approaches a normal distribution as the sample size increases (Chapter 17). The standard deviation of this sampling distribution is the standard error of the mean, $σ_{\bar{x}}$. Consequently, approximately 95% of sample means lie within $\pm 1.96 σ_{\bar{x}}$ of the mean of the sampling distribution, $μ_{\bar{x}}$ (the empirical rule; Chapter 15). This multiple of the standard error, $1.96 σ_{\bar{x}}$, is referred to as the margin of error of the confidence interval.
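The 1.96 multiplier can be checked numerically. The following is a minimal sketch using base R's `pnorm()` and `qnorm()`; the variable names `area_within` and `z_95` are illustrative and do not appear elsewhere in the chapter.

```r
# Area of the standard normal distribution within +/- 1.96 standard deviations
area_within <- pnorm(1.96) - pnorm(-1.96)
area_within

# Quantile that leaves 2.5% in the upper tail (the unrounded multiplier)
z_95 <- qnorm(0.975)
z_95
```

The first value is very close to 0.95, and the second is the unrounded multiplier that is usually reported as 1.96.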

Figure 18.1: Sampling distribution of mean and 95% CI.

In this case, the 95% confidence interval (CI) for the mean is:

$$95\% \ \text{CI} = μ_{\bar{x}} \pm 1.96 \, σ_{\bar{x}} = μ_{\bar{x}} \pm 1.96 \, \frac{σ}{\sqrt{n}} \tag{18.1}$$

When the sample size n is sufficiently large, the sample mean provides a good estimate of the population mean. Additionally, if the population standard deviation σ is unknown, we can estimate it by using the sample standard deviation s, and the formula becomes:

$$95\% \ \text{CI} = \bar{x} \pm 1.96 \, SE_{\bar{x}} = \bar{x} \pm 1.96 \, \frac{s}{\sqrt{n}} \tag{18.2}$$

Example

The serum creatinine of a sample of 121 elderly men has a mean of 1.15 mg/dl with a standard deviation of 0.3 mg/dl. The 95% confidence interval for the mean creatinine of this population is calculated as follows:

Lower limit of 95% CI

$\text{LL} = 1.15 - 1.96 \frac{0.3}{\sqrt{121}} = 1.15 - 1.96 \frac{0.3}{11} = 1.15 - 0.0535 = 1.0965$

Upper limit of 95% CI

$\text{UL} = 1.15 + 1.96 \frac{0.3}{\sqrt{121}} = 1.15 + 1.96 \frac{0.3}{11} = 1.15 + 0.0535 = 1.2035$

We are 95% confident that the mean serum creatinine of the population of elderly men is between 1.10 mg/dl and 1.20 mg/dl.

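For a 95% confidence interval, each of the grey areas in Figure 18.1 equals 2.5% of the distribution, because the remaining 5% (100 - 95) is divided equally between the two tails of the normal distribution. The multiplier is therefore the quantile that leaves 2.5% in the upper tail, `qnorm(0.025, lower.tail = FALSE)`.

In R: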

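n <- 121        # sample size
mean <- 1.15    # sample mean (mg/dl)
s <- 0.3        # sample standard deviation (mg/dl)
z <- qnorm(0.025, lower.tail = FALSE)  # upper 2.5% quantile, approximately 1.96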

# compute lower limit of 95% CI
lower_95CI <- mean - z*(s/sqrt(n))
lower_95CI

# compute upper limit of 95% CI
upper_95CI <- mean + z*(s/sqrt(n))
upper_95CI

[1] 1.096546
[1] 1.203454

18.1.2 Confidence level

There is no particular reason for choosing a 95% confidence level other than convention; confidence levels of 90% or 99% are sometimes preferred, depending on the context. For example, when a 99% confidence level is chosen, the margin of error for the mean becomes $\pm 2.58 σ_{\bar{x}}$ (the empirical rule; Chapter 15).

Figure 18.2: Sampling distribution of mean and 99% CI.

Now, each of the grey areas in Figure 18.2 equals 0.5% of the distribution because the total percentage of 1% (100-99) is equally divided between both sides of the normal distribution. In this instance, the 99% CI for the mean is:

z2 <- qnorm(0.005, lower.tail = FALSE)

# compute lower limit of 99% CI
lower_99CI <- mean - z2*(s/sqrt(n))
lower_99CI

# compute upper limit of 99% CI
upper_99CI <- mean + z2*(s/sqrt(n))
upper_99CI

[1] 1.07975
[1] 1.22025

We observe that the 99% CI provides a higher level of confidence but is wider (1.08 to 1.22 mg/dl), whereas the 95% CI is narrower (1.10 to 1.20 mg/dl) but carries less confidence. The increased level of confidence therefore comes at the expense of precision, and the loss is more pronounced in smaller samples, where the standard error is larger.
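The trade-off between confidence and precision can be tabulated for several confidence levels at once. The sketch below assumes a small helper, `ci_mean_z()`, which is a hypothetical name introduced here for illustration; it applies the same z-based formula to the serum creatinine example.

```r
# Hypothetical helper (not part of the original text): z-based CI for a mean
ci_mean_z <- function(x_bar, s, n, conf_level = 0.95) {
  alpha <- 1 - conf_level
  z <- qnorm(alpha / 2, lower.tail = FALSE)  # critical value
  moe <- z * s / sqrt(n)                     # margin of error
  c(lower = x_bar - moe, upper = x_bar + moe, width = 2 * moe)
}

# Serum creatinine example: n = 121, sample mean = 1.15, s = 0.3
conf_levels <- c(0.90, 0.95, 0.99)
ci_table <- t(sapply(conf_levels, function(cl) ci_mean_z(1.15, 0.3, 121, cl)))
rownames(ci_table) <- c("90%", "95%", "99%")
round(ci_table, 3)
```

Each row gives the limits and the total width of the interval; the width grows as the confidence level increases.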

18.1.3 Understanding the confidence interval

The intuitive meaning of “confidence” in a confidence interval might not be immediately clear. To understand what confidence truly represents, let’s consider once more the example of a population consisting of 100,000 adults, with a mean blood pressure (BP) of μ = 126 mmHg and a standard deviation of σ = 10 mmHg (Figure 18.3).

Figure 18.3: A hypothetical population of 100,000 observations. The dashed black line represents the population mean, μ.

We proceed by generating 100 random samples of size 10 from our population distribution and construct a 95% confidence interval for the mean of each sample.

Figure 18.4: 100 Sample Means of Size 10 (with 95% Intervals) from the Population.

In Figure 18.4, each horizontal bar is a confidence interval (CI) centered on a sample mean (point). The intervals all have the same length, but are centered on different sample means as a result of random sampling from the population. The five red intervals do not cover the population mean (the vertical dashed line; $μ$ = 126 mmHg), whereas the 95 blue intervals do. This agrees with what we expect under a 95% confidence level, where roughly 95% of the intervals should include the population parameter.
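A simulation along these lines can be sketched in a few lines of R. This is not the code used to produce Figure 18.4; it rests on several assumptions: the population is taken to be normally distributed, samples are drawn with `rnorm()` rather than from a stored population of 100,000 values, and every interval uses the known σ (consistent with all intervals having the same length). The seed is arbitrary.

```r
set.seed(123)  # arbitrary seed, for reproducibility only

mu <- 126         # population mean (mmHg)
sigma <- 10       # population standard deviation (mmHg), assumed known
n <- 10           # sample size
n_samples <- 100  # number of repeated samples

z <- qnorm(0.025, lower.tail = FALSE)
moe <- z * sigma / sqrt(n)   # same margin of error for every sample

# Draw the samples and compute each sample mean
sample_means <- replicate(n_samples, mean(rnorm(n, mean = mu, sd = sigma)))

# Does each interval cover the population mean?
covers <- (sample_means - moe <= mu) & (mu <= sample_means + moe)
sum(covers)   # number of intervals (out of 100) that contain mu
```

The count varies from run to run when the seed is changed, but over many repetitions it averages about 95 out of 100.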

18.1.4 Sample size and confidence interval

Next, we construct the 95% confidence intervals of 100 randomly generated samples of size 50 from our population (Figure 18.5):

Figure 18.5: 100 Sample Means of Size 50 (with 95% Intervals) from the Population.

Comparing Figure 18.4 and Figure 18.5, we notice two key trends as the sample size increases from 10 to 50:

The sample means (points) tend to lie closer to the population mean (black dashed line).
The uncertainty around the estimate shrinks (confidence intervals become narrower).
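The narrowing of the intervals follows directly from the standard error formula. As a brief sketch, assuming the known population standard deviation σ = 10 mmHg is used for both sample sizes:

```r
sigma <- 10                      # population SD from the example (mmHg)
n <- c(10, 50)                   # the two sample sizes compared
z <- qnorm(0.025, lower.tail = FALSE)

se  <- sigma / sqrt(n)           # standard error of the mean
moe <- z * se                    # margin of error (half-width of the 95% CI)

round(data.frame(n = n, se = se, moe = moe, width = 2 * moe), 2)
```

Because the margin of error is proportional to $1 / \sqrt{n}$, increasing the sample size from 10 to 50 multiplies it by $\sqrt{10/50} \approx 0.45$; a five-fold increase in sample size therefore narrows the interval by a little more than half.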

A confidence interval is commonly expressed as 90% CI, 95% CI, or 99% CI, indicating the level of confidence associated with the estimate. The percentage reflects the proportion of intervals, constructed from repeated experiments, that would contain the population parameter (long-run interpretation).

Choosing an appropriate confidence level and sample size depends on the specific needs of the analysis and on the trade-off between certainty and precision.

18.2 Confidence interval for proportion (normal approximation)

Let X be a random variable representing the observed number of individuals in a sample with a binary characteristic, such as having a disease. Our best estimate of the population proportion, p, is given by the sample proportion $\hat{p} = \frac{X}{n}$, where n is the sample size. If we repeatedly draw samples of size n from our population and calculate the sample proportions $\hat{p}_{1} = \frac{X_{1}}{n}$, $\hat{p}_{2} = \frac{X_{2}}{n}$, $\hat{p}_{3} = \frac{X_{3}}{n}$, and so forth, then, under the assumption that the sample size is sufficiently large and satisfies the condition $\min(n p, n (1 - p)) \geq 5$, the sampling distribution of the proportion would approximate a normal distribution, $N (μ_{\hat{p}} = p, σ_{\hat{p}}^{2} = \frac{p (1 - p)}{n})$ (Figure 18.6).
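The mean and standard deviation of this sampling distribution can be checked by simulation. The sketch below is illustrative only: the "true" proportion `p = 0.10`, the number of repeated samples, and the seed are all assumed values chosen for the demonstration, and each sample count is drawn from a binomial distribution with `rbinom()`.

```r
set.seed(2024)      # arbitrary seed
p <- 0.10           # assumed population proportion (illustrative)
n <- 317            # sample size
n_samples <- 10000  # number of repeated samples (illustrative)

# Each draw is the number of individuals with the characteristic in one sample
x <- rbinom(n_samples, size = n, prob = p)
p_hat <- x / n

mean(p_hat)               # should be close to p
sd(p_hat)                 # should be close to the theoretical value below
sqrt(p * (1 - p) / n)     # theoretical standard deviation of p_hat
```

A histogram of `p_hat` (for example with `hist(p_hat)`) should look approximately bell-shaped, since here $\min(n p, n (1 - p)) = 31.7$ is well above 5.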

Figure 18.6: Sampling distribution of proportion and 95% CI.

Similar to the confidence interval for the mean (Equation 18.1), a confidence interval for a proportion can be constructed as follows:

$$95\% \ \text{CI} = μ_{\hat{p}} \pm 1.96 \, σ_{\hat{p}} = p \pm 1.96 \sqrt{\frac{p (1 - p)}{n}} \tag{18.3}$$

and when the value of p is unknown, it is replaced with the sample proportion $\hat{p}$ :

$$95\% \ \text{CI} = \hat{p} \pm 1.96 \, SE_{\hat{p}} = \hat{p} \pm 1.96 \sqrt{\frac{\hat{p} (1 - \hat{p})}{n}} \tag{18.4}$$

where the standard error of the proportion is $SE_{\hat{p}} = \sqrt{\frac{\hat{p} (1 - \hat{p})}{n}}$.

Example

Suppose a pulmonologist chooses a random sample of 317 patients from the patient register, and finds that 34 of them have a history of suffering from chronic obstructive pulmonary disease (COPD). The 95% confidence interval for the proportion of COPD is calculated as follows:

$\hat{p} = \frac{X}{n} = \frac{34}{317} = 0.107 \text{ or } 10.7\%$

Additionally, the condition $\min(n p, n (1 - p)) \geq 5$ is satisfied when the unknown $p$ is replaced by $\hat{p}$:

$n \hat{p} = 317 \times 0.107 = 33.9 > 5$

$n (1 - \hat{p}) = 317 \times (1 - 0.107) = 317 \times 0.893 = 283 > 5$

Lower limit of 95% CI

$\text{LL} = 0.107 - 1.96 \sqrt{\frac{0.107 (1 - 0.107)}{317}} = 0.107 - 0.034 = 0.073 \text{ or } 7.3\%$

Upper limit of 95% CI

$\text{UL} = 0.107 + 1.96 \sqrt{\frac{0.107 (1 - 0.107)}{317}} = 0.107 + 0.034 = 0.141 \text{ or } 14.1\%$

Based on our random sample, we are 95% confident that the percentage of patients with COPD falls within the range of 7.3% to 14.1%.

In R:

n <- 317   # sample size
x <- 34    # number of patients with a history of COPD

# calculate the proportion
p_hat <- x/n
p_hat

# check the assumption min(np, n(1-p)) ≥ 5
min(c(n*p_hat, n*(1 - p_hat)))

[1] 0.1072555
[1] 34

z <- qnorm(0.025, lower.tail = FALSE)
se <- sqrt(p_hat*(1 - p_hat)/n)

# compute lower limit of 95% CI
lower_95CI <- p_hat - z*se
lower_95CI

# compute upper limit of 95% CI
upper_95CI <- p_hat + z*se
upper_95CI

[1] 0.07319182
[1] 0.1413192
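The steps above can be wrapped in a small reusable function. The sketch below assumes a hypothetical helper named `ci_prop_wald()` (the name and the warning message are introduced here for illustration); it implements only the normal-approximation interval of Equation 18.4 and warns when the sample-size condition is not met.

```r
# Hypothetical helper (not from the original text): normal-approximation CI
ci_prop_wald <- function(x, n, conf_level = 0.95) {
  p_hat <- x / n
  # the approximation is only reasonable when both counts are at least 5
  if (min(n * p_hat, n * (1 - p_hat)) < 5) {
    warning("min(n*p_hat, n*(1 - p_hat)) < 5: normal approximation may be poor")
  }
  z  <- qnorm((1 - conf_level) / 2, lower.tail = FALSE)  # critical value
  se <- sqrt(p_hat * (1 - p_hat) / n)                    # standard error
  c(estimate = p_hat, lower = p_hat - z * se, upper = p_hat + z * se)
}

# COPD example: 34 of 317 patients
copd_ci <- ci_prop_wald(x = 34, n = 317)
round(copd_ci, 3)
```

For the COPD data this reproduces the limits computed above (about 0.073 and 0.141), and passing a different `conf_level`, such as 0.99, gives the corresponding wider interval.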