Q1

i

df <- read.csv("lead_concentration.csv")
df_fall_1977 <- na.omit(df$Fall.1977)
 
n = length(df_fall_1977)
m = mean(df_fall_1977)
v = var(df_fall_1977)
s = sd(df_fall_1977)
 
alpha = (1 - 0.9) / 2
t_alpha = qt(1 - alpha, df=n-1)
 
c(m - t_alpha * s / sqrt(n), m + t_alpha * s / sqrt(n))

The 90% CI of mean is [8.504470, 9.858688]

ii

The CI for is

The CI for is

X_one_minus_alpha = qchisq(alpha, df=n-1)
X_alpha = qchisq(1 - alpha, df=n-1)
 
1 / sqrt(
    c(
        (n - 1) * v / X_one_minus_alpha,
        (n - 1) * v / X_alpha
    )
)

The 90% CI of is [0.3260397, 0.4800552]

Q2

a

Therefore is an unbiased estimator of

b

c

d

Q3

i

The PDF of is

The expectation of is

The expectation of is

To make it unbiased, multiply by 2, then

Therefore is an unbiased estimator of

ii

Since is the maximum time interval, is the minimum value of , which maximize

iii

The CDF of is

Differentiate w.r.t to , the PDF of is


The bias of is

Therefore is a biased estimator

iv

Note that

MSE of is

Note that

MSE of is

Comparing the MSE of and

for all , therefore the MSE of is larger