Q1
i
df <- read.csv("lead_concentration.csv")
df_fall_1977 <- na.omit(df$Fall.1977)
n = length(df_fall_1977)
m = mean(df_fall_1977)
v = var(df_fall_1977)
s = sd(df_fall_1977)
alpha = (1 - 0.9) / 2
t_alpha = qt(1 - alpha, df=n-1)
c(m - t_alpha * s / sqrt(n), m + t_alpha * s / sqrt(n))
The 90% CI of mean is [8.504470, 9.858688]
ii
The CI for σ2 is
[χn−1,α2(n−1)Sn−12,χn−1,1−α2(n−1)Sn−12]
The CI for σ1 is
(n−1)Sn−12χn−1,1−α2,(n−1)Sn−12χn−1,α2
X_one_minus_alpha = qchisq(alpha, df=n-1)
X_alpha = qchisq(1 - alpha, df=n-1)
1 / sqrt(
c(
(n - 1) * v / X_one_minus_alpha,
(n - 1) * v / X_alpha
)
)
The 90% CI of σ1 is [0.3260397, 0.4800552]
Q2
a
Bias(p^,p)=E(p^)−p=E(Xˉ)−p=E(nY)−p=E(nX1+X2+⋯+Xn)−p=n1i=1∑nE(Xi)−p=n1np−p=0
Therefore nY is an unbiased estimator of p
b
Var(Xˉ)=Var(nX1+X2+⋯+Xn)=n21Var(X1+X2+⋯+Xn)=n21[Var(X1)+Var(X2)+⋯+Var(Xn)]=n2np(1−p)=np(1−p)
c
Var(Xˉ)=E(Xˉ2)−E(Xˉ)2E(Xˉ2)=np(1−p)=np(1−p)+p2
E(nXˉ(1−Xˉ))=E(nXˉ−Xˉ2)=n1(E(Xˉ)−E(Xˉ2))=n1(p−(np(1−p)+p2))=n1(nnp−np2−p+p2)=n2p2(−n+1)+p(n−1)=n2(n−1)(−p2+p)=n2(n−1)p(1−p)
d
Bias=E(cXˉ(1−Xˉ))−np(1−p)nc(n−1)p(1−p)c=0=np(1−p)=n−11
\pagebreak
Q3
i
The PDF of Xi is
p(x)=T1,0≤x≤T
The expectation of Xi is
E(Xi)=∫0Txp(x)dx=∫0TxT1dx=T12T2=2T
The expectation of Xˉ is
E(Xˉ)=E(nX1+X2+⋯+Xn)=n1E(X1+X2+⋯+Xn)=n1×n×2T=2T
To make it unbiased, multiply by 2, then T^=2Xˉ
Bias(T^)=Bias(2Xˉ)=E(2Xˉ)−T=T−T=0
Therefore T^=2Xˉ is an unbiased estimator of T
ii
Since T is the maximum time interval, max{X1,X2,…,Xn} is the minimum value of T, which maximize Tn1
T^MLE=argmax{p(X1,X2,…,Xn)}=argmax{Tn1}=max{X1,X2,…,Xn}
iii
The CDF of T^MLE is
F(x)=P(T^MLE≤x)=P(max(X1,X2,…,Xn)≤x)=P(X1≤x)P(X2≤x)…P(Xn≤x)=(x×T1)n=(Tx)n
Differentiate w.r.t to x, the PDF of T^MLE is
f(x)=Tn1nxn−1=Tnnxn−1
\pagebreak
The bias of T^MLE is
Bias(T^MLE)=E(T^MLE)−T=∫0TxTnnxn−1dx−T=Tnn[n+1xn+1]0T−T=Tnnn+1Tn+1−T=n+1nT−T=0
Therefore T^MLE is a biased estimator
iv
Note that
σX2=Var(X)=E(X2)−[E(X)]2=∫0Tx2T1dx−(2T)2=T1[3x3]0T−4T2=3T2−4T2=12T2
\pagebreak
MSE of T^ is
MSE(T^)=(Bias(T^,T))2+Var(T^)=0+Var(2Xˉ)=4Var(Xˉ)=4×nσ2=4×12nT2=3nT2
Note that
Var(T^MLE)=E(max(X1,X2,…,Xn)2)−[E(max(X1,X2,…,Xn))]2=∫0Tx2Tnnxn−1dx−(n+1nT)2=Tnn[n+2xn+2]0T−(n+1nT)2=n+2nT2−(n+1nT)2=(n+2)(n+1)2(n+1)2nT2−(n+2)n2T2=(n+2)(n+1)2n3T2+2n2T2+nT2−n3T2−2n2T2=(n+2)(n+1)2nT2
\pagebreak
MSE of T^MLE is
MSE(T^MLE)=(Bias(T^MLE,T))2+Var(T^MLE)=(n+1nT−T)2+(n+2)(n+1)2nT2=(n+1nT−T(n+1))2+(n+2)(n+1)2nT2=(n+1nT−nT−T)2+(n+2)(n+1)2nT2=(n+1)2T2+(n+2)(n+1)2nT2=(n+1)2(n+2)nT2+2T2+nT2=(n+1)2(n+2)2T2(n+1)=(n+1)(n+2)2T2
Comparing the MSE of T^ and T^MLE
MSE(T^MLE)MSE(T^)=(n+2)(n+1)2nT22T2(n+1)(n+2)=2(n+1)n=2n+2n
2n+2n<1 for all n≥1, therefore the MSE of T^MLE is larger