Exchangeability

Exchangeable random variables

Prediction without covariates

Lemma 1

Suppose Z1,ZnZ_1, \cdots Z_n are exchangeable random variables. For any α{0,1}\alpha \in \{0, 1\}, P[ZnQ^n(α)]αP[Z_n \leq \hat{Q}_n(\alpha)] \geq \alpha

Moreover, if Z1,ZnZ_1, \cdots Z_{n} are a.s distinct, P[ZnQ^n+1(α)]α+1nP[Z_{n} \leq \hat{Q}_{n+1}(\alpha)] \leq \alpha + \frac{1}{n}

  • key proof: using the uniform distribution or these order statistics?

Lemma 2

Suppose Z1,Zn+1Z_1, \cdots Z_{n+1} are exchangeable random variables. For any α{0,1}\alpha \in \{0, 1\}, define αn\alpha_n as αn=(1+1n)α\alpha_n = (1 + \frac{1}{n})\alpha. Then, P[Zn+1Q^n+1(αn)]αP[Z_{n+1} \leq \hat{Q}_{n+1}(\alpha_n)] \geq \alpha

Moreover, if Z1,Zn+1Z_1, \cdots Z_{n+1} are a.s distinct, P[Zn+1Q^n+1(αn)]α+1nP[Z_{n+1} \leq \hat{Q}_{n+1}(\alpha_n)] \leq \alpha + \frac{1}{n}.

  • Key of the proof:

    • use lemma 1 and change n points to (n+1) points of these order statistics

    • and you cannot jump for two points since you only added one point into the empirical distribution

One-sided prediction interval without covariates

Last updated