Lecture-16: Weak convergence of random variables
1 Convergence in distribution
Definition 1.1. A sequence (Xn:n∈N)of random variablesconverges in distributionto a random vari- ableXif
limn FXn(x) =FX(x)
at all continuity pointsxofFX. Convergence in distribution is denoted by limnXn=Xin distribution.
Proposition 1.2. Let (Xn :n∈N)be a sequence of random variables and let X be a random variable. Then the following are equivalent:
(a) limnXn=X in distribution.
(b) limnE[g(Xn)] =E[g(X)]for any bounded continuous function g.
(c) Characteristic functions converge point-wise, i.e.limnΦXn(u)→ΦX(u)for each u∈R.
Proof. Let(Xn:n∈N)be a sequence of random variables and letXbe a random variable. We will show that(a) =⇒ (b) =⇒ (c) =⇒ (a).
(a) =⇒ (b): Let limnXn=Xin distribution, then limnR
x∈Rg(x)dFXn(x) =R
x∈Rg(x)limndFXn(x)by the bounded convergence theorem for any bounded continuous functiong.
(b) =⇒ (c): Let limnE[g(Xn)] =E[g(X)]for any bounded continuous functiong. Taking g(x) =ejux, we get the result.
(c) =⇒ (a): The proof of this part is technical and is omitted.
Example 1.3 (Convergence in distribution but not in probability). Consider a sequence of non- degenerate continuousi.i.d. random variables(Xn:n∈N)and independent random variableXwith the common distribution FX. Then FXn =FX for all n∈N, and hence limnXn =X in distribution.
However, for anyn∈Nande>0, from the monotonicity of distribution function, we have P{|Xn−X|>e}=E1{Xn∈[X−e,X+e]}/ =EFX(X+e)−EFX(X−e)>0.
Lemma 1.4 (Convergence in probability implies in distribution). Consider a sequence (Xn:n∈N)of random variables and a random variable X, such thatlimnXn=X in probability, thenlimnXn=X in distribution.
Proof. Fixe>0, and consider the eventEn,{ω∈Ω:|Xn−X|(ω)>e}={Xn∈/[X−e,X+e]} ∈F. We further define eventsAn(x),{Xn6x}andA(x),{X6x}, then we can write
An(x)∩A(x+e)⊆A(x+e), An(x)∩Ac(x+e)⊆En, A(x−e)∩An(x)⊆An(x), A(x−e)∩Acn(x)⊆En.
1
From the above set relations, law of total probability, and union bound, we have F(x−e)−P(En)6Fn(x)6F(x+e) +P(En). From the convergence in probability, we have limnP(En) =0 get
F(x−e)6lim inf
n Fn(x)6lim sup
n Fn(x)6F(x+e). We get the result at the continuity points ofFX, since the choice ofewas arbitrary.
Theorem 1.5 (Central Limit Theorem). Consider ani.i.d. random sequence(Xn:n∈N)withEXn =µand Var(Xn) =σ2, defined on the probability space (Ω,F,P). We define the n-sum as Sn =∑ni=1Xi and consider a standard normal random variable Y with density function fY(y) =√1
2πe−y
2
2 for all y∈R. Then, limn
Sn−nµ σ
√n =Y in distribution.
Proof. The classical proof is using the characteristic functions. LetZi,Xiσ−µfor alli∈N, then the shifted and scaledn-sumSn−nµ
σ
√n =∑ni=1ZiWe use the third equivalence in Proposition 1.2 to show that the charac- teristic function of converges to the characteristic function of the standard normal. We define the character- istic functions
Φn(u),Eexp
ju(Sn−nµ) σ
√n
, ΦZi(u),Eexp(juZi), ΦY(u),Eexp(juY). We can compute the characteristic function of the standard normal as
ΦY(u) =√1 2π
Z
y∈Re−u
2
2 exp
−(y−ju)2 2
dy=e−u
2 2 .
Since(Zi:n∈N)is a zero meani.i.d. sequence, and using the Taylor expansion of the characteristic func- tion, we have
Φn(u) =
∏
n i=1Eexp
ju(Xi−µ) σ
√n
=
ΦZ1
√u n
n
=
1− u
2
2n +o u2
n n
. For anyu∈R, taking limitn∈N, we get the result.
2 Strong law of large numbers
Definition 2.1. For a random sequence(Xn:n∈N)with bounded meanE|Xn|<∞, we define then-sum asSn,∑ni=1Xiand the empiricaln-mean Snn for eachn∈N. For eachn∈N, we define event
En,{ω∈Ω:|Sn−ESn|>ne} ∈F.
Theorem 2.2 (L4strong law of large numbers). Let(Xn:n∈N)be a sequence of pair-wise uncorrelated random variables with bounded meanEXnand uniformly bounded fourth central momentE(Xn−EXn)46B<∞for all n∈N. Then, the empirical n-mean converges tolimnESn
n almost surely.
Proof. From the Markov’s and Minkowski’s inequality, we have P(En)6 ∑ni=1E(Xn4e2i−µ)4 6 B+µn3e24. It follows that the∑n∈NP(En)<∞, and hence by Borel Canteli Lemma, we haveP{Ecnfor all but finitely manyn}= 1. Since, the choice ofewas arbitrary, the result follows.
Theorem 2.3 (L2strong law of large numbers). Let(Xn:n∈N)be a sequence of pair-wise uncorrelated random variables with meanEXn and uniformly bounded variance Var(Xn)6B<∞for all n∈N. Then, the empirical n-mean converges tolimnESn
n almost surely.
2
Proof. For eachn∈N, we define eventsFn,En2, and Gn,
( max
n26k<(n+1)2
|Sk−Sn2−E(Sk−Sn2)|>n2e )
= [
n26k<(n+1)2
( ω∈Ω:
∑
k i=n2Xi−EXi
>n2e )
.
From the Markov’s inequality and union bound, we have P(Fn)6 ∑
n2
i=1Var(Xi) n4e2 6 B
n2e2, P(Gn)6
∑
2n k=0kB
n4e6 (2n+1)B n3e . Therefore,∑n∈NP(Fn)<∞and∑n∈NP(Gn)<∞, and hence by Borel Canteli Lemma, we have
limn
Sn2−ESn2
n2 =lim
n
Sk−Sn2−E(Sk−Sn2)
n2 =0 a.s.
The result follows from the fact that for anyk∈N, there existsn∈Nsuch thatk∈n2, . . . ,(n+1)2−1 and hence
|Sk−ESk|
k 6
|Sn2−ESn2|
n2 +|Sk−Sn2−E(Sk−Sn2)|
n2
.
Theorem 2.4 (L1strong law of large numbers). Let(Xn:n∈N)be a sequence of pair-wise uncorrelated random variables such thatE|Xn|6B<∞for all n∈N. Then, the empirical n-mean converges tolimnESn
n almost surely.
3