5 Real Data Analysis
Lemma 1 Kraus (2015), Proposition 1)
(a) Let E X 2<∞and assumption (A1) be satisfied. Then E(|| ˆμNME−μ||2)= O(n−1)forn→ ∞.
(b) Let E X 4 < ∞ and observation pattern (A2) holds. Then E(|| ˆCNME
X −
CX||2S)=O(n−1)forn→ ∞(here|| · ||Sdenotes the Hilbert–Schmidt norm).
Lemma 2 (Kneip and Liebl (2020), Theorem 4.1) Under the assumptions (B1)–
(B5), we have that
(a) supt∈T| ˆμWME(t )−μ(t )| =Op(rμ)withrμ=h2μ+1/9
nN hμ+1/√ n.
(b) sup(s,t )∈T2|ˆcWME(s, t )−cX(s, t )| =Op(rμ+rc)withrc =h2c+1/9
nMh2c+ 1/√
n.
Proof of Theorem1The following results can be derived from the theory developed by Bhatia et al. (1983):
supj≥1|ˆλNME
j −λj| ≤ ˆCNME
X −CX ,
supj≥1δj ˆφjNME
−φj ≤81/2 ˆCNME
X −CX . (13)
Therefore, we obtain from Lemma1, supj≥1|ˆλNME
j −λj| =Op(n−1/2), supj≥1δj ˆφNME
j −φj =Op(n−1/2). (14)
Note that,
T(γˆNME(s)−γ (s))2ds
=
T
⎧⎨
⎩
m−1
j=1
⎡
⎣n−1#n
i=1[YiUˆNME
ij ]
ˆ λNME
j
φˆNME
j (s)−E[Y Uj] λj
φj(s)
⎤
⎦
⎫⎬
⎭
2
ds
+
T
⎧⎨
⎩ ∞ j=m
E[Y Uj] λj φj(s)
⎫⎬
⎭
2
ds
+2
T
⎧⎨
⎩
m−1
j=1
⎡
⎣n−1#n
i=1[YiUˆNME
ij ]
ˆ λNME
j
ˆ φNME
j (s)−E[Y Uj] λj φj(s)
⎤
⎦
⎫⎬
⎭
⎧⎨
⎩ ∞ j=m
E[Y Uj] λj φj(s)
⎫⎬
⎭ds
:=A1(n)+A2(n)+A3(n). (15)
For simplicity, we suppress the notation on “NME.” Assumption (A6) implies thatA2(n) → 0 asm→ ∞. ForA3(n), Cauchy–Schwarz inequality implies that A23(n) ≤A21(n)×A22(n)→p 0. Combing the result (14), and the formula (15), we see that the result of the theorem follows if we can get the convergence rate ofUˆij of the trajectories per subject withUˆij = ˆUij Oi+ ˆUij M(α)
i.
Denote the estimates of Uij Mi, COiOi, COiMi, φj Mi as Uˆij Mi(−i), CˆOiOi(−i), CˆOiMi(−i),φˆj Mi(−i) with deleting the ith curves Xi(t ). Let ξ˜ij M(ρ)
i = (C(ρ)O
iOi)−1 COiMiφj Mi withCO(ρ)
iOi =COiOi+ρFOi,U˜ij M(ρ)
i = ˜ξij M(ρ)
i, XiOi, and the notation ξ˜ij MiU˜ij Mi are corresponded to the symbolsξ˜ij M(ρ)
i,U˜ij M(ρ)
i withρ=0. Since EUˆij M(ρ)
i − ˜Uij Mi2= EUˆij M(ρ)
i− ˜Uij M(ρ)
i+ ˜Uij M(ρ)
i− ˜Uij Mi2
= 2EUˆij M(ρ)
i− ˜Uij M(ρ)
i
2+2U˜ij M(ρ)
i− ˜Uij Mi2
≤ 4EUˆij M(ρ)
i− ˆUij M(ρ)
i(−i)
2+4EUˆij M(ρ)
i(−i)− ˜Uij M(ρ)
i
2 +2U˜ij M(ρ)
i− ˜Uij Mi2, (16)
we then analyze the terms E ˆUij M(ρ)
i− ˆUij M(ρ)
i(−i) 2, E ˆUij M(ρ)
i(−i)− ˜Uij M(ρ)
i
2, ˜Uij M(ρ)
i−
U˜ij Mi 2in turn. Letξˆij M(ρ)
i(−i)=(CˆO(ρ)
iOi(−i))−1CˆOiMi(−i)φˆj Mi(−i). Then E ˆUij M(ρ)
i(−i)− ˜Uij M(ρ)
i
2
= Eˆξij M(ρ)
i(−i)− ˜ξij M(ρ)
i, XiOi2
= E{E[ˆξij M(ρ)
i(−i)− ˜ξij M(ρ)
i, XiOi2|{XkOi, k=i}]}
= E||CO1/2
iOi((CˆO(ρ)
iOi(−i))−1CˆOiMi(−i)φˆj Mi(−i)−(CO(ρ)
iOi)−1COiMiφj Mi)||2
≤ 4
E||CO1/2
iOi(CˆO(ρ)
iOi(−i))−1(CˆOiMi(−i)−COiMi)(φˆj Mi(−i)−φj Mi)||2 +E||CO1/2
iOi(Cˆ(ρ)O
iOi(−i))−1COiMi(φˆj Mi(−i)−φj Mi)||2 +E||CO1/2
iOi(Cˆ(ρ)O
iOi(−i))−1(CˆOiMi(−i)−COiMi)φj Mi||2 +E||CO1/2
iOi((CˆO(ρ)
iOi(−i))−1−(CO(ρ)
iOi)−1)COiMiφj Mi||2
:=B1+B2+B3+B4. (17)
LetFm= {λ2m <λˆm< 32λm}. Suppose the eventFmholds. Otherwise, we have P(|ˆλm−λm| ≥ λ2m)≤P( ˆCNME
X −CX ≥ λ2m)→ 0 from assumption (A4). We have the following results for termsB1toB4with the equality
CˆO(ρ)
iOi(−i)
−1
− CO(ρ)
iOi
−1
=(CˆOiOi(−i)−COiOi)
CO(ρ)
iOi
−1 CˆO(ρ)
iOi(−i)
−1
. For the termB1,
B1≤E1 CO1/2
iOi
22·
ˆ CO(ρ)
iOi(−i)
−1 2
∞·CˆOiMi(−i)−COiMi22·φˆj Mi(−i)−φj Mi2 2
=O n−2δ−j2
·O(ρ−2).
Denote · ∞as the operator norm. For the termB2, under the assumption (A7), E CO1/2
iOi 2
∞<∞and the result (14), it is clear that B2≤E
1CO1/2
iOi
2
∞·
CˆO(ρ)
iOi(−i)
−1
COiMi 2
2
·φˆj Mi(−i)−φj Mi2 2
≤
j
k
rM2
iOij k
(λOiOik+ρ)2·O
n−1δj−2 =O
n−1δ−j2
.
For the termB3, B3≤E
3 CO1/2
iOi 2 2· (CˆO(ρ)
iOi(−i))−1 2∞· ˆCOiMi(−i)−COiMi 22· φj Mi 2 4
=O(n−1ρ−2).
Note that (λρλOi Oi k
Oi Oi k+ρ)2 <1. Under the assumption (A7), we have that
B4≤E3 CO1/2
iOi·(CO(ρ)
iOi)−1·(Cˆ(ρ)O
iOi(−i))−1·COiMi 22· ˆCOiOi(−i)−COiOi 22· φj Mi 24
≤
⎧⎨
⎩
j
k
ρλOiOik
(λOiOik+ρ)2· rOiMij k2 (λOiOik+ρ)2·ρ−1
⎫⎬
⎭·O(n−1)
=O(n−1)·O(ρ−1).
These results combined with (17) indicate E ˆUij M(ρ)
i(−i)− ˜Uij M(ρ)
i
2=O
n−1ρ−2+n−1δ−j2
. (18)
We then analyze E ˆUij M(ρ)
i− ˆUij M(ρ)
i(−i) 2, E ˆUij M(ρ)
i− ˆUij M(ρ)
i(−i) = Eˆξij M(ρ)
i− ˆξij M(ρ)
i(−i), XiOi
≤ {E ˆξij M(ρ)
i− ˆξij M(ρ)
i(−i)
2}1/2{E XiOi 2}1/2
≤ L{E ˆξij M(ρ)
i− ˆξij M(ρ)
i(−i)
2}1/2, (19)
where the last inequality holds from the finite second moment ofXthat is bounded by constantL. We also have,
E ˆξij M(ρ)
i− ˆξij M(ρ)
i(−i)
2= E (CˆO(ρ)
iOi)−1CˆOiMi−(Cˆ(ρ)O
iOi(−i))−1CˆOiMi(−i)
φˆj Mi(−i) 2
= E 3
(Cˆ(ρ)O
iOi)−1−(CˆO(ρ)
iOi(−i))−1 CˆOiMi
+(Cˆ(ρ)O
iOi(−i))−1(CˆOiMi− ˆCOiMi(−i))
4φˆj Mi(−i) 2
≤ 2 E
(Cˆ(ρ)O
iOi)−1−(CˆO(ρ)
iOi(−i))−1
CˆOiMi 2
+E (CˆO(ρ)
iOi(−i))−1(CˆOiMi− ˆCOiMi(−i)) 2}. (20) Note that
E ˆCOiMi− ˆCOiMi(−i) 2=O(n−2),
E (CˆO(ρ)
iOi)−1−(CˆO(ρ)
iOi(−i))−1
CˆOiMi 2=O(n−2),
E (CˆO(ρ)
iOi(−i))−1(CˆOiMi− ˆCOiMi(−i)) 2=O(n−2ρ−2).
Combining formulas (19) and (20), we deduce that
E ˆUij M(ρ)
i− ˆUij M(ρ)
i(−i)
2=O(n−2ρ−2). (21)
On the other hand,
E ˜Uij M(ρ)
i− ˜Uij Mi 2=O(ρ), (22)
var(U˜ij Mi−Uij Mi)= φj Mi, CMiMiφj Mi − φj Mi, CMiOiCO−1
iOiCOiMiφj Mi
:=Vij. (23)
Therefore, withnρ3→0 and the formulas (16), (18), (21)–(23), we have that E ˆUij M(ρ)
i−Uij Mi 2=O
n−1ρ−2+n−1δj−2+Vij
. Then the results are proved withnρ3→0.
Proof of Theorem 2 Let U˜i = (U˜i1,· · · ,U˜im)T, Ui = (Ui1,· · · , Uim)T. The covariance matrix of U˜i is var(Ui) = ΞΣ−Z1
iΞT with Ξ = cov(U˜i,Zi) =
(λ1φi1,· · ·, λmφim)T. Moreover, var(U˜i−Ui)=Λ−ΞΣZiΞT. Combining these results with formulas (14), (12) and the results of Lemma2, the result of Theorem 3 is obtained by replacingUˆijNMEwithUˆijWMEin (15) with assumptions (B1)–(B6).
References
Besse, P., & Ramsay, J. O. (1986). Principal components analysis of sampled functions.Psychome- trika, 51(2), 285–311.
Bhatia, R., Davis, C., & McIntosh, A. (1983). Perturbation of spectral subspaces and solution of linear operator equations.Linear Algebra and its Applications, 52, 45–67.
Cardot, H., Ferraty, F., & Sarda, P. (1999). Functional linear model.Statistics & Probability Letters, 45(1), 11–22.
Che, M., Kong, L., Bell, R. C., & Yuan, Y. (2017). Trajectory modeling of gestational weight: A functional principal component analysis approach.PloS One, 12(10), e0186761.
Crambes, C., Kneip, A., & Sarda, P. (2009). Smoothing splines estimators for functional linear regression.The Annals of Statistics, 37(1), 35–72.
Delaigle, A., & Hall, P. (2016). Approximating fragmented functional data by segments of Markov chains.Biometrika, 103(4), 779–799.
Goldberg, Y., Ritov, Y., & Mandelbaum, A. (2014). Predicting the continuation of a function with applications to call center data.Journal of Statistical Planning and Inference, 147, 53–65.
Hall, P., & Horowitz, J. L. (2007). Methodology and convergence rates for functional linear regression.The Annals of Statistics, 35(1), 70–91.
Hall, P., Müller, H.-G., & Wang, J.-L. (2006). Properties of principal component methods for functional and longitudinal data analysis.The Annals of Statistics, 34(3), 1493–1517.
Hansen, P. C. (1990). The discrete Picard condition for discrete ill-posed problems.BIT Numerical Mathematics, 30(4), 658–672.
Horváth, L., & Kokoszka, P. (2012).Inference for functional data with applications, vol. 200.
Berlin: Springer.
James, G. M., Hastie, T. J., & Sugar, C. A. (2000). Principal component models for sparse functional data.Biometrika, 87(3), 587–602.
Kneip, A., & Liebl, D. (2020). On the optimal reconstruction of partially observed functional data.
Annals of Statistics, 48(3), 1692–1717.
Kraus, D. (2015). Components and completion of partially observed functional data.Journal of the Royal Statistical Society: Series B (Statistical Methodology), 77(4), 777–801.
Li, Y., & Hsing, T. (2010). Uniform convergence rates for nonparametric regression and principal component analysis in functional/longitudinal data.The Annals of Statistics, 38(6), 3321–3351.
Liebl, D. (2013). Modeling and forecasting electricity spot prices: A functional data perspective.
The Annals of Applied Statistics, 7(3), 1562–1592.
Liebl, D., & Rameseder, S. (2019). Partially observed functional data: The case of systematically missing parts.Computational Statistics and Data Analysis, 131, 104–115.
Marx, B. D. & Eilers, P. H. (1999). Generalized linear regression on sampled signals and curves: a p-spline approach.Technometrics, 41(1), 1–13.
Morris, J. S. (2015). Functional regression.Annual Review of Statistics and Its Application, 2, 321–359.
Ramsay, J. (2005). Functional data analysis. In B. S. Everitt & D. C. Howell (Eds.)Encyclopedia of Statistics in Behavioral Science(Vol. 2. pp. 675–678). Chichester: John Wiley & Sons Ltd.
Ramsay, J. O., & Dalzell, C. (1991). Some tools for functional data analysis.Journal of the Royal Statistical Society: Series B (Methodological), 53(3), 539–561.
Reiss, P. T., Goldsmith, J., Shang, H. L., & Ogden, R. T. (2017). Methods for scalar-on-function regression.International Statistical Review, 85(2), 228–249.
Rice, J. A., & Silverman, B. W. (1991). Estimating the mean and covariance structure non- parametrically when the data are curves.Journal of the Royal Statistical Society: Series B (Methodological), 53(1), 233–243.
Riesz, F., & Nagy, S. (1955). B.(1990). functional analysis.Dover Publications, Inc., New York.
First published in, 3(6), 35.
Shang, H. L. (2014). A survey of functional principal component analysis.AStA Advances in Statistical Analysis, 98(2), 121–142.
Shin, H. (2009). Partial functional linear regression.Journal of Statistical Planning and Inference, 139(10), 3405–3418.
Staniswalis, J. G., & Lee, J. J. (1998). Nonparametric regression analysis of longitudinal data.
Journal of the American Statistical Association, 93(444), 1403–1418.
Wang, Y., Kong, L., Jiang, B., Zhou, X., Yu, S., Zhang, L., & Heo, G. (2019). Wavelet-based lasso in functional linear quantile regression.Journal of Statistical Computation and Simulation, 89(6), 1111–1130.
Yao, F., Müller, H.-G., & Wang, J.-L. (2005a). Functional data analysis for sparse longitudinal data.Journal of the American Statistical Association, 100(470), 577–590.
Yao, F., Müller, H.-G., & Wang, J.-L. (2005b). Functional linear regression analysis for longitudi- nal data.The Annals of Statistics, 33(6), 2873–2903.
Yu, D., Kong, L., & Mizera, I. (2016). Partial functional linear quantile regression for neuroimaging data analysis.Neurocomputing, 195, 74–87.
Zhao, Y., Ogden, R. T., & Reiss, P. T. (2012). Wavelet-based lasso in functional linear regression.
Journal of computational and graphical statistics, 21(3), 600–617.
Zhu, H., Li, R., & Kong, L. (2012). Multivariate varying coefficient model for functional responses.
Annals of Statistics, 40(5), 2634–2666.