• Tidak ada hasil yang ditemukan

Appendix

B. Proofs of Main Theorems

Fadj,j(Xi(j ))FX(j )(Xk(j ))

S1+S2. (B.4)

First, we examineS1. SinceFadj,j(x)is the estimated cumulative distribution function with 0≤Fadj,j(x)≤1, then for anyiandk, we have that

Fadj,j(Xi(j ))Fadj,j(Xk(j ))≤1. (B.5) By the triangle inequality, we have that

F (Yi)F (Y k)− |F (Yi)F (Yk)| ≤F (Yi)F (Yi)+F (Yk)F (Yk). (B.6) Then by (B.6), we have that

P&

F (Yi)F (Y k)− |F (Yi)F (Yk)| > ξ

'

PF (Yi)F (Yi)+F (Yk)F (Yk)> ξ

P (

sup

t∈[0]

F (t )F (t )+ sup

t∈[0]

F (t )F (t )> ξ )

=P (

2 sup

t∈[0]

F (t )F (t )> ξ )

=P (

sup

t∈[0]F (t )F (t )>ξ 2

)

κ1exp

−1

42η4κ2

, (B.7)

where the last step is due to Lemma1withξin the right-hand side of (A.1) replaced byξ2. Therefore, combining (B.5) and (B.7) gives

PS1> ξ

κ1exp

−1

42η4κ2

. (B.8)

Next, we examineS2in a similar manner. Similar to (B.5), we have that

|F (Yi)F (Yk)| ≤1. (B.9)

Similar to the arguments for (B.7), we obtain that P&

Fadj,j(Xi(j ))FX(j )(Xk(j ))Fadj,j(Xi(j ))FX(j )(Xk(j )) > ξ

'

κ3exp

&

2

2G2 +o(n15) '

. (B.10)

Therefore, combining (B.9) and (B.10) yields P S2> ξ

κ3exp

&

2

2G2+o(n15) '

. (B.11)

Finally, combining (B.4), (B.8), and (B.11), the probabilistic bound ofMj,1Mj,1is given by

P Mj,1Mj,1> ξ

=PS1+S2> ξ

(B.12)

PS1+S2> ξ

PS1> ξ 2

+PS2 2

κ1exp

−1

162η4κ2

+κ3exp

&

2

8G2+o(n15) '

.

Furthermore, similar derivations show that P Mj,2Mj,2> ξ

κ1exp

−1

162η4κ2

+κ3exp

&

2

8G2+o(n15) '

(B.13) and

P Mj,3Mj,3> ξ

κ1exp

−1

162η4κ2

+κ3exp

&

2

8G2 +o(n15) '

. (B.14)

Noting that the upper bounds in (B.12)–(B.14) are dominated by exp

c2 for certain constantc, we apply (B.12)–(B.14) to (B.3) and obtain that

jωj> ξ =O

exp

c22

(B.15)

for somec2 > 0. Thus, combining (B.2) and (B.15) with (B.1) and specifying ξ =cnζ for the constantscandζ described in Condition (C4) yield the desired result.

Part 2 We prove(20).

LetJ =min

j∈Iωj−max

j∈Icωj. The left-hand side of (20) can be expressed as P

max

j∈Icωj≥min j∈Iωj

=P

max

j∈Icωj−max

j∈Icωj≥min

j∈Iωj−max j∈Icωj

=P

max

j∈Icωj−max

j∈Icωj≥min

j∈Iωj−min j∈Iωj+J

P

max

j∈Icωjωj+max

j∈IωjωjJ

P

2 max

j=1,···,pωjωjJ

=O

&

exp

−1 4Dnv20

' ,

where the last step comes from the result in Part 1 and Condition (C5).

B.2 Proof of Theorem2

Similar to the derivations of Li et al. (2012), one can obtain that

&

maxj∈Iωjωjcnζ '

⊆ I⊆I

. It gives

P I⊆I

≥1−P

max

j∈Iωjωjcnζ

≥1−qPωjωjcnζ

≥1−O

qexp

Dn12ζ

,

where the last step comes from Theorem1.

References

Carroll, R. J., Ruppert, D., Stefanski, L. A., & Crainiceanu, C. M. (2006).Measurement Error in Nonlinear Model. New York: CRC Press.

Chen, L.-P. (2019). Iterated feature screening based on distance correlation for ultrahigh- dimensional censored data with covariates measurement error. arXiv:1901.01610.

Chen, J., & Chen, Z. (2008). Extended Bayesian information criteria for model selection with large model spaces.Biometrika,95, 759–771.

Chen, L.-P., & Yi, G. Y. (2020). Model selection and model averaging for analysis of truncated and censored data with measurement error.Electronic Journal of Statistics,14, 4054–4109.

Chen, L.-P., & Yi, G. Y. (2021a). Analysis of noisy survival data with graphical proportional hazards measurement error models.Biometrics.

Chen, L.-P., & Yi, G. Y. (2021b). Semiparametric methods for left-truncated and right-censored survival data with covariate measurement error.Annals of the Institute of Statistical Mathemat- ics, 73, 481–517.

Chen, X., Chen, X., & Wang, H. (2018). Robust feature screening for ultra-high dimensional right censored data via distance correlation.Computational Statistics and Data Analysis,119, 118–

138.

Cui, H., Li, R., & Zhong, W. (2015). Model-free feature screening for ultrahigh dimensional discriminant analysis.Journal of the American Statistical Association,110, 630–641.

Dreiera, I., & Kotzb, S. (2002). A note on the characteristic function of the t-distribution.Statistics and Probability Letters,57, 221–224.

Fan, J., & Lv, J. (2008). Sure independence screening for ultrahigh dimensional feature space (with discussion).Journal of the Royal Statistical Society, Series B,70, 849–911.

Fan, J., & Song, R. (2010). Sure independence screening in generalized linear models with NP- dimensionality.The Annals of Statistics,38, 3567–3604.

Fan, J., Samworth, R., & Wu, Y. (2009). Ultrahigh dimensional feature selection: beyond the linear model.Journal of Machine Learning Research,10, 1829–1853.

Fan, J., Feng, Y., & Wu, Y. (2010). Ultrahigh dimensional variable selection for Cox’s proportional hazards model.IMS Collect,6, 70–86.

Földes, A., & Rejtö, L. (1981). A LIL type result for the product limit estimator.Z. Wahrschein- lichkeitstheorie verw. Gebiete,56, 75–86.

Hall, P., & Miller, H. (2009). Using generalized correlation to effect variable selection in very high dimensional problems.Journal of Computational and Graphical Statistics,18, 533–550.

Hao, M., Lin, Y., Liu, X., & Tang, W. (2019). Robust feature screening for high-dimensional survival data.Journal of Applied Statistics,46, 979–994.

Isaev, M., & McKay, B. D. (2016). On a bound of Hoeffding in the complex case.Electronic Communications in Probability,21, 1–7.

Li, R., Zhong, W., & Zhu, L. (2012). Feature screening via distance correlation learning.Journal of the American Statistical Association,107, 1129–1139.

Marsden, J. E., & Hoffman, M. J. (1999).Basic complex analysis. New York: W. H. Freeman.

Rosenwald, A., Wright, G., Chan, W. C., Connors, J. M., Campo, E., Fisher, R. I., Gascoyne, R.

D., Muller-Hermelink, H. K., Smeland, E. B., & Staudt, L. M. (2003). The proliferation gene expression signature is a quantitative integrator of oncogenic events that predicts survival in mantle cell lymphoma.Cancer Cell,3, 185–197.

Song, R., Lu, W., Ma, S., & Jeng, X. (2014). Censored rank independence screening for high- dimensional survival data.Biometrika, 101, 799–814.

Székely, G. J., Rizzo, M. L., & Bakirov, N. K. (2007). Measuring and testing dependence by correlation of distances.The Annals of Statistics,35, 2769–2794.

Wand, M.P. & Jones, M.C. (1995).Kernel Smoothing.Chapman & Hall, London.

Xue, J., & Liang, F. (2017). A robust model free feature screening method for ultrahigh dimensional data.Journal of Computational and Graphical Statistics,26, 803–813.

Yan, X., Tang, N., & Zhao, X. (2017). The Spearman rank correlation screening for ultrahigh dimensional censored data. arXiv:1702.02708v1.

Yi, G. Y. (2017). Statistical Analysis with Measurement Error and Misclassication: Strategy, Method and Application. Springer.

Yi, G.Y., He, W., & Caroll, R.J. (2021). Feature screening with large-scale and high-dimensional survival data.Biometrics.

Yi, G. Y., Ma, Y., Spiegelman, D., & Carroll, R. J. (2015). Functional and structural methods with mixed measurement error and misclassification in covariates.Journal of the American Statistical Association,110, 681–696.

Zhang, J., Liu, Y., & Cui, H. (2020). Model-free feature screening via distance correlation for ultrahigh dimensional survival data.Statistical Papers.

01210-3

Zhong, W., & Zhu, L. (2015). An iterative approach to distance correlation-based sure indepen- dence screening.Journal of Statistical Computation and Simulation,85, 2331–2345.

Zhu, L., Li, L., Li, R., & Zhu, L. (2011). Model-free feature screening for ultrahigh-dimensional data.Journal of the American Statistical Association,106, 1464–1475.

Rate and Sensitivity Using Least Angle