Confidence Intervals and Hypothesis Testing for High-Dimensional Statistical Models

Adel Javanmard; Andrea Montanari

2013 NIPS NeurIPS 2013

Confidence Intervals and Hypothesis Testing for High-Dimensional Statistical Models

Abstract

Fitting high-dimensional statistical models often requires the use of non-linear parameter estimation procedures. As a consequence, it is generally impossible to obtain an exact characterization of the probability distribution of the parameter estimates. This in turn implies that it is extremely challenging to quantify the uncertainty' associated with a certain parameter estimate. Concretely, no commonly accepted procedure exists for computing classical measures of uncertainty and statistical significance as confidence intervals or p-values. We consider here a broad class of regression problems, and propose an efficient algorithm for constructing confidence intervals and p-values. The resulting confidence intervals have nearly optimal size. When testing for the null hypothesis that a certain parameter is vanishing, our method has nearly optimal power. Our approach is based on constructing ade-biased' version of regularized M-estimators. The new construction improves over recent work in the field in that it does not assume a special structure on the design matrix. Furthermore, proofs are remarkably simple. We test our method on a diabetes prediction problem.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — confidence intervals

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy

📈 Trend Setter — Uncertainty Quantification

🐣 Hot Topic Early Bird — hypothesis testing

Authors

Adel Javanmard , Andrea Montanari

Topics

Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Optimization & Theory > Statistical Learning Mathematics & Optimization > Mathematics > Statistics Machine Learning > Optimization & Theory > Statistics Machine Learning > Learning Types > Uncertainty Quantification Mathematics & Optimization > Statistics > Statistics

Keywords

statistical inference high-dimensional statistics hypothesis testing high-dimensional regression m-estimators statistical significance confidence interval regularized estimator

Download PDF

Related papers

Latent Structured Active Learning 2013

On Flat versus Hierarchical Classification in Large-Scale Taxonomies 2013

Generalized Method-of-Moments for Rank Aggregation 2013

Third-Order Edge Statistics: Contour Continuation, Curvature, and Cortical Connections 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent 2013