Inference and Prediction in Large Dimensions

Author: Denis Bosq

Publisher: John Wiley & Sons

Published: 2008-03-11

Total Pages: 336

ISBN-13: 9780470724026

This book offers predominantly theoretical coverage of statistical prediction, with some potential applications discussed, when data and/or parameters belong to a large or infinite-dimensional space. It develops the theory of statistical prediction, non-parametric estimation by adaptive projection (with applications to tests of fit and prediction), and the theory of linear processes in function spaces, with applications to prediction of continuous-time processes. This work is in the Wiley-Dunod Series, co-published by Dunod (www.dunod.com) and John Wiley and Sons, Ltd.

Large-Scale Inference

Author: Bradley Efron

Publisher: Cambridge University Press

Published: 2012-11-29

Total Pages:

ISBN-13: 1139492136

We live in a new age for statistical inference, where modern scientific technology such as microarrays and fMRI machines routinely produce thousands and sometimes millions of parallel data sets, each with its own estimation or testing problem. Doing thousands of problems at once is more than repeated application of classical methods. Taking an empirical Bayes approach, Bradley Efron, inventor of the bootstrap, shows how information accrues across problems in a way that combines Bayesian and frequentist ideas. Estimation, testing and prediction blend in this framework, producing opportunities for new methodologies of increased power. New difficulties also arise, easily leading to flawed inferences. This book takes a careful look at both the promise and pitfalls of large-scale statistical inference, with particular attention to false discovery rates, the most successful of the new statistical techniques. Emphasis is on the inferential ideas underlying technical developments, illustrated using a large number of real examples.

High-Dimensional Covariance Estimation

Author: Mohsen Pourahmadi

Publisher: John Wiley & Sons

Published: 2013-05-28

Total Pages: 204

ISBN-13: 1118573668

Methods for estimating sparse and large covariance matrices. Covariance and correlation matrices play fundamental roles in every aspect of the analysis of multivariate data collected from a variety of fields including business and economics, health care, engineering, and environmental and physical sciences. High-Dimensional Covariance Estimation provides accessible and comprehensive coverage of the classical and modern approaches for estimating covariance matrices as well as their applications to the rapidly developing areas lying at the intersection of statistics and machine learning. Recently, the classical sample covariance methodologies have been modified and improved upon to meet the needs of statisticians and researchers dealing with large correlated datasets. High-Dimensional Covariance Estimation focuses on the methodologies based on shrinkage, thresholding, and penalized likelihood with applications to Gaussian graphical models, prediction, and mean-variance portfolio management. The book relies heavily on regression-based ideas and interpretations to connect and unify many existing methods and algorithms for the task. High-Dimensional Covariance Estimation features chapters on: Data, Sparsity, and Regularization; Regularizing the Eigenstructure; Banding, Tapering, and Thresholding; Covariance Matrices; Sparse Gaussian Graphical Models; Multivariate Regression. The book is an ideal resource for researchers in statistics, mathematics, business and economics, computer sciences, and engineering, as well as a useful text or supplement for graduate-level courses in multivariate analysis, covariance estimation, statistical learning, and high-dimensional data analysis.

Estimation and Conditional Inference in High-dimensional Statistical Models

Author: Arend L. Voorman

Publisher:

Published: 2014

Total Pages: 117

ISBN-13:

In many areas of biology, recent advances in technology have facilitated the measurement of large numbers of features, while the number of observations in a data set may remain relatively modest. In this setting, lasso regression and related procedures have been extensively studied for prediction, while the problem of inference is relatively less studied. Most inference in high dimensions is based on simple marginal associations between variables. However, a richer characterization of the associations between variables can be obtained by examining conditional relationships, which account for the joint behavior of the variables. Inference on conditional relationships is more difficult, because it requires one to specify how features are related to one another, to estimate these relationships, and to characterize the uncertainty in the estimation procedure. In Chapters 2 and 3, we explore a few methods for testing hypotheses about conditional relationships in the high-dimensional setting. In Chapter 4, we note some strong distributional assumptions implicit in many treatments of high-dimensional graphical models, and propose a modification which treats this issue.

Sample Size Determination and Power

Author: Thomas P. Ryan

Publisher: John Wiley & Sons

Published: 2013-05-28

Total Pages: 230

ISBN-13: 1118439228

A comprehensive approach to sample size determination and power with applications for a variety of fields. Sample Size Determination and Power features a modern introduction to the applicability of sample size determination and provides a variety of discussions on broad topics including epidemiology, microarrays, survival analysis and reliability, design of experiments, regression, and confidence intervals. The book distinctively merges applications from numerous fields such as statistics, biostatistics, the health sciences, and engineering in order to provide a complete introduction to the general statistical use of sample size determination. Advanced topics including multivariate analysis, clinical trials, and quality improvement are addressed, and in addition, the book provides considerable guidance on available software for sample size determination. Written by a well-known author who has extensively class-tested the material, Sample Size Determination and Power: highlights the applicability of sample size determination and provides extensive literature coverage; presents a modern, general approach to relevant software to guide sample size determination, including CATD (computer-aided trial design); and addresses the use of sample size determination in grant proposals and provides up-to-date references for grant investigators. An appealing reference book for scientific researchers in a variety of fields, such as statistics, biostatistics, the health sciences, mathematics, ecology, and geology, who use sampling and estimation methods in their work, Sample Size Determination and Power is also an ideal supplementary text for upper-level undergraduate and graduate-level courses in statistical sampling.

Exploration and Analysis of DNA Microarray and Other High-Dimensional Data

Author: Dhammika Amaratunga

Publisher: John Wiley & Sons

Published: 2014-01-27

Total Pages: 320

ISBN-13: 111836452X

Praise for the First Edition: “...extremely well written...a comprehensive and up-to-date overview of this important field.” – Journal of Environmental Quality

Exploration and Analysis of DNA Microarray and Other High-Dimensional Data, Second Edition provides comprehensive coverage of recent advancements in microarray data analysis. A cutting-edge guide, the Second Edition demonstrates various methodologies for analyzing data in biomedical research and offers an overview of the modern techniques used in microarray technology to study patterns of gene activity. The new edition answers the need for an efficient outline of all phases of this revolutionary analytical technique, from preprocessing to the analysis stage. Utilizing research and experience from highly-qualified authors in fields of data analysis, Exploration and Analysis of DNA Microarray and Other High-Dimensional Data, Second Edition features: a new chapter on the interpretation of findings that includes a discussion of signatures and material on gene set analysis, including network analysis; new topics of coverage including ABC clustering, biclustering, partial least squares, penalized methods, ensemble methods, and enriched ensemble methods; and updated exercises to deepen knowledge of the presented material and provide readers with resources for further study. The book is an ideal reference for scientists in biomedical and genomics research fields who analyze DNA microarrays and protein array data, as well as statisticians and bioinformatics practitioners. Exploration and Analysis of DNA Microarray and Other High-Dimensional Data, Second Edition is also a useful text for graduate-level courses on statistics, computational biology, and bioinformatics.

Approximate Dynamic Programming

Author: Warren B. Powell

Publisher: John Wiley & Sons

Published: 2011-10-26

Total Pages: 573

ISBN-13: 111802916X

Praise for the First Edition: "Finally, a book devoted to dynamic programming and written using the language of operations research (OR)! This beautiful book fills a gap in the libraries of OR specialists and practitioners." – Computing Reviews

This new edition showcases a focus on modeling and computation for complex classes of approximate dynamic programming problems. Understanding approximate dynamic programming (ADP) is vital in order to develop practical and high-quality solutions to complex industrial problems, particularly when those problems involve making decisions in the presence of uncertainty. Approximate Dynamic Programming, Second Edition uniquely integrates four distinct disciplines (Markov decision processes, mathematical programming, simulation, and statistics) to demonstrate how to successfully approach, model, and solve a wide range of real-life problems using ADP. The book continues to bridge the gap between computer science, simulation, and operations research and now adopts the notation and vocabulary of reinforcement learning as well as stochastic search and simulation optimization. The author outlines the essential algorithms that serve as a starting point in the design of practical solutions for real problems. The three curses of dimensionality that impact complex problems are introduced and detailed coverage of implementation challenges is provided.
The Second Edition also features: a new chapter describing four fundamental classes of policies for working with diverse stochastic optimization problems (myopic policies, look-ahead policies, policy function approximations, and policies based on value function approximations); a new chapter on policy search that brings together stochastic search and simulation optimization concepts and introduces a new class of optimal learning strategies; updated coverage of the exploration-exploitation problem in ADP, now including a recently developed method for doing active learning in the presence of a physical state, using the concept of the knowledge gradient; and a new sequence of chapters describing statistical methods for approximating value functions, estimating the value of a fixed policy, and value function approximation while searching for optimal policies. The presented coverage of ADP emphasizes models and algorithms, focusing on related applications and computation while also discussing the theoretical side of the topic that explores proofs of convergence and rate of convergence. A related website features an ongoing discussion of the evolving fields of approximate dynamic programming and reinforcement learning, along with additional readings, software, and datasets. Requiring only a basic understanding of statistics and probability, Approximate Dynamic Programming, Second Edition is an excellent book for industrial engineering and operations research courses at the upper-undergraduate and graduate levels. It also serves as a valuable reference for researchers and professionals who utilize dynamic programming, stochastic programming, and control theory to solve problems in their everyday work.

Statistical Inference as Severe Testing

Author: Deborah G. Mayo

Publisher: Cambridge University Press

Published: 2018-09-20

Total Pages: 503

ISBN-13: 1108563309

Mounting failures of replication in social and biological sciences give a new urgency to critically appraising proposed reforms. This book pulls back the cover on disagreements between experts charged with restoring integrity to science. It denies two pervasive views of the role of probability in inference: to assign degrees of belief, and to control error rates in a long run. If statistical consumers are unaware of assumptions behind rival evidence reforms, they can't scrutinize the consequences that affect them (in personalized medicine, psychology, etc.). The book sets sail with a simple tool: if little has been done to rule out flaws in inferring a claim, then it has not passed a severe test. Many methods advocated by data experts do not stand up to severe scrutiny and are in tension with successful strategies for blocking or accounting for cherry picking and selective reporting. Through a series of excursions and exhibits, the philosophy and history of inductive inference come alive. Philosophical tools are put to work to solve problems about science and pseudoscience, induction and falsification.

Introduction to Imprecise Probabilities

Author: Thomas Augustin

Publisher: John Wiley & Sons

Published: 2014-06-03

Total Pages: 452

ISBN-13: 0470973811

In recent years, the theory has become widely accepted and has been further developed, but a detailed introduction is needed in order to make the material available and accessible to a wide audience. This will be the first book providing such an introduction, covering core theory and recent developments which can be applied to many application areas. All authors of individual chapters are leading researchers on the specific topics, assuring high quality and up-to-date contents. An Introduction to Imprecise Probabilities provides a comprehensive introduction to imprecise probabilities, including theory and applications reflecting the current state of the art. Each chapter is written by experts on the respective topics, including: Sets of desirable gambles; Coherent lower (conditional) previsions; Special cases and links to literature; Decision making; Graphical models; Classification; Reliability and risk assessment; Statistical inference; Structural judgments; Aspects of implementation (including elicitation and computation); Models in finance; Game-theoretic probability; Stochastic processes (including Markov chains); Engineering applications. Essential reading for researchers in academia, research institutes and other organizations, as well as practitioners engaged in areas such as risk analysis and engineering.

Nonparametric Analysis of Univariate Heavy-Tailed Data

Author: Natalia Markovich

Publisher: John Wiley & Sons

Published: 2008-03-11

Total Pages: 336

ISBN-13: 9780470723593

Heavy-tailed distributions are typical for phenomena in complex multi-component systems such as biometry, economics, ecological systems, sociology, web access statistics, internet traffic, bibliometrics, finance and business. The analysis of such distributions requires special methods of estimation due to their specific features. These are not only the slow decay to zero of the tail, but also the violation of Cramér’s condition, possible non-existence of some moments, and sparse observations in the tail of the distribution. The book focuses on methods for the statistical analysis of heavy-tailed independent identically distributed random variables from empirical samples of moderate size. It provides a detailed survey of classical results and recent developments in the theory of nonparametric estimation of the probability density function, the tail index, the hazard rate and the renewal function. Both asymptotic results, for example convergence rates of the estimates, and results for samples of moderate size supported by Monte Carlo investigation, are considered. The text is illustrated by the application of the considered methodologies to real data of web traffic measurements.