Methodology

New submissions for Tue, 30 Apr 24

[1]  arXiv:2404.17615 [pdf, ps, other]
Title: DeepVARMA: A Hybrid Deep Learning and VARMA Model for Chemical Industry Index Forecasting
Authors: Xiang Li, Hu Yang
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)

Since the chemical industry index is one of the important indicators to measure the development of the chemical industry, forecasting it is critical for understanding the economic situation and trends of the industry. Taking a multivariable nonstationary series, the synthetic material index, as the main research object, this paper proposes a new prediction model, DeepVARMA, and its variants Deep-VARMA-re and DeepVARMA-en, which combine LSTM and VARMAX models. The new model first uses a deep learning model such as the LSTM to remove the trends of the target time series and to learn the representation of endogenous variables, then uses the VARMAX model to predict the detrended target time series with the embeddings of endogenous variables, and finally combines the trend learned by the LSTM and the dependency learned by the VARMAX model to obtain the final predictive values. The experimental results show that (1) the new model achieves the best prediction accuracy by combining the LSTM encoding of the exogenous variables and the VARMAX model; (2) in multivariate non-stationary series prediction, DeepVARMA uses a phased processing strategy to show higher adaptability and accuracy compared to the traditional VARMA model as well as the machine learning models LSTM, RF and XGBoost; and (3) compared with smooth sequence prediction, the traditional VARMA and VARMAX models fluctuate more in predicting non-smooth sequences, while DeepVARMA shows more flexibility and robustness. This study provides more accurate tools and methods for future development and scientific decision-making in the chemical industry.

[2]  arXiv:2404.17734 [pdf, other]
Title: Manipulating a Continuous Instrumental Variable in an Observational Study of Premature Babies: Algorithm, Partial Identification Bounds, and Inference under Randomization and Biased Randomization Assumptions
Subjects: Methodology (stat.ME); Applications (stat.AP)

Regionalization of intensive care for premature babies refers to a triage system of mothers with high-risk pregnancies to hospitals of varied capabilities based on risks faced by infants. Due to the limited capacity of high-level hospitals, which are equipped with advanced expertise to provide critical care, understanding the effect of delivering premature babies at such hospitals on infant mortality for different subgroups of high-risk mothers could facilitate the design of an efficient perinatal regionalization system. Towards answering this question, Baiocchi et al. (2010) proposed to strengthen an excess-travel-time-based, continuous instrumental variable (IV) in an IV-based, matched-pair design by switching focus to a smaller cohort amenable to being paired with a larger separation in the IV dose. Three elements changed with the strengthened IV: the study cohort, compliance rate and latent complier subgroup. Here, we introduce a non-bipartite, template matching algorithm that embeds data into a target, pair-randomized encouragement trial which maintains fidelity to the original study cohort while strengthening the IV. We then study randomization-based and IV-dependent, biased-randomization-based inference of partial identification bounds for the sample average treatment effect (SATE) in an IV-based matched pair design, which deviates from the usual effect ratio estimand in that the SATE is agnostic to the IV and who is matched to whom, although a strengthened IV design could narrow the partial identification bounds. Based on our proposed strengthened-IV design, we found that delivering at a high-level NICU reduced preterm babies' mortality rate compared to a low-level NICU for $81,766 \times 2 = 163,532$ mothers and their preterm babies and the effect appeared to be minimal among non-black, low-risk mothers.

[3]  arXiv:2404.17763 [pdf, ps, other]
Title: Likelihood Based Inference in Fully and Partially Observed Exponential Family Graphical Models with Intractable Normalizing Constants
Subjects: Methodology (stat.ME); Computation (stat.CO); Machine Learning (stat.ML)

Probabilistic graphical models that encode an underlying Markov random field are fundamental building blocks of generative modeling to learn latent representations in modern multivariate data sets with complex dependency structures. Among these, the exponential family graphical models are especially popular, given their fairly well-understood statistical properties and computational scalability to high-dimensional data based on pseudo-likelihood methods. These models have been successfully applied in many fields, such as the Ising model in statistical physics and count graphical models in genomics. Another strand of models allows some nodes to be latent, so as to allow the marginal distribution of the observable nodes to depart from exponential family to capture more complex dependence. These approaches form the basis of generative models in artificial intelligence, such as the Boltzmann machines and their restricted versions. A fundamental barrier to likelihood-based (i.e., both maximum likelihood and fully Bayesian) inference in both fully and partially observed cases is the intractability of the likelihood. The usual workaround is via adopting pseudo-likelihood based approaches, following the pioneering work of Besag (1974). The goal of this paper is to demonstrate that full likelihood based analysis of these models is feasible in a computationally efficient manner. The chief innovation lies in using a technique of Geyer (1991) to estimate the intractable normalizing constant, as well as its gradient, for intractable graphical models. Extensive numerical results, supporting theory and comparisons with pseudo-likelihood based approaches demonstrate the applicability of the proposed method.

[4]  arXiv:2404.17772 [pdf, other]
Title: PWEXP: An R Package Using Piecewise Exponential Model for Study Design and Event/Timeline Prediction
Comments: 37 pages, 15 figures
Subjects: Methodology (stat.ME); Computation (stat.CO)

Parametric assumptions such as exponential distribution are commonly used in clinical trial design and analysis. However, violation of distribution assumptions can introduce biases in sample size and power calculations. The piecewise exponential (PWE) hazard model partitions the hazard function into segments, each with a constant hazard, and is easy to interpret and compute. Due to its piecewise property, PWE can fit a wide range of survival curves and accurately predict the future number of events and analysis time in event-driven clinical trials, thus enabling more flexible and reliable study designs. Compared with other existing approaches, the PWE model provides a superior balance of flexibility and robustness in model fitting and prediction. The proposed PWEXP package is designed for estimating and predicting PWE hazard models for right-censored data. By utilizing well-established criteria such as AIC, BIC, and cross-validation log-likelihood, the PWEXP package chooses the optimal number of change-points and determines the optimal position of change-points. With its particular goodness-of-fit, the PWEXP package provides accurate and robust hazard estimation, which can be used for reliable power calculation at study design and timeline prediction at study conduct. The package also offers visualization functions to facilitate the interpretation of survival curve fitting results.
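
As a rough illustration of the piecewise-constant hazard idea described in this abstract, the sketch below computes the cumulative hazard and survival function for given change-points and segment hazards. It is written in Python and is not the PWEXP package's R interface; the function names `pwe_cumulative_hazard` and `pwe_survival` and the example change-points and rates are hypothetical, chosen only for illustration.

```python
import math


def pwe_cumulative_hazard(t, breakpoints, hazards):
    """Cumulative hazard H(t) under a piecewise-constant hazard.

    breakpoints: increasing change-points c_1 < ... < c_k (all > 0).
    hazards: k + 1 constant hazard rates, one per segment
             [0, c_1), [c_1, c_2), ..., [c_k, infinity).
    """
    if t < 0:
        raise ValueError("t must be non-negative")
    if len(hazards) != len(breakpoints) + 1:
        raise ValueError("need exactly one hazard per segment")
    edges = [0.0] + list(breakpoints)
    total = 0.0
    for i, rate in enumerate(hazards):
        start = edges[i]
        end = edges[i + 1] if i + 1 < len(edges) else math.inf
        if t <= start:
            break
        # Time spent in this segment up to t, multiplied by its hazard.
        total += rate * (min(t, end) - start)
    return total


def pwe_survival(t, breakpoints, hazards):
    """Survival function S(t) = exp(-H(t))."""
    return math.exp(-pwe_cumulative_hazard(t, breakpoints, hazards))


if __name__ == "__main__":
    # Hypothetical example: change-points at t = 1 and t = 3.
    cps, rates = [1.0, 3.0], [0.5, 0.2, 0.1]
    for t in (0.5, 2.0, 5.0):
        print(t, round(pwe_survival(t, cps, rates), 4))
```

With no change-points and a single rate, the functions reduce to the ordinary exponential model, which is the special case the abstract contrasts against.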

[5]  arXiv:2404.17792 [pdf, other]
Title: A General Framework for Random Effects Models for Binary, Ordinal, Count Type and Continuous Dependent Variables Including Variable Selection
Authors: Gerhard Tutz
Subjects: Methodology (stat.ME)

A general random effects model is proposed that allows for continuous as well as discrete distributions of the responses. Responses can be unrestricted continuous, bounded continuous, binary, ordered categorical or given in the form of counts. The distribution of the responses is not restricted to exponential families, which is a severe restriction in generalized mixed models. Generalized mixed models use fixed distributions for responses, for example the Poisson distribution in count data, which has the disadvantage of not accounting for overdispersion. By using a response function and a thresholds function, the proposed mixed thresholds model can account for a variety of alternative distributions that often show better fits than fixed distributions used within the generalized linear model framework. A particular strength of the model is that it provides a tool for joint modeling: responses may be of different types, some discrete and others continuous. In addition to introducing the mixed thresholds model, parameter sparsity is addressed. Random effects models can contain a large number of parameters, in particular if effects have to be assumed as measurement-specific. Methods to obtain sparser representations are proposed and illustrated. The methods are shown to work in the thresholds model but could also be adapted to other modeling approaches.

[6]  arXiv:2404.18000 [pdf, other]
Title: Thinking inside the bounds: Improved error distributions for indifference point data analysis and simulation via beta regression using common discounting functions
Subjects: Methodology (stat.ME)

Standard nonlinear regression is commonly used when modeling indifference points due to its ability to closely follow observed data, resulting in a good model fit. However, standard nonlinear regression currently lacks a reasonable distribution-based framework for indifference points, which limits its ability to adequately describe the inherent variability in the data. Software commonly assumes data follow a normal distribution with constant variance. However, typical indifference points do not follow a normal distribution or exhibit constant variance. To address these limitations, this paper introduces a class of nonlinear beta regression models that offers excellent fit to discounting data and enhances simulation-based approaches. This beta regression model can accommodate popular discounting functions. This work proposes three specific advances. First, our model automatically captures non-constant variance as a function of delay. Second, our model improves simulation-based approaches since it obeys the natural boundaries of observable data, unlike the ordinary assumption of normal residuals and constant variance. Finally, we introduce a scale-location-truncation trick that allows beta regression to accommodate observed values of zero and one. A comparison between beta regression and standard nonlinear regression reveals close agreement in the estimated discounting rate k obtained from both methods.

[7]  arXiv:2404.18197 [pdf, other]
Title: A General Causal Inference Framework for Cross-Sectional Observational Data
Comments: 19 pages, 7 figures
Subjects: Methodology (stat.ME); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Causal inference methods for observational data are highly regarded due to their wide applicability. While there are already numerous methods available for de-confounding bias, these methods generally assume that covariates consist solely of confounders or make naive assumptions about the covariates. Such assumptions face challenges in both theory and practice, particularly when dealing with high-dimensional covariates. Relaxing these naive assumptions and identifying the confounding covariates that truly require correction can effectively enhance the practical significance of these methods. Therefore, this paper proposes a General Causal Inference (GCI) framework specifically designed for cross-sectional observational data, which precisely identifies the key confounding covariates and provides a corresponding identification algorithm. Specifically, based on progressive derivations of the Markov property on directed acyclic graphs, we conclude that the key confounding covariates are equivalent to the common root ancestors of the treatment and the outcome variable. Building upon this conclusion, the GCI framework is composed of a novel Ancestor Set Identification (ASI) algorithm and de-confounding inference methods. First, the ASI algorithm is theoretically supported by the conditional independence properties and causal asymmetry between variables, enabling the identification of key confounding covariates. Subsequently, the identified confounding covariates are used in the de-confounding inference methods to obtain unbiased causal effect estimation, which can support informed decision-making. Extensive experiments on synthetic datasets demonstrate that the GCI framework can effectively identify the critical confounding covariates and significantly improve the precision, stability, and interpretability of causal inference in observational studies.

[8]  arXiv:2404.18232 [pdf, other]
Title: A cautious approach to constraint-based causal model selection
Authors: Daniel Malinsky
Subjects: Methodology (stat.ME)

We study the data-driven selection of causal graphical models using constraint-based algorithms, which determine the existence or non-existence of edges (causal connections) in a graph based on testing a series of conditional independence hypotheses. In settings where the ultimate scientific goal is to use the selected graph to inform estimation of some causal effect of interest (e.g., by selecting a valid and sufficient set of adjustment variables), we argue that a "cautious" approach to graph selection should control the probability of falsely removing edges and prefer dense, rather than sparse, graphs. We propose a simple inversion of the usual conditional independence testing procedure: to remove an edge, test the null hypothesis of conditional association greater than some user-specified threshold, rather than the null of independence. This equivalence testing formulation to testing independence constraints leads to a procedure with desirable statistical properties and behaviors that better match the inferential goals of certain scientific studies, for example observational epidemiological studies that aim to estimate causal effects in the face of causal model uncertainty. We illustrate our approach on a data example from environmental epidemiology.

[9]  arXiv:2404.18256 [pdf, other]
Title: Semiparametric causal mediation analysis in cluster-randomized experiments
Authors: Chao Cheng, Fan Li
Subjects: Methodology (stat.ME)

In cluster-randomized experiments, there is emerging interest in exploring the causal mechanism in which a cluster-level treatment affects the outcome through an intermediate outcome. Despite an extensive development of causal mediation methods in the past decade, only a few exceptions have been considered in assessing causal mediation in cluster-randomized studies, all of which depend on parametric model-based estimators. In this article, we develop the formal semiparametric efficiency theory to motivate several doubly-robust methods for addressing several mediation effect estimands corresponding to both the cluster-average and the individual-level treatment effects in cluster-randomized experiments--the natural indirect effect, natural direct effect, and spillover mediation effect. We derive the efficient influence function for each mediation effect, and carefully parameterize each efficient influence function to motivate practical strategies for operationalizing each estimator. We consider both parametric working models and data-adaptive machine learners to estimate the nuisance functions, and obtain semiparametric efficient causal mediation estimators in the latter case. Our methods are illustrated via extensive simulations and two completed cluster-randomized experiments.

[10]  arXiv:2404.18370 [pdf, other]
Title: Out-of-distribution generalization under random, dense distributional shifts
Subjects: Methodology (stat.ME)

Many existing approaches for estimating parameters in settings with distributional shifts operate under an invariance assumption. For example, under covariate shift, it is assumed that p(y|x) remains invariant. We refer to such distribution shifts as sparse, since they may be substantial but affect only a part of the data generating system. In contrast, in various real-world settings, shifts might be dense. More specifically, these dense distributional shifts may arise through numerous small and random changes in the population and environment. First, we will discuss empirical evidence for such random dense distributional shifts and explain why commonly used models for distribution shifts, including adversarial approaches, may not be appropriate under these conditions. Then, we will develop tools to infer parameters and make predictions for partially observed, shifted distributions. Finally, we will apply the framework to several real-world data sets and discuss diagnostics to evaluate the fit of the distributional uncertainty model.

[11]  arXiv:2404.18377 [pdf, other]
Title: Inference for the panel ARMA-GARCH model when both $N$ and $T$ are large
Authors: Bing Su, Ke Zhu
Subjects: Methodology (stat.ME)

We propose a panel ARMA-GARCH model to capture the dynamics of large panel data with $N$ individuals over $T$ time periods. For this model, we provide a two-step procedure that estimates the ARMA parameters and the GARCH parameters in turn. Under some regularity conditions, we show that all of the proposed estimators are asymptotically normal with the convergence rate $(NT)^{-1/2}$, and that they have asymptotic biases when both $N$ and $T$ diverge to infinity at the same rate. In particular, we find that the asymptotic biases result from the fixed effect, estimation effect, and unobservable initial values. To correct the biases, we further propose bias-corrected versions of the estimators by using either the analytical asymptotics or the jackknife method. Our asymptotic results are based on a new central limit theorem for the linear-quadratic form in the martingale difference sequence, when the weight matrix is uniformly bounded in row and column. Simulations and one real example are given to demonstrate the usefulness of our panel ARMA-GARCH model.

[12]  arXiv:2404.18421 [pdf, other]
Title: Semiparametric mean and variance joint models with Laplace link functions for count time series
Subjects: Methodology (stat.ME); Statistics Theory (math.ST)

Count time series data are frequently analyzed by modeling their conditional means and the conditional variance is often considered to be a deterministic function of the corresponding conditional mean and is not typically modeled independently. We propose a semiparametric mean and variance joint model, called random rounded count-valued generalized autoregressive conditional heteroskedastic (RRC-GARCH) model, to address this limitation. The RRC-GARCH model and its variations allow for the joint modeling of both the conditional mean and variance and offer a flexible framework for capturing various mean-variance structures (MVSs). One main feature of this model is its ability to accommodate negative values for regression coefficients and autocorrelation functions. The autocorrelation structure of the RRC-GARCH model using the proposed Laplace link functions with nonnegative regression coefficients is the same as that of an autoregressive moving-average (ARMA) process. For the new model, the stationarity and ergodicity are established and the consistency and asymptotic normality of the conditional least squares estimator are proved. Model selection criteria are proposed to evaluate the RRC-GARCH models. The performance of the RRC-GARCH model is assessed through analyses of both simulated and real data sets. The results indicate that the model can effectively capture the MVS of count time series data and generate accurate forecast means and variances.

[13]  arXiv:2404.18678 [pdf, other]
Title: Sequential model confidence sets
Subjects: Methodology (stat.ME)

In most prediction and estimation situations, scientists consider various statistical models for the same problem, and naturally want to select amongst the best. Hansen et al. (2011) provide a powerful solution to this problem by the so-called model confidence set, a subset of the original set of available models that contains the best models with a given level of confidence. Importantly, model confidence sets respect the underlying selection uncertainty by being flexible in size. However, they presuppose a fixed sample size, which stands in contrast to the fact that model selection and forecast evaluation are inherently sequential tasks where we successively collect new data and where the decision to continue or conclude a study may depend on the previous outcomes. In this article, we extend model confidence sets sequentially over time by relying on sequential testing methods. Recently, e-processes and confidence sequences have been introduced as new, safe methods for assessing statistical evidence. Sequential model confidence sets allow continuous monitoring of the models' performances and come with time-uniform, nonasymptotic coverage guarantees.

[14]  arXiv:2404.18732 [pdf, other]
Title: Two-way Homogeneity Pursuit for Quantile Network Vector Autoregression
Subjects: Methodology (stat.ME)

While the Vector Autoregression (VAR) model has received extensive attention for modelling complex time series, quantile VAR analysis remains relatively underexplored for high-dimensional time series data. To address this disparity, we introduce a two-way grouped network quantile (TGNQ) autoregression model for time series collected on large-scale networks, known for their significant heterogeneous and directional interactions among nodes. Our proposed model simultaneously conducts node clustering and model estimation to balance complexity and interpretability. To account for the directional influence among network nodes, each network node is assigned two latent group memberships that can be consistently estimated using our proposed estimation procedure. Theoretical analysis demonstrates the consistency of membership and parameter estimators even with an overspecified number of groups. With the correct group specification, estimated parameters are proven to be asymptotically normal, enabling valid statistical inferences. Moreover, we propose a quantile information criterion for consistently selecting the number of groups. Simulation studies show promising finite sample performance, and we apply the methodology to analyze connectedness and risk spillover effects among Chinese A-share stocks.

[15]  arXiv:2404.18779 [pdf, ps, other]
Title: Semiparametric fiducial inference
Subjects: Methodology (stat.ME); Statistics Theory (math.ST); Computation (stat.CO)

R. A. Fisher introduced the concept of fiducial as a potential replacement for the Bayesian posterior distribution in the 1930s. During the past century, fiducial approaches have been explored in various parametric and nonparametric settings. However, to the best of our knowledge, no fiducial inference has been developed in the realm of semiparametric statistics. In this paper, we propose a novel fiducial approach for semiparametric models. To streamline our presentation, we use the Cox proportional hazards model, which is the most popular model for the analysis of survival data, as a running example. Other models and extensions are also discussed. In our experiments, we find our method to perform well especially in situations when the maximum likelihood estimator fails.

[16]  arXiv:2404.18854 [pdf, other]
Title: Switching Models of Oscillatory Networks Greatly Improve Inference of Dynamic Functional Connectivity
Subjects: Methodology (stat.ME)

Functional brain networks can change rapidly as a function of stimuli or cognitive shifts. Tracking dynamic functional connectivity is particularly challenging as it requires estimating the structure of the network at each moment as well as how it is shifting through time. In this paper, we describe a general modeling framework and a set of specific models that provides substantially increased statistical power for estimating rhythmic dynamic networks, based on the assumption that for a particular experiment or task, the network state at any moment is chosen from a discrete set of possible network modes. Each model is comprised of three components: (1) a set of latent switching states that represent transitions between the expression of each network mode; (2) a set of latent oscillators, each characterized by an estimated mean oscillation frequency and an instantaneous phase and amplitude at each time point; and (3) an observation model that relates the observed activity at each electrode to a linear combination of the latent oscillators. We develop an expectation-maximization procedure to estimate the network structure for each switching state and the probability of each state being expressed at each moment. We conduct a set of simulation studies to illustrate the application of these models and quantify their statistical power, even in the face of model misspecification.

[17]  arXiv:2404.18857 [pdf, other]
Title: VT-MRF-SPF: Variable Target Markov Random Field Scalable Particle Filter
Authors: Ning Ning
Comments: 70 pages
Subjects: Methodology (stat.ME); Applications (stat.AP); Computation (stat.CO)

Markov random fields (MRFs) are invaluable tools across diverse fields, and spatiotemporal MRFs (STMRFs) amplify their effectiveness by integrating spatial and temporal dimensions. However, modeling spatiotemporal data introduces additional hurdles, including dynamic spatial dimensions and partial observations, prevalent in scenarios like disease spread analysis and environmental monitoring. Tracking high-dimensional targets with complex spatiotemporal interactions over extended periods poses significant challenges in accuracy, efficiency, and computational feasibility. To tackle these obstacles, we introduce the variable target MRF scalable particle filter (VT-MRF-SPF), a fully online learning algorithm designed for high-dimensional target tracking over STMRFs with varying dimensions under partial observation. We rigorously guarantee algorithm performance, explicitly indicating overcoming the curse of dimensionality. Additionally, we provide practical guidelines for tuning graphical parameters, leading to superior performance in extensive examinations.

[18]  arXiv:2404.18862 [pdf, other]
Title: Conformal Prediction Sets for Populations of Graphs
Subjects: Methodology (stat.ME)

The analysis of data such as graphs has been gaining increasing attention in the past years. This is justified by the numerous applications in which they appear. Several methods exist to predict graphs, but far fewer to quantify the uncertainty of the prediction. The present work proposes an uncertainty quantification methodology for graphs, based on conformal prediction. The method works both for graphs with the same set of nodes (labelled graphs) and graphs with no clear correspondence between the set of nodes across the observed graphs (unlabelled graphs). The unlabelled case is handled by creating prediction sets embedded in a quotient space. The proposed method does not rely on distributional assumptions, achieves finite-sample validity, and identifies interpretable prediction sets. To explore the features of this novel forecasting technique, we perform two simulation studies to show the methodology in both the labelled and the unlabelled case. We showcase the applicability of the method in analysing the performance of different teams during the FIFA 2018 football world championship via their player passing networks.

[19]  arXiv:2404.18905 [pdf, other]
Title: Detecting critical treatment effect bias in small subgroups
Comments: Accepted for presentation at the Conference on Uncertainty in Artificial Intelligence (UAI) 2024
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)

Randomized trials are considered the gold standard for making informed decisions in medicine, yet they often lack generalizability to the patient populations in clinical practice. Observational studies, on the other hand, cover a broader patient population but are prone to various biases. Thus, before using an observational study for decision-making, it is crucial to benchmark its treatment effect estimates against those derived from a randomized trial. We propose a novel strategy to benchmark observational studies beyond the average treatment effect. First, we design a statistical test for the null hypothesis that the treatment effects estimated from the two studies, conditioned on a set of relevant features, differ up to some tolerance. We then estimate an asymptotically valid lower bound on the maximum bias strength for any subgroup in the observational study. Finally, we validate our benchmarking strategy in a real-world setting and show that it leads to conclusions that align with established medical knowledge.

Cross-lists for Tue, 30 Apr 24

[20]  arXiv:2403.15352 (cross-list from q-bio.BM) [pdf, other]
Title: Universal Cold RNA Phase Transitions
Comments: Main: 21 pages, 5 figures. Supplementary Info: 29 pages, 10 figures, 6 tables
Subjects: Biomolecules (q-bio.BM); Biological Physics (physics.bio-ph); Quantitative Methods (q-bio.QM); Methodology (stat.ME)

RNA's diversity of structures and functions impacts all life forms since primordia. We use calorimetric force spectroscopy to investigate RNA folding landscapes in previously unexplored low-temperature conditions. We find that Watson-Crick RNA hairpins, the most basic secondary structure elements, undergo a glass-like transition below $\mathbf{T_G\sim 20 ^{\circ}}$C where the heat capacity abruptly changes and the RNA folds into a diversity of misfolded structures. We hypothesize that an altered RNA biochemistry, determined by sequence-independent ribose-water interactions, outweighs sequence-dependent base pairing. The ubiquitous ribose-water interactions lead to universal RNA phase transitions below $\mathbf{T_G}$, such as maximum stability at $\mathbf{T_S\sim 5 ^{\circ}}$C where water density is maximum, and cold denaturation at $\mathbf{T_C\sim-50^{\circ}}$C. RNA cold biochemistry may have a profound impact on RNA function and evolution.

[21]  arXiv:2404.17682 (cross-list from math.ST) [pdf, other]
Title: Testing for similarity of dose response in multi-regional clinical trials
Subjects: Statistics Theory (math.ST); Methodology (stat.ME)

This paper addresses the problem of deciding whether the dose response relationships between subgroups and the full population in a multi-regional trial are similar to each other. Similarity is measured in terms of the maximal deviation between the dose response curves. We consider a parametric framework and develop two powerful bootstrap tests for the similarity between the dose response curves of one subgroup and the full population, and for the similarity between the dose response curves of several subgroups and the full population. We prove the validity of the tests, investigate the finite sample properties by means of a simulation study and finally illustrate the methodology in a case study.

[22]  arXiv:2404.17737 (cross-list from stat.AP) [pdf, other]
Title: Neutral Pivoting: Strong Bias Correction for Shared Information
Authors: Joseph Rilling
Comments: 11 pages, 3 figures
Subjects: Applications (stat.AP); Methodology (stat.ME)

In the absence of historical data for use as forecasting inputs, decision makers often ask a panel of judges to predict the outcome of interest, leveraging the wisdom of the crowd (Surowiecki 2005). Even if the crowd is large and skilled, shared information can bias the simple mean of judges' estimates. Addressing the issue of bias, Palley and Soll (2019) introduce a novel approach called pivoting. Pivoting can take several forms, most notably the powerful and reliable minimal pivot. We build on the intuition of the minimal pivot and propose a more aggressive bias correction known as the neutral pivot. The neutral pivot achieves the largest bias correction of its class that both avoids the need to directly estimate crowd composition or skill and maintains a smaller expected squared error than the simple mean for all considered settings. Empirical assessments on real datasets confirm the effectiveness of the neutral pivot compared to current methods.

[23]  arXiv:2404.17769 (cross-list from cs.IR) [pdf, other]
Title: Conformal Ranked Retrieval
Comments: 14 pages, 6 figures, 1 table; 7 supplementary pages, 12 supplementary figures, 2 supplementary tables
Subjects: Information Retrieval (cs.IR); Methodology (stat.ME); Machine Learning (stat.ML)

Given the wide adoption of ranked retrieval techniques in various information systems that significantly impact our daily lives, there is an increasing need to assess and address the uncertainty inherent in their predictions. This paper introduces a novel method using the conformal risk control framework to quantitatively measure and manage risks in the context of ranked retrieval problems. Our research focuses on a typical two-stage ranked retrieval problem, where the retrieval stage generates candidates for subsequent ranking. By carefully formulating the conformal risk for each stage, we have developed algorithms to effectively control these risks within their specified bounds. The efficacy of our proposed methods has been demonstrated through comprehensive experiments on three large-scale public datasets for ranked retrieval tasks, including the MSLR-WEB dataset, the Yahoo LTRC dataset and the MS MARCO dataset.

[24]  arXiv:2404.17812 (cross-list from math.ST) [pdf, other]
Title: High-Dimensional Single-Index Models: Link Estimation and Marginal Inference
Comments: 42 pages
Subjects: Statistics Theory (math.ST); Methodology (stat.ME)

This study proposes a novel method for estimation and hypothesis testing in high-dimensional single-index models. We address a common scenario where the sample size and the dimension of regression coefficients are large and comparable. Unlike traditional approaches, which often overlook the estimation of the unknown link function, we introduce a new method for link function estimation. Leveraging the information from the estimated link function, we propose more efficient estimators that are better aligned with the underlying model. Furthermore, we rigorously establish the asymptotic normality of each coordinate of the estimator. This provides a valid construction of confidence intervals and $p$-values for any finite collection of coordinates. Numerical experiments validate our theoretical results.

[25]  arXiv:2404.17856 (cross-list from stat.ML) [pdf, other]
Title: Uncertainty quantification for iterative algorithms in linear models with application to early stopping
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Computation (stat.CO); Methodology (stat.ME)

This paper investigates the iterates $\hbb^1,\dots,\hbb^T$ obtained from iterative algorithms in high-dimensional linear regression problems, in the regime where the feature dimension $p$ is comparable with the sample size $n$, i.e., $p \asymp n$. The analysis and proposed estimators are applicable to Gradient Descent (GD), proximal GD and their accelerated variants such as Fast Iterative Soft-Thresholding (FISTA). The paper proposes novel estimators for the generalization error of the iterate $\hbb^t$ for any fixed iteration $t$ along the trajectory. These estimators are proved to be $\sqrt n$-consistent under Gaussian designs. Applications to early stopping are provided: when the generalization error of the iterates is a U-shaped function of the iteration $t$, the estimates allow one to select from the data an iteration $\hat t$ that achieves the smallest generalization error along the trajectory. Additionally, we provide a technique for developing debiasing corrections and valid confidence intervals for the components of the true coefficient vector from the iterate $\hbb^t$ at any finite iteration $t$. Extensive simulations on synthetic data illustrate the theoretical results.
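
As a rough illustration of the early-stopping idea in this abstract, the sketch below runs gradient descent on a small synthetic least-squares problem and picks the iteration with the smallest error on a held-out sample. The held-out split is a stand-in assumed here for simplicity; it is not the estimator proposed in the paper, which is described as selecting the iteration from the data. The helper names (`gd_iterates`, `pick_iteration`), the step size, and the toy data are all hypothetical.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def mse(X, y, b):
    """Mean squared prediction error of coefficient vector b on (X, y)."""
    return sum((dot(x, b) - yi) ** 2 for x, yi in zip(X, y)) / len(y)

def gd_iterates(X, y, steps, lr):
    """Gradient-descent iterates b^1, ..., b^T for least squares, started at zero."""
    n, p = len(X), len(X[0])
    b = [0.0] * p
    out = []
    for _ in range(steps):
        resid = [dot(x, b) - yi for x, yi in zip(X, y)]
        grad = [sum(r * x[j] for r, x in zip(resid, X)) / n for j in range(p)]
        b = [bj - lr * gj for bj, gj in zip(b, grad)]
        out.append(b)
    return out

def pick_iteration(iterates, X_val, y_val):
    """Return the 1-based iteration with the smallest held-out error, plus all errors."""
    errors = [mse(X_val, y_val, b) for b in iterates]
    best = min(range(len(errors)), key=errors.__getitem__)
    return best + 1, errors

# Toy data (assumed): p is comparable with n, only three nonzero coefficients.
random.seed(0)
n, p = 30, 20
beta = [1.0] * 3 + [0.0] * (p - 3)

def draw(m):
    X = [[random.gauss(0, 1) for _ in range(p)] for _ in range(m)]
    y = [dot(x, beta) + random.gauss(0, 1) for x in X]
    return X, y

X_train, y_train = draw(n)
X_val, y_val = draw(200)

iterates = gd_iterates(X_train, y_train, steps=200, lr=0.1)
t_hat, errors = pick_iteration(iterates, X_val, y_val)
print("selected iteration:", t_hat, "held-out error:", errors[t_hat - 1])
```

The selected iteration is, by construction, never worse on the held-out sample than the final iterate, which is the practical point of stopping early.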

[26]  arXiv:2404.17885 (cross-list from econ.EM) [pdf, ps, other]
Title: Sequential monitoring for explosive volatility regimes
Subjects: Econometrics (econ.EM); Methodology (stat.ME)

In this paper, we develop two families of sequential monitoring procedures to detect changes in a GARCH(1,1) model in a timely manner. Whilst our methodologies can be applied for the general analysis of changepoints in GARCH(1,1) sequences, they are in particular designed to detect changes from stationarity to explosivity or vice versa, thus allowing one to check for volatility bubbles. Our statistics can be applied irrespective of whether the historical sample is stationary or not, and indeed without prior knowledge of the regime of the observations before and after the break. In particular, we construct our detectors as the CUSUM process of the quasi-Fisher scores of the log likelihood function. In order to ensure timely detection, we then construct our boundary function (exceeding which would indicate a break) by including a weighting sequence which is designed to shorten the detection delay in the presence of a changepoint. We consider two types of weights: a lighter set of weights, which ensures timely detection in the presence of changes occurring early, but not too early after the end of the historical sample; and a heavier set of weights, called Rényi weights, which is designed to ensure timely detection in the presence of changepoints occurring very early in the monitoring horizon. In both cases, we derive the limiting distribution of the detection delays, indicating the expected delay for each set of weights. Our theoretical results are validated via a comprehensive set of simulations, and an empirical application to daily returns of individual stocks.
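
The general scheme described in this abstract, a CUSUM detector compared against a weighted boundary function, can be illustrated on a much simpler problem. The sketch below monitors the mean of a sequence instead of the quasi-Fisher scores of a GARCH(1,1) likelihood, and it assumes a boundary of the form c * sd * sqrt(m) * (1 + k/m) * (k/(k+m))^gamma, a common choice in the sequential monitoring literature that is not taken from the paper. The function name `monitor`, the constants `c` and `gamma`, and the toy sequences are hypothetical.

```python
import math

def monitor(history, stream, c=3.0, gamma=0.25):
    """Return the first monitoring index k (1-based) at which the CUSUM
    detector exceeds the boundary, or None if it never does."""
    m = len(history)
    mean = sum(history) / m
    sd = math.sqrt(sum((x - mean) ** 2 for x in history) / (m - 1))
    cusum = 0.0
    for k, x in enumerate(stream, start=1):
        # Detector: cumulative sum of deviations from the historical mean.
        cusum += x - mean
        # Boundary: a larger gamma lowers the boundary for small k,
        # which favours detecting changes that occur early.
        boundary = c * sd * math.sqrt(m) * (1 + k / m) * (k / (k + m)) ** gamma
        if abs(cusum) > boundary:
            return k
    return None

# Toy data (assumed): a historical sample with mean zero, then two streams.
history = [1.0, -1.0] * 50                  # m = 100 observations
stable = [1.0, -1.0] * 50                   # no change in the mean
shifted = [1.0, -1.0] * 10 + [3.0] * 80     # mean shifts after observation 20

print("stable stream:", monitor(history, stable))
print("shifted stream:", monitor(history, shifted))
```

On the unchanged stream the detector stays below the boundary, while on the shifted stream it crosses the boundary some observations after the change; that gap is the detection delay the abstract refers to.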

[27]  arXiv:2404.18786 (cross-list from math.ST) [pdf, ps, other]
Title: Randomization-based confidence intervals for the local average treatment effect
Comments: 40 pages
Subjects: Statistics Theory (math.ST); Methodology (stat.ME)

We consider the problem of generating confidence intervals in randomized experiments with noncompliance. We show that a refinement of a randomization-based procedure proposed by Imbens and Rosenbaum (2005) has desirable properties. Namely, we show that using a studentized Anderson-Rubin-type statistic as a test statistic yields confidence intervals that are finite-sample exact under treatment effect homogeneity, and remain asymptotically valid for the Local Average Treatment Effect when the treatment effect is heterogeneous. We provide a uniform analysis of this procedure.

Replacements for Tue, 30 Apr 24

[28]  arXiv:2006.13850 (replaced) [pdf, other]
Title: Global Sensitivity and Domain-Selective Testing for Functional-Valued Responses: An Application to Climate Economy Models
Subjects: Methodology (stat.ME); General Economics (econ.GN)
[29]  arXiv:2204.13439 (replaced) [pdf, ps, other]
Title: Mahalanobis balancing: a multivariate perspective on approximate covariate balancing
Authors: Yimin Dai, Ying Yan
Subjects: Methodology (stat.ME)
[30]  arXiv:2206.02508 (replaced) [pdf, other]
Title: Tucker tensor factor models: matricization and mode-wise PCA estimation
Subjects: Methodology (stat.ME)
[31]  arXiv:2301.13701 (replaced) [pdf, other]
Title: On the Stability of General Bayesian Inference
Comments: 29 pages, 7 figures
Subjects: Methodology (stat.ME)
[32]  arXiv:2304.04519 (replaced) [pdf, other]
Title: On new omnibus tests of uniformity on the hypersphere
Comments: 17 pages, 3 figures, 5 tables. Supplementary material: 16 pages, 3 figures, 4 tables
Journal-ref: Test, 32(4):1508-1529, 2023
Subjects: Methodology (stat.ME)
[33]  arXiv:2305.02685 (replaced) [pdf, other]
Title: Testing for no effect in regression problems: a permutation approach
Comments: Submitted to a special issue of Statistica Neerlandica
Subjects: Methodology (stat.ME); Statistics Theory (math.ST)
[34]  arXiv:2306.01198 (replaced) [pdf, other]
Title: Confidence Intervals for Error Rates in 1:1 Matching Tasks: Critical Statistical Analysis and Recommendations
Subjects: Methodology (stat.ME); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[35]  arXiv:2306.01468 (replaced) [pdf, other]
Title: Robust Bayesian Inference for Berkson and Classical Measurement Error Models
Comments: 60 pages, 12 figures. v2: Updated version of paper
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[36]  arXiv:2306.09151 (replaced) [pdf, other]
Title: Estimating the Sampling Distribution of Posterior Decision Summaries in Bayesian Clinical Trials
Subjects: Methodology (stat.ME)
[37]  arXiv:2307.16138 (replaced) [pdf, other]
Title: A switching state-space transmission model for tracking epidemics and assessing interventions
Comments: 45 pages. Submitted to Computational Statistics & Data Analysis
Subjects: Methodology (stat.ME); Physics and Society (physics.soc-ph); Applications (stat.AP)
[38]  arXiv:2309.09115 (replaced) [pdf, other]
Title: Fully Synthetic Data for Complex Surveys
Subjects: Methodology (stat.ME)
[39]  arXiv:2402.12323 (replaced) [pdf, other]
Title: Expressing and visualizing model uncertainty in Bayesian variable selection using Cartesian credible sets
Authors: J. E. Griffin
Subjects: Methodology (stat.ME)
[40]  arXiv:2403.12908 (replaced) [pdf, other]
Title: Regularised Spectral Estimation for High-Dimensional Point Processes
Subjects: Methodology (stat.ME)
[41]  arXiv:2111.04597 (replaced) [pdf, other]
Title: Neyman-Pearson Multi-class Classification via Cost-sensitive Learning
Authors: Ye Tian, Yang Feng
Comments: 117 pages, 18 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[42]  arXiv:2211.01939 (replaced) [pdf, other]
Title: Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation
Comments: Proceedings of the 12th International Conference on Learning Representations (ICLR), 2024. (Spotlight)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[43]  arXiv:2309.06305 (replaced) [pdf, other]
Title: Sensitivity Analysis for Linear Estimators
Comments: Previously circulated as Sensitivity Analysis for Linear Estimands
Subjects: Econometrics (econ.EM); Methodology (stat.ME)
[44]  arXiv:2311.02766 (replaced) [pdf, other]
Title: Riemannian Laplace Approximation with the Fisher Metric
Comments: AISTATS 2024, with additional fixes and improvements
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[45]  arXiv:2404.10942 (replaced) [pdf, other]
Title: What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning
Comments: 13 pages, 9 figures, accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Methodology (stat.ME)