A New Multiple Imputation Method for High‐Dimensional Neuroimaging Data


ABSTRACT

Missing data are a prevalent challenge in neuroimaging, with significant implications for downstream statistical analysis. Neglecting this issue can introduce bias and lead to erroneous inferential conclusions, making it crucial to employ appropriate statistical methods for handling missing data. Although multiple imputation is a widely used technique, its application in neuroimaging is severely hindered by the high dimensionality of neuroimaging data and by substantial computational demands. To tackle these computational challenges, we propose a novel approach, High‐dimensional Multiple Imputation (HIMA), based on Bayesian models specifically designed for large‐scale neuroimaging datasets. HIMA introduces a new computational strategy that samples large covariance matrices based on a robustly estimated posterior mode, significantly improving both computational efficiency and numerical stability. To assess the effectiveness of HIMA, we conducted extensive simulation studies and a real‐data analysis of a schizophrenia brain imaging dataset with around 1000 voxels. HIMA substantially reduces the computational burden: for example, 1 hour with HIMA versus 800 hours with classic multiple imputation packages. HIMA also demonstrates improved precision and stability of the imputed data.

Abstract

HIMA, a novel multiple imputation method specifically designed for high‐dimensional neuroimaging data, drastically reduces computational burden (e.g., 1 h vs. 800 h for traditional methods) while improving imputation precision and stability, as evidenced by theoretical justification, extensive simulations, and real data analysis.
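For readers unfamiliar with the general technique named in the abstract, the following is a minimal sketch of generic multiple imputation under a multivariate normal model. It is not the HIMA algorithm and does not reproduce the authors' code. As a simplifying assumption, it holds the mean and covariance fixed at supplied estimates, whereas a full Bayesian procedure would also sample the covariance matrix for each imputed dataset. The function names `impute_once` and `pool_rubin` are hypothetical and introduced only for illustration.

```python
import numpy as np


def impute_once(X, mu, Sigma, rng):
    """Fill NaNs row by row with a draw from the conditional normal
    distribution of the missing entries given the observed entries.
    mu and Sigma are treated as fixed (a simplifying assumption)."""
    X_imp = X.copy()
    for i in range(X.shape[0]):
        m = np.isnan(X[i])          # missing entries in this row
        if not m.any():
            continue
        o = ~m                      # observed entries in this row
        if not o.any():
            # Nothing observed: draw from the marginal distribution.
            X_imp[i] = rng.multivariate_normal(mu, Sigma)
            continue
        S_oo = Sigma[np.ix_(o, o)]
        S_mo = Sigma[np.ix_(m, o)]
        S_mm = Sigma[np.ix_(m, m)]
        # W = S_mo S_oo^{-1}, computed with a linear solve, not an inverse.
        W = np.linalg.solve(S_oo, S_mo.T).T
        cond_mean = mu[m] + W @ (X[i, o] - mu[o])
        cond_cov = S_mm - W @ S_mo.T
        cond_cov = (cond_cov + cond_cov.T) / 2   # enforce symmetry
        X_imp[i, m] = rng.multivariate_normal(cond_mean, cond_cov)
    return X_imp


def pool_rubin(estimates, variances):
    """Combine per-dataset estimates and their variances with Rubin's rules."""
    q = np.asarray(estimates, dtype=float)
    u = np.asarray(variances, dtype=float)
    M = len(q)
    q_bar = q.mean()                # pooled point estimate
    within = u.mean()               # average within-imputation variance
    between = q.var(ddof=1)         # between-imputation variance
    total = within + (1 + 1 / M) * between
    return q_bar, total


# Usage: impute a small synthetic dataset 5 times and pool the mean of column 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
X[rng.random(X.shape) < 0.2] = np.nan            # about 20% missing entries
mu_hat = np.nanmean(X, axis=0)
complete_rows = X[~np.isnan(X).any(axis=1)]
Sigma_hat = np.cov(complete_rows, rowvar=False)  # crude complete-case estimate
ests, vars_ = [], []
for _ in range(5):
    Xi = impute_once(X, mu_hat, Sigma_hat, rng)
    ests.append(Xi[:, 0].mean())
    vars_.append(Xi[:, 0].var(ddof=1) / Xi.shape[0])
pooled_mean, pooled_var = pool_rubin(ests, vars_)
```

The row-wise solve against the observed block of the covariance matrix is what becomes expensive as the number of variables grows, which is the kind of cost the abstract refers to for high-dimensional data.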


Author and article information

Contributors

Shuo Chen:

ORCID: https://orcid.org/0000-0002-7990-4947

shuochen@som.umaryland.edu

Journal

Journal ID (nlm-ta): Hum Brain Mapp

Journal ID (iso-abbrev): Hum Brain Mapp

Journal ID (doi): 10.1002/(ISSN)1097-0193

Journal ID (publisher-id): HBM

Title: Human Brain Mapping

Publisher: John Wiley & Sons, Inc. (Hoboken, USA)

ISSN (Print): 1065-9471

ISSN (Electronic): 1097-0193

Publication date (Electronic): 21 March 2025

Publication date Collection: 01 April 2025

Volume: 46

Issue: 5 (doiID: 10.1002/hbm.v46.5)

Electronic Location Identifier: e70161

Affiliations

[¹] Department of Mathematics, University of Maryland, College Park, Maryland, USA

[²] Department of Psychiatry and Behavioral Science, University of Texas Health Science Center, Houston, Texas, USA

[³] Division of Biostatistics and Bioinformatics, Department of Epidemiology and Public Health, School of Medicine, University of Maryland, Baltimore, Maryland, USA

[⁴] University of Maryland Institute for Health Computing, North Bethesda, Maryland, USA

[⁵] Department of Statistics and Data Science, University of Central Florida, Orlando, Florida, USA

[⁶] Maryland Psychiatric Research Center, Department of Psychiatry, School of Medicine, University of Maryland, Catonsville, Maryland, USA

Author notes

[*] Correspondence:

Shuo Chen (shuochen@som.umaryland.edu)

Author information

Tong Lu https://orcid.org/0009-0000-5688-2207

Shuo Chen https://orcid.org/0000-0002-7990-4947

Article

Publisher ID: HBM70161 Other ID: HBM-24-0964.R1

DOI: 10.1002/hbm.70161

PMC ID: 11926575

PubMed ID: 40116075

SO-VID: 24e09b9b-ef09-48ed-9792-58eef33a5f03

License:

This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

History

Date received: 19 September 2024

Date revision received: 06 January 2025

Date accepted: 31 January 2025

Page count

Figures: 6, Tables: 4, Pages: 13, Words: 7700

Funding

Funded by: National Institutes of Health (doi: 10.13039/100000002)

Award ID: 1DP1DA04896801

Custom metadata


ScienceOpen disciplines: Neurology

Keywords: Bayesian, large covariance matrix, multiple imputation, multivariate missing data, posterior mode

