Student Publications

Sampling inequalities affect generalization of neuroimaging-based diagnostic classifiers in psychiatry

Zhiyi Chen, Third Military Medical University
Bowen Hu, Southwest University
Xuerong Liu, Third Military Medical University
Benjamin Becker, University of Electronic Science and Technology of China
Simon B. Eickoff, Heinrich Heine University
Kuan Miao, Third Military Medical University
Xingmei Gu, Third Military Medical University (China)
Yancheng Tang, Shanghai International Studies University
Xin Dai, Southwest University
Chao Li, Sun Yat-sen University
Artemiy Leonov, Clark University
Zhibing Xiao, Beijing Normal University
Zhengzhi Feng, Third Military Medical University
Ji Chen, Zhejiang University School of Medicine
Hu Chuan-Peng, Nanjing Normal University

Document Type

Article

Abstract

Background:
The development of machine learning models for aiding in the diagnosis of mental disorder is recognized as a significant breakthrough in the field of psychiatry. However, clinical practice of such models remains a challenge, with poor generalizability being a major limitation.

Methods:
Here, we conducted a pre-registered meta-research assessment on neuroimaging-based models in the psychiatric literature, quantitatively examining global and regional sampling issues over recent decades, from a view that has been relatively underexplored. A total of 476 studies (n = 118,137) were included in the current assessment. Based on these findings, we built a comprehensive 5-star rating system to quantitatively evaluate the quality of existing machine learning models for psychiatric diagnoses.

Results:
A global sampling inequality in these models was revealed quantitatively (sampling Gini coefficient (G) = 0.81, p <.01), varying across different countries (regions) (e.g., China, G = 0.47; the USA, G = 0.58; Germany, G = 0.78; the UK, G = 0.87). Furthermore, the severity of this sampling inequality was significantly predicted by national economic levels (β = − 2.75, p <.001, R ²_adj = 0.40; r = −.84, 95% CI: −.41 to −.97), and was plausibly predictable for model performance, with higher sampling inequality for reporting higher classification accuracy. Further analyses showed that lack of independent testing (84.24% of models, 95% CI: 81.0–87.5%), improper cross-validation (51.68% of models, 95% CI: 47.2–56.2%), and poor technical transparency (87.8% of models, 95% CI: 84.9–90.8%)/availability (80.88% of models, 95% CI: 77.3–84.4%) are prevailing in current diagnostic classifiers despite improvements over time. Relating to these observations, model performances were found decreased in studies with independent cross-country sampling validations (all p <.001, BF₁₀ > 15). In light of this, we proposed a purpose-built quantitative assessment checklist, which demonstrated that the overall ratings of these models increased by publication year but were negatively associated with model performance.

Conclusions:
Together, improving sampling economic equality and hence the quality of machine learning models may be a crucial facet to plausibly translating neuroimaging-based diagnostic classifiers into clinical practice.

Publication Title

BMC Medicine

Publication Date

12-2023

Volume

Issue

ISSN

1741-7015

DOI

10.1186/s12916-023-02941-4

Keywords

diagnostic classification, meta-analysis, neuroimaging, psychiatric machine learning, sampling inequalities

Repository Citation

Chen, Zhiyi; Hu, Bowen; Liu, Xuerong; Becker, Benjamin; Eickoff, Simon B.; Miao, Kuan; Gu, Xingmei; Tang, Yancheng; Dai, Xin; Li, Chao; Leonov, Artemiy; Xiao, Zhibing; Feng, Zhengzhi; Chen, Ji; and Chuan-Peng, Hu, "Sampling inequalities affect generalization of neuroimaging-based diagnostic classifiers in psychiatry" (2023). Student Publications. 20.
https://commons.clarku.edu/student_publications/20

Link to Full Text

Find in your library

COinS

Student Publications

Sampling inequalities affect generalization of neuroimaging-based diagnostic classifiers in psychiatry

Document Type

Abstract

Publication Title

Publication Date

Volume

Issue

ISSN

DOI

Keywords

Repository Citation

Search

Browse

Participate

Links

Student Publications

Sampling inequalities affect generalization of neuroimaging-based diagnostic classifiers in psychiatry

Authors

Document Type

Abstract

Publication Title

Publication Date

Volume

Issue

ISSN

DOI

Keywords

Repository Citation

Share

Search

Browse

Participate

Links