Beyond fragility: what the fragility index cannot measure
DOI:
https://doi.org/10.5281/zenodo.19465222Keywords:
statistical fragility, fragility index, robustness, neutrality boundary framework, p-value limitations, complete statistical evidence, evidence quality, clinical trialsAbstract
The fragility index reveals classification stability that p values conceal, but neither measure whether an observed result is geometrically separated from no effect, the point of therapeutic neutrality. Recent editorials in major journals have cataloged this limitation with unusual clarity, identifying sample-size dependence, p value correlation, restriction to dichotomous outcomes, and inability to measure effect strength as fundamental deficiencies. This commentary argues that these deficiencies point to a structural gap requiring an orthogonal robustness dimension — one that quantifies distance from therapeutic neutrality, where therapies have no effect on outcomes, independent of significance classification. The p–fr–nb framework addresses this gap by defining complete statistical evidence as a triplet: significance (p), fragility (fr), and robustness (nb). Empirical validation across 100 pharmaceutical trials demonstrates that half of all trials show discordance between p value classification and complete evidence assessment, confirming that reporting p-values alone yields systematically incomplete evidence.
References
1. Bhatia A. Understanding Stability by Evaluating Fragility (Indices). Anesthesiology. 2026;144: 764–766. doi:10.1097/ALN.0000000000005932
2. Kosyakovsky LB, Garan AR. Frailty, Thy Name is Significance: Examining the Utility of the Fragility Index in Cardiovascular Clinical Trials. J Card Fail - Intersect. 2026;0. doi:10.1016/j.yjcafi.2026.02.013
3. Walsh M, Srinathan SK, McAuley DF, Mrkobrada M, Levine O, Ribic C, et al. The statistical significance of randomized controlled trial results is frequently fragile: a case for a Fragility Index. J Clin Epidemiol. 2014;67: 622–628. doi:10.1016/j.jclinepi.2013.10.019
4. Ahmed W, Fowler RA, McCredie VA. Does sample size matter when interpreting the fragility index? Crit Care Med. 2016;44: e1142–e1143. doi:10.1097/CCM.0000000000001976
5. Heston TF. The Modified-Arm Fragility Quotient: An Improved Metric for Assessing Robustness in Clinical Trials. Zenodo. 2025. doi:10.5281/zenodo.17014261
6. Heston TF. Meaningful Change Index: A P-Value Independent Metric for Assessing Robustness and Fragility in Continuous Outcomes. Zenodo. 2025. doi:10.5281/zenodo.17212383
7. Heston TF. Significance, Fragility, and Robustness in Clinical Trials: Stratifying Statistical Evidence. Cureus. 2025;17. doi:10.7759/cureus.100494
8. Heston TF. The Neutrality Boundary Framework: Quantifying Statistical Robustness Geometrically. arXiv. 2025; 2511.00982. doi:10.48550/arXiv.2511.00982
9. Albert CL, Pampori A, Guglin M. Fragility Index Analysis of Contemporary Temporary Mechanical Circulatory Support Devices in Cardiogenic Shock. J Card Fail - Intersect. 2026; S3050661126000250. doi:10.1016/j.yjcafi.2025.12.014
10. Heston TF. Fragility Metrics Toolkit. Zenodo; 2026. doi:10.5281/zenodo.17254763
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Thomas F Heston

This work is licensed under a Creative Commons Attribution 4.0 International License.