Beyond fragility: what the fragility index cannot measure

Authors

  • Thomas F Heston Internet Medical Journal

DOI:

https://doi.org/10.5281/zenodo.19465222

Keywords:

statistical fragility, fragility index, robustness, neutrality boundary framework, p-value limitations, complete statistical evidence, evidence quality, clinical trials

Abstract

The fragility index reveals classification stability that p values conceal, but neither measure whether an observed result is geometrically separated from no effect, the point of therapeutic neutrality. Recent editorials in major journals have cataloged this limitation with unusual clarity, identifying sample-size dependence, value correlation, restriction to dichotomous outcomes, and inability to measure effect strength as fundamental deficiencies. This commentary argues that these deficiencies point to a structural gap requiring an orthogonal robustness dimension — one that quantifies distance from therapeutic neutrality, where therapies have no effect on outcomes,  independent of significance classification. The p–fr–nb framework addresses this gap by defining complete statistical evidence as a triplet: significance (p), fragility (fr), and robustness (nb). Empirical validation across 100 pharmaceutical trials demonstrates that half of all trials show discordance between p value classification and complete evidence assessment, confirming that reporting p-values alone yields systematically incomplete evidence.

References

1. Bhatia A. Understanding Stability by Evaluating Fragility (Indices). Anesthesiology. 2026;144: 764–766. doi:10.1097/ALN.0000000000005932

2. Kosyakovsky LB, Garan AR. Frailty, Thy Name is Significance: Examining the Utility of the Fragility Index in Cardiovascular Clinical Trials. J Card Fail - Intersect. 2026;0. doi:10.1016/j.yjcafi.2026.02.013

3. Walsh M, Srinathan SK, McAuley DF, Mrkobrada M, Levine O, Ribic C, et al. The statistical significance of randomized controlled trial results is frequently fragile: a case for a Fragility Index. J Clin Epidemiol. 2014;67: 622–628. doi:10.1016/j.jclinepi.2013.10.019

4. Ahmed W, Fowler RA, McCredie VA. Does sample size matter when interpreting the fragility index? Crit Care Med. 2016;44: e1142–e1143. doi:10.1097/CCM.0000000000001976

5. Heston TF. The Modified-Arm Fragility Quotient: An Improved Metric for Assessing Robustness in Clinical Trials. Zenodo. 2025. doi:10.5281/zenodo.17014261

6. Heston TF. Meaningful Change Index: A P-Value Independent Metric for Assessing Robustness and Fragility in Continuous Outcomes. Zenodo. 2025. doi:10.5281/zenodo.17212383

7. Heston TF. Significance, Fragility, and Robustness in Clinical Trials: Stratifying Statistical Evidence. Cureus. 2025;17. doi:10.7759/cureus.100494

8. Heston TF. The Neutrality Boundary Framework: Quantifying Statistical Robustness Geometrically. arXiv. 2025; 2511.00982. doi:10.48550/arXiv.2511.00982

9. Albert CL, Pampori A, Guglin M. Fragility Index Analysis of Contemporary Temporary Mechanical Circulatory Support Devices in Cardiogenic Shock. J Card Fail - Intersect. 2026; S3050661126000250. doi:10.1016/j.yjcafi.2025.12.014

10. Heston TF. Fragility Metrics Toolkit. Zenodo; 2026. doi:10.5281/zenodo.17254763

Downloads

Published

2026-04-08

How to Cite

Heston, T. F. (2026). Beyond fragility: what the fragility index cannot measure. Internet Medical Journal, 1(1). https://doi.org/10.5281/zenodo.19465222