Warning: fopen(/home/virtual/enm-kes/journal/upload/ip_log/ip_log_2026-06.txt): failed to open stream: Permission denied in /home/virtual/lib/view_data.php on line 100 Warning: fwrite() expects parameter 1 to be resource, boolean given in /home/virtual/lib/view_data.php on line 101

Division of Endocrinology and Metabolism, Department of Internal Medicine, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Korea
Copyright © 2025 Korean Endocrine Society
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
CONFLICTS OF INTEREST
No potential conflict of interest relevant to this article was reported.
ACKNOWLEDGMENTS
This study was supported by the Korean Endocrine Society, and Korean Society of Bone and Mineral Research Young Investigator Award in 2022.
| Interpretability method | Description | Example in fracture prediction | Clinical relevance |
|---|---|---|---|
| Grad-CAM | Generates visual heatmaps showing which parts of an image contributed most to the model’s prediction | Focuses on femoral neck or vertebral body deformities in fracture detection [39,40] | Confirms that the model’s attention aligns with clinical reasoning; builds trust |
| Radiomics integration | Extracts predefined features (e.g., texture and intensity) and correlates with deep features | Uses radiomic features in DXA-based fracture models [28] | Enhances transparency and interpretability; supports regulatory review |
| Neurosymbolic rule-based modeling | Combines deep learning-based segmentation with transparent rule-based classification logic | Vertebral height ratio rules applied after vertebral segmentation in CT [40] | High accuracy with transparent logic; easier for clinicians and regulators to interpret |
| Survival plot visualization | Predicts time-to-event outcomes and visualizes individual patient risk over time | 2.5D ensemble model predicting subsequent fracture risk after hip fracture [37] | Intuitive risk communication using survival curves; supports patient counseling |
| Domain | Requirement | Description | Examples/references |
|---|---|---|---|
| Development transparency | Adherence to reporting guidelines (e.g., MI-CLAIM and CONSORT-AI) | Ensure reproducibility, quality control, and regulatory readiness | Structured reporting of dataset, algorithm, and validation process [45] |
| External validation | Multisite, multi-device, and multi-population evaluation | Assess model generalizability across real-world settings | Validation across institutions [21] |
| Calibration and equity | Calibration plots and subgroup-specific metrics | Detect performance drift and avoid bias across demographic groups | C-index by sex/age; predicted vs. observed risk plots [34] |
| Fail-safes and alerts | Confidence scoring, OOD detection, and image quality warnings | Prevent unsafe use of AI under uncertain conditions | Alert on low-quality input; defer-to-human in ambiguous cases |
| Workflow integration | Embedding in EMR, PACS, or FLS pipelines | Enable real-time clinical use with minimal friction | Auto-generated risk scores in radiology report or FLS referral |
| Interpretability at point of care | Intuitive outputs (e.g., heatmaps and survival curves) | Improve clinician trust and facilitate patient communication | Grad-CAM, vertebral attention maps, and survival plots [37] |
AI, artificial intelligence; MI-CLAIM, Minimum Information Checklist for Artificial Intelligence in Medical Imaging; CONSORT-AI, Consolidated Standards of Reporting Trials–Artificial Intelligence extension; C-index, concordance index; OOD, out-of-distribution; EMR, electronic medical record; PACS, picture archiving and communication system; FLS, fracture liaison service; Grad-CAM, gradient-weighted class activation mapping.
| Interpretability method | Description | Example in fracture prediction | Clinical relevance |
|---|---|---|---|
| Grad-CAM | Generates visual heatmaps showing which parts of an image contributed most to the model’s prediction | Focuses on femoral neck or vertebral body deformities in fracture detection [39,40] | Confirms that the model’s attention aligns with clinical reasoning; builds trust |
| Radiomics integration | Extracts predefined features (e.g., texture and intensity) and correlates with deep features | Uses radiomic features in DXA-based fracture models [28] | Enhances transparency and interpretability; supports regulatory review |
| Neurosymbolic rule-based modeling | Combines deep learning-based segmentation with transparent rule-based classification logic | Vertebral height ratio rules applied after vertebral segmentation in CT [40] | High accuracy with transparent logic; easier for clinicians and regulators to interpret |
| Survival plot visualization | Predicts time-to-event outcomes and visualizes individual patient risk over time | 2.5D ensemble model predicting subsequent fracture risk after hip fracture [37] | Intuitive risk communication using survival curves; supports patient counseling |
| Domain | Requirement | Description | Examples/references |
|---|---|---|---|
| Development transparency | Adherence to reporting guidelines (e.g., MI-CLAIM and CONSORT-AI) | Ensure reproducibility, quality control, and regulatory readiness | Structured reporting of dataset, algorithm, and validation process [45] |
| External validation | Multisite, multi-device, and multi-population evaluation | Assess model generalizability across real-world settings | Validation across institutions [21] |
| Calibration and equity | Calibration plots and subgroup-specific metrics | Detect performance drift and avoid bias across demographic groups | C-index by sex/age; predicted vs. observed risk plots [34] |
| Fail-safes and alerts | Confidence scoring, OOD detection, and image quality warnings | Prevent unsafe use of AI under uncertain conditions | Alert on low-quality input; defer-to-human in ambiguous cases |
| Workflow integration | Embedding in EMR, PACS, or FLS pipelines | Enable real-time clinical use with minimal friction | Auto-generated risk scores in radiology report or FLS referral |
| Interpretability at point of care | Intuitive outputs (e.g., heatmaps and survival curves) | Improve clinician trust and facilitate patient communication | Grad-CAM, vertebral attention maps, and survival plots [37] |
Grad-CAM, gradient-weighted class activation mapping; DXA, dual-energy X-ray absorptiometry; CT, computed tomography; 2.5D, 2.5-dimensional.
AI, artificial intelligence; MI-CLAIM, Minimum Information Checklist for Artificial Intelligence in Medical Imaging; CONSORT-AI, Consolidated Standards of Reporting Trials–Artificial Intelligence extension; C-index, concordance index; OOD, out-of-distribution; EMR, electronic medical record; PACS, picture archiving and communication system; FLS, fracture liaison service; Grad-CAM, gradient-weighted class activation mapping.