From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy

Заболотній, Сергій Васильович; Zabolotnii, Serhii; Holinko, Viktoriia; Antonenko, Olha

dc.contributor.author	Заболотній, Сергій Васильович
dc.contributor.author	Zabolotnii, Serhii
dc.contributor.author	Holinko, Viktoriia
dc.contributor.author	Antonenko, Olha
dc.date.accessioned	2026-06-05T13:34:47Z
dc.date.available	2026-06-05T13:34:47Z
dc.date.issued	2026
dc.description.abstract	Trust in clinical artificial intelligence (AI) cannot be reduced to model accuracy, fluency of generation, or overall positive user impression. In medicine, trust must be engineered as a measurable system property grounded in evidence, supervision, and operational boundaries of AI autonomy. This article proposes a practical framework for trustworthy clinical AI built around three principles: evidence, supervision, and staged autonomy. Rather than replacing deterministic clinical logic wholesale with end-to-end black-box models, the proposed approach combines a deterministic core, a patient-specific AI assistant for contextual validation, a multi-tier model escalation mechanism, and a human supervision layer for verification, escalation, and risk control. We demonstrate that trust also depends on selective verification of clinically critical findings, bounded clinical context, disciplined prompt architecture, and careful evaluation on realistic cases. Classifier-driven modular prompting is examined as an incremental path to scaling clinical depth without sacrificing prompt performance and without waiting for complete rule-based coverage. To operationalize trust, a set of trust metrics is proposed, built on metrological principles -- measurement uncertainty, calibration, traceability -- enabling quantitative rather than subjective assessment of each architectural layer. In this perspective, trustworthy clinical AI emerges not as a property of an individual model, but as an architectural outcome of a system into which evidence trails, human oversight, tiered escalation, and graduated action rights are embedded from the outset.
dc.identifier.citation	Zabolotnii S.V., Holinko V., Antonenko O. From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy. arXiv (Preprint). 2026. https://doi.org/10.48550/ARXIV.2604.26671
dc.identifier.uri	https://arxiv.org/abs/2604.26671
dc.identifier.uri	https://dr.csbc.edu.ua/handle/123456789/2248
dc.language.iso	en
dc.publisher	https://arxiv.org/
dc.subject	TECHNOLOGY
dc.subject	SOCIAL SCIENCES::Statistics, computer and systems science::Informatics, computer and systems science::Information technology
dc.subject	MEDICINE
dc.subject	MEDICINE::Physiology and pharmacology::Physiology::Medical informatics
dc.title	From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy
dc.type	Article

Files

Original bundle

Name: 2604.26671v1.pdf
Size: 2.39 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed to upon submission

Collections

Information Technology