Array

Versions

PDF

Keywords

TruthCert
Evidence Synthesis
Health Informatics
Large Language Models
Uganda
Data Integrity

How to Cite

Ssenfuma, R. (2026). TruthCert: A Fail-Closed Certification Protocol for LLM Outputs in Evidence Synthesis. Synthēsis, 5(1). Retrieved from https://synthesis-medicine.org/index.php/journal/article/view/87 (Original work published June 6, 2026)

Abstract

TITLE: TRUTHCERT: A FAIL-CLOSED CERTIFICATION PROTOCOL FOR LLM OUTPUTS IN EVIDENCE SYNTHESIS

Can a robust, fail-closed certification protocol successfully prevent the introduction of silently incorrect Large Language Model outputs, often characterized by plausible but entirely fabricated clinical data from infiltrating high-stakes evidence synthesis workflows and digital health registries within the specific resource-constrained informatics infrastructure of the Ugandan healthcare system?

To address this, we engineered TruthCert as a highly rigorous, versioned technical standard that mandates the use of scope-locked estimands, granular per-value provenance chains, multi-witness decentralized arbitration, and immutable cryptographic bundle hashing to ensure the absolute integrity of digital health research data. This sophisticated protocol assembles a minimum of three independent AI witnesses to verify every atomic clinical claim, applies highly specialized domain-specific validator packs across twelve distinct medical extension domains, and automatically rejects any data bundles that demonstrate insufficient evidentiary support or failed internal consistency checks.

The primary challenge addressed is the high failure rate of unverified AI; when tested against fifty complex simulated Randomized Controlled Trial (RCT) extraction tasks, TruthCert successfully identified and rejected all eighteen intentionally corrupted data bundles while accurately certifying thirty out of thirty-two valid sets, achieving a remarkable Area Under the Curve of 0.97 for overall certification accuracy. Furthermore, during rigorous stress testing, the adversarial injection of critical arm-swap errors, complex unit mismatches, and subtle citation drift was successfully detected in every single instance, resulting in zero false certifications across all tested corruption types, thereby proving the protocol's reliability against common "hallucinations".

This structured, fail-closed verification framework effectively transforms the fundamental LLM accuracy problem from a risky reliance on model confidence into a transparent, auditable process of evidence completeness with mandatory disclosure. For medical institutions such as Mukono General Hospital, this protocol provides a definitive technical solution by ensuring that automated clinical decision support systems and neonatal readmission models only process verified, high-integrity data, effectively insulating Ugandan patient outcomes from the significant risks of AI-generated misinformation.

PDF

References

1. Marshall IJ, Wallace BC. Toward systematic review automation: a practical guide to using machine learning tools in research synthesis. Syst Rev. 2019; 8:163. doi:10.1186/s13643-019-1074-9

2. Jonnalagadda SR, Goyal P, Huffman MD. Automating data extraction in systematic reviews: a systematic review. Syst Rev. 2015; 4:78. doi:10.1186/s13643-015-0066-7

3. Borenstein M, Hedges LV, Higgins JPT, Rothstein HR. Introduction to Meta-Analysis. 2nd ed. Wiley; 2021.

4. Higgins JPT, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. BMJ. 2003; 327 (7414):557-560.

5. Cochrane Handbook for Systematic Reviews of Interventions. Version 6.4. Cochrane; 2023.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright (c) 2026 Ronnie Ssenfuma, Ahmad Mahmood