Please use this identifier to cite or link to this item:
https://dair.nps.edu/handle/123456789/5497| Title: | HemingwAI: The Confidence Also Rises |
| Authors: | Alexandra Adams, Protima Banerjee Theresa Cauble, Noah Pape Agam Singh, Eric Toa |
| Keywords: | Trust Framework Confidence Metric Hallucination Detection LLM Factuality Verification |
| Issue Date: | 30-Apr-2026 |
| Publisher: | Acquisition Research Program |
| Citation: | APA 7 |
| Series/Report no.: | Acquisition Management;SYM-AM-26-132 |
| Abstract: | As the Department of War (DoW) increasingly adopts Large Language Models (LLMs) to accelerate mission-critical functions, trust in model outputs becomes essential. While LLMs offer significant capability gains, their susceptibility to hallucinations presents an unacceptable risk in high-consequence environments. This paper introduces HemingwAI, an LLM-agnostic, modular trust framework designed to detect hallucinations and quantify hallucination risks. HemingwAI evaluates factual accuracy of LLM responses alongside response completeness, relevance, and subjectivity, producing actionable risk signals to support informed decision-making. The HemingwAI framework is designed to integrate seamlessly into existing DoW workflows and to support deployment in secure and air-gapped environments. HemingwAI was evaluated using open-domain hallucination benchmarks (HaluBench); benchmark results show measurable improvement in hallucination detection rates above the baseline. More importantly, HemingwAI’s operational relevance was evaluated through an internal Subject Matter Expert (SME) guided pilot. ASRC Federal domain subject matter experts confirmed strong alignment between HemingwAI outputs and human evaluation, demonstrating that the tool can effectively reduce analyst review burden. The work presented in this paper positions HemingwAI as a foundational capability for trusted, mission-ready AI adoption across DoW programs. |
| Description: | 23rd Annual Acquisition Research Symposium and Innovation Summit |
| URI: | https://dair.nps.edu/handle/123456789/5497 |
| Appears in Collections: | Annual Acquisition Research Symposium Proceedings & Presentations |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| SYM-AM-26-132.pdf | 1.36 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.