Please use this identifier to cite or link to this item:
https://dair.nps.edu/handle/123456789/5497Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Alexandra Adams, Protima Banerjee | - |
| dc.contributor.author | Theresa Cauble, Noah Pape | - |
| dc.contributor.author | Agam Singh, Eric Toa | - |
| dc.date.accessioned | 2026-05-18T16:50:10Z | - |
| dc.date.available | 2026-05-18T16:50:10Z | - |
| dc.date.issued | 2026-04-30 | - |
| dc.identifier.citation | APA 7 | en_US |
| dc.identifier.uri | https://dair.nps.edu/handle/123456789/5497 | - |
| dc.description | 23rd Annual Acquisition Research Symposium and Innovation Summit | en_US |
| dc.description.abstract | As the Department of War (DoW) increasingly adopts Large Language Models (LLMs) to accelerate mission-critical functions, trust in model outputs becomes essential. While LLMs offer significant capability gains, their susceptibility to hallucinations presents an unacceptable risk in high-consequence environments. This paper introduces HemingwAI, an LLM-agnostic, modular trust framework designed to detect hallucinations and quantify hallucination risks. HemingwAI evaluates factual accuracy of LLM responses alongside response completeness, relevance, and subjectivity, producing actionable risk signals to support informed decision-making. The HemingwAI framework is designed to integrate seamlessly into existing DoW workflows and to support deployment in secure and air-gapped environments. HemingwAI was evaluated using open-domain hallucination benchmarks (HaluBench); benchmark results show measurable improvement in hallucination detection rates above the baseline. More importantly, HemingwAI’s operational relevance was evaluated through an internal Subject Matter Expert (SME) guided pilot. ASRC Federal domain subject matter experts confirmed strong alignment between HemingwAI outputs and human evaluation, demonstrating that the tool can effectively reduce analyst review burden. The work presented in this paper positions HemingwAI as a foundational capability for trusted, mission-ready AI adoption across DoW programs. | en_US |
| dc.description.sponsorship | Acquisition Research Program | en_US |
| dc.language.iso | en_US | en_US |
| dc.publisher | Acquisition Research Program | en_US |
| dc.relation.ispartofseries | Acquisition Management;SYM-AM-26-132 | - |
| dc.subject | Trust Framework | en_US |
| dc.subject | Confidence Metric | en_US |
| dc.subject | Hallucination Detection | en_US |
| dc.subject | LLM Factuality Verification | en_US |
| dc.title | HemingwAI: The Confidence Also Rises | en_US |
| dc.type | Technical Report | en_US |
| Appears in Collections: | Annual Acquisition Research Symposium Proceedings & Presentations | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| SYM-AM-26-132.pdf | 1.36 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.