Please use this identifier to cite or link to this item: https://dair.nps.edu/handle/123456789/5497
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAlexandra Adams, Protima Banerjee-
dc.contributor.authorTheresa Cauble, Noah Pape-
dc.contributor.authorAgam Singh, Eric Toa-
dc.date.accessioned2026-05-18T16:50:10Z-
dc.date.available2026-05-18T16:50:10Z-
dc.date.issued2026-04-30-
dc.identifier.citationAPA 7en_US
dc.identifier.urihttps://dair.nps.edu/handle/123456789/5497-
dc.description23rd Annual Acquisition Research Symposium and Innovation Summiten_US
dc.description.abstractAs the Department of War (DoW) increasingly adopts Large Language Models (LLMs) to accelerate mission-critical functions, trust in model outputs becomes essential. While LLMs offer significant capability gains, their susceptibility to hallucinations presents an unacceptable risk in high-consequence environments. This paper introduces HemingwAI, an LLM-agnostic, modular trust framework designed to detect hallucinations and quantify hallucination risks. HemingwAI evaluates factual accuracy of LLM responses alongside response completeness, relevance, and subjectivity, producing actionable risk signals to support informed decision-making. The HemingwAI framework is designed to integrate seamlessly into existing DoW workflows and to support deployment in secure and air-gapped environments. HemingwAI was evaluated using open-domain hallucination benchmarks (HaluBench); benchmark results show measurable improvement in hallucination detection rates above the baseline. More importantly, HemingwAI’s operational relevance was evaluated through an internal Subject Matter Expert (SME) guided pilot. ASRC Federal domain subject matter experts confirmed strong alignment between HemingwAI outputs and human evaluation, demonstrating that the tool can effectively reduce analyst review burden. The work presented in this paper positions HemingwAI as a foundational capability for trusted, mission-ready AI adoption across DoW programs.en_US
dc.description.sponsorshipAcquisition Research Programen_US
dc.language.isoen_USen_US
dc.publisherAcquisition Research Programen_US
dc.relation.ispartofseriesAcquisition Management;SYM-AM-26-132-
dc.subjectTrust Frameworken_US
dc.subjectConfidence Metricen_US
dc.subjectHallucination Detectionen_US
dc.subjectLLM Factuality Verificationen_US
dc.titleHemingwAI: The Confidence Also Risesen_US
dc.typeTechnical Reporten_US
Appears in Collections:Annual Acquisition Research Symposium Proceedings & Presentations

Files in This Item:
File Description SizeFormat 
SYM-AM-26-132.pdf1.36 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.