HemingwAI: The Confidence Also Rises

Alexandra Adams, Protima Banerjee; Theresa Cauble, Noah Pape; Agam Singh, Eric Toa

Please use this identifier to cite or link to this item: https://dair.nps.edu/handle/123456789/5497

Full metadata record

DC Field	Value	Language
dc.contributor.author	Alexandra Adams, Protima Banerjee	-
dc.contributor.author	Theresa Cauble, Noah Pape	-
dc.contributor.author	Agam Singh, Eric Toa	-
dc.date.accessioned	2026-05-18T16:50:10Z	-
dc.date.available	2026-05-18T16:50:10Z	-
dc.date.issued	2026-04-30	-
dc.identifier.citation	APA 7	en_US
dc.identifier.uri	https://dair.nps.edu/handle/123456789/5497	-
dc.description	23rd Annual Acquisition Research Symposium and Innovation Summit	en_US
dc.description.abstract	As the Department of War (DoW) increasingly adopts Large Language Models (LLMs) to accelerate mission-critical functions, trust in model outputs becomes essential. While LLMs offer significant capability gains, their susceptibility to hallucinations presents an unacceptable risk in high-consequence environments. This paper introduces HemingwAI, an LLM-agnostic, modular trust framework designed to detect hallucinations and quantify hallucination risks. HemingwAI evaluates factual accuracy of LLM responses alongside response completeness, relevance, and subjectivity, producing actionable risk signals to support informed decision-making. The HemingwAI framework is designed to integrate seamlessly into existing DoW workflows and to support deployment in secure and air-gapped environments. HemingwAI was evaluated using open-domain hallucination benchmarks (HaluBench); benchmark results show measurable improvement in hallucination detection rates above the baseline. More importantly, HemingwAI’s operational relevance was evaluated through an internal Subject Matter Expert (SME) guided pilot. ASRC Federal domain subject matter experts confirmed strong alignment between HemingwAI outputs and human evaluation, demonstrating that the tool can effectively reduce analyst review burden. The work presented in this paper positions HemingwAI as a foundational capability for trusted, mission-ready AI adoption across DoW programs.	en_US
dc.description.sponsorship	Acquisition Research Program	en_US
dc.language.iso	en_US	en_US
dc.publisher	Acquisition Research Program	en_US
dc.relation.ispartofseries	Acquisition Management;SYM-AM-26-132	-
dc.subject	Trust Framework	en_US
dc.subject	Confidence Metric	en_US
dc.subject	Hallucination Detection	en_US
dc.subject	LLM Factuality Verification	en_US
dc.title	HemingwAI: The Confidence Also Rises	en_US
dc.type	Technical Report	en_US
Appears in Collections:	Annual Acquisition Research Symposium Proceedings & Presentations

Files in This Item:

File	Description	Size	Format
SYM-AM-26-132.pdf		1.36 MB	Adobe PDF	View/Open

Show simple item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets