From Hype to Reality: Real-World Lessons and Recommendations for AI in Military Applications
Author(s)
Lynch, Joshua; Niss, Laura
DownloadMain Report (1.462Mb)
Metadata
Show full item recordAbstract
The current use cases, limitations, and future capacity
of large language models (LLMs) as assistants to military
personnel remain an open question. This paper presents a case
study of an Airman’s interaction with and trust calibration of
LLMs over three months, both as an everyday assistant and
for development of ROMAD-AI, a tactical military application.
Through intuitive, AI-generated software development, an approach
relying on iterative code generation through natural
language prompting of LLMs from a technical novice rather
than human generated programming from a technical expert,
the research reveals significant gaps between industry curated
AI capability demonstrations and operational reality, requiring
systematic trust calibration and realistic scope management.
Outcomes are analyzed through operational and technical expertise
perspectives to provide practical guidance for both military
service members seeking effective AI integration and researchers
developing military-focused AI systems.
Date issued
2026-02-17Department
Lincoln LaboratoryKeywords
AI, LLMs, military, vibe coding, application development, trust, calibration