From Hype to Reality: Real-World Lessons and Recommendations for AI in Military Applications

Lynch, Joshua; Niss, Laura

Author(s)

Lynch, Joshua; Niss, Laura

DownloadMain Report (1.462Mb)

Metadata

Show full item record

Abstract

The current use cases, limitations, and future capacity of large language models (LLMs) as assistants to military personnel remain an open question. This paper presents a case study of an Airman’s interaction with and trust calibration of LLMs over three months, both as an everyday assistant and for development of ROMAD-AI, a tactical military application. Through intuitive, AI-generated software development, an approach relying on iterative code generation through natural language prompting of LLMs from a technical novice rather than human generated programming from a technical expert, the research reveals significant gaps between industry curated AI capability demonstrations and operational reality, requiring systematic trust calibration and realistic scope management. Outcomes are analyzed through operational and technical expertise perspectives to provide practical guidance for both military service members seeking effective AI integration and researchers developing military-focused AI systems.

Date issued

2026-02-17

URI

https://hdl.handle.net/1721.1/164898

Department

Lincoln Laboratory

Keywords

AI, LLMs, military, vibe coding, application development, trust, calibration

Collections

Reports