Quantum partially observable Markov decision processes
Author(s)
Barry, Jennifer; Barry, Daniel T.; Aaronson, Scott
DownloadPhysRevA.90.032311.pdf (261.6Kb)
PUBLISHER_POLICY
Publisher Policy
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Terms of use
Metadata
Show full item recordAbstract
We present quantum observable Markov decision processes (QOMDPs), the quantum analogs of partially observable Markov decision processes (POMDPs). In a QOMDP, an agent is acting in a world where the state is represented as a quantum state and the agent can choose a superoperator to apply. This is similar to the POMDP belief state, which is a probability distribution over world states and evolves via a stochastic matrix. We show that the existence of a policy of at least a certain value has the same complexity for QOMDPs and POMDPs in the polynomial and infinite horizon cases. However, we also prove that the existence of a policy that can reach a goal state is decidable for goal POMDPs and undecidable for goal QOMDPs.
Date issued
2014-09Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
Physical Review A
Publisher
American Physical Society
Citation
Barry, Jennifer, Daniel T. Barry, and Scott Aaronson. "Quantum partially observable Markov decision processes." Phys. Rev. A 90, 032311 (September 2014). © 2014 American Physical Society
Version: Final published version
ISSN
1050-2947
1094-1622