STUDENT PAPER: A Multiagent Reinforcement Learning Algorithm by Dynamically Merging Markov Decision Processes Mohammad Ghavamzadeh
Sridhar Mahadevan
Department of Computer Science University of Massachusetts Amherst Amherst, MA 01003
Department of Computer Science University of Massachusetts Amherst Amherst, MA 01003
[email protected]
[email protected]
ABSTRACT !"$#%'&(*),+&-&../0#'1/#'23/4&'1/+!)5.+6+78%9
0#'1/:;=@#'1A!B#C#!"%DB"E1/"C#'+'>"%FG!+6+BH?+&I+7J#'1A