Global Convergence of Multi-Agent Policy Gradient in Markov Potential GamesApr 25, 2022ยทStefanos Leonardos,Will Overman,Ioannis Panageas,Georgios Piliourasยท 0 min read Cite URL 2-state MDP congestion game considered in the experiments sectionTypeConference paperPublicationInternational Conference on Learning RepresentationsLast updated on Apr 25, 2022 ← Online Resource Allocation in Episodic Markov Decision Processes Jan 1, 2023Independent Natural Policy Gradient always converges in Markov Potential Games Mar 1, 2022 →