Citation: Killian JA, Biswas A, Shah S, Tambe M. Q-Learning Lagrange Policies for Multi-Action Restless Bandits, in Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021); 2021.
See also: Public Health
Last updated on 03/27/2023