Time: 16: 15 – 17: 45 (JST), July 14, 2023.

Location: R205, Education & Research Building No. 8, Muroran Institute of Technology, 27-1 Mizumoto-cho, Muroran, Hokkaido, 0508585 Japan.

CO-ORGANIZED BY:

IEEE Muroran Institute of Technology Student Branch

IEEE Sapporo Section Young Professionals (YP)

Emerging Networks and Systems Laboratory (ENeS), Muroran Institute of Technology

Topic: Toward A Pessimistic-Optimistic Framework for Constrained Bandit Learning

Speaker: Ning Lu, Assistant Professor, Queen’s University, Canada

Abstract:

The online optimization of time-varying quantities with unknown statistics is a common issue in modern engineering problems, particularly in communication networks. The multiarmed bandit (MAB) problem is a suitable model for many of these problems, where the player must choose actions that yield random rewards from an unknown distribution. In many cases, these problems also have arm selection constraints, where playing or not playing an arm expends some budgeted resource. Recent work has introduced pessimisticoptimistic algorithms to handle these constraints and track constraint violations using virtual queues. However, many engineering problems require additional modeling considerations, such as delayed feedback or arm switching costs, that may not fit neatly within this framework. In this talk, we will introduce an extensible framework for pessimistic-optimistic algorithms that can accommodate these additional considerations. We will also introduce our recent work exploring constrained bandit learning with delayed feedback and with switching costs, along with some novel results that more broadly impact constrained bandit learning problems.

Biography:

Dr. Ning Lu is an Assistant Professor in the Department of Electrical & Computer Engineering at Queen’s University. He is also a Tier 2 Canada Research Chair in Future Communication Networks. Dr. Lu received the B.Eng. (2007) and M.Eng. (2010) degrees from Tongji University, Shanghai, China, and Ph.D. degree (2015) from the University of Waterloo, Waterloo, ON, Canada, all in electrical engineering. Prior to joining Queen’s University, he was an assistant professor in the Department of Computing Science at Thompson Rivers University, Kamloops, BC, Canada. From 2015 to 2016, he was a postdoctoral fellow with the Coordinated Science Laboratory, University of Illinois at Urbana-Champaign. He also spent the summer of 2009 as an intern in the National Institute of Informatics, Tokyo, Japan. His current research interests include real-time scheduling, distributed algorithms, and reinforcement learning for wireless communication networks. He has published more than 40 papers in top IEEE journals and conferences, including IEEE/ACM Transactions on Networking, IEEE Journal on Selected Areas in Communications, ACM MobiHoc, and IEEE INFOCOM, etc. He was a recipient of John R. Evans Leaders Fund. He received a best paper award at the 2014 IEEE GLOBECOM. He has served as journal guest editor, TPC member of major IEEE conferences, and reviewer of refereed journals.

Contact:

Emerging Networks and Systems Laboratory (ENeS)

Email: [email protected]