Publications
Here is my list of publications in reverse-chronological order with * denoting equal contribution.
Preprints
- Efficient Q-Learning and Actor–Critic Methods for Robust Average-Reward Reinforcement Learning.
Yang Xu, Swetha Ganesh, Vaneet Aggarwal. arXiv:2506.07040, 2025. PDF
Publications
Finite-sample analysis of policy evaluation for robust average reward reinforcement learning.
Yang Xu, Washim Uddin Mondal, Vaneet Aggarwal. NeurIPS 2025. PDFGlobal Convergence for Average Reward Constrained MDPs with Primal-Dual Actor Critic Algorithm.
Yang Xu*, Swetha Ganesh*, Washim Uddin Mondal, Qinbo Bai, Vaneet Aggarwal. NeurIPS 2025. PDFAccelerating Quantum Reinforcement Learning with a Quantum Natural Policy Gradient Based Approach.
Yang Xu, Vaneet Aggarwal. ICML 2025. PDFQuantum Speedups in Regret Analysis of Infinite-Horizon Average-Reward Markov Decision Processes.
Bhargav Ganguly*, Yang Xu*, Vaneet Aggarwal. ICML 2025. PDFRLLTE: Long-Term Evolution Project of Reinforcement Learning.
Mingqi Yuan, Zequn Zhang, Yang Xu, Shihao Luo, Bo Li, Xin Jin, Wenjun Zeng. AAAI 2025. PDFDeep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions.
Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal. TMLR 2024. PDFTransformer empowered CSI feedback for massive MIMO systems.
Yang Xu, Mingqi Yuan, Man-On Pun. IEEE WOCC 2021. PDF
