Publications

Here is my list of publications in reverse-chronological order with * denoting equal contribution.

Preprints

  • Efficient Q-Learning and Actor–Critic Methods for Robust Average-Reward Reinforcement Learning.
    Yang Xu, Swetha Ganesh, Vaneet Aggarwal. arXiv:2506.07040, 2025. PDF

Publications

  • Finite-sample analysis of policy evaluation for robust average reward reinforcement learning.
    Yang Xu, Washim Uddin Mondal, Vaneet Aggarwal. NeurIPS 2025. PDF

  • Global Convergence for Average Reward Constrained MDPs with Primal-Dual Actor Critic Algorithm.
    Yang Xu*, Swetha Ganesh*, Washim Uddin Mondal, Qinbo Bai, Vaneet Aggarwal. NeurIPS 2025. PDF

  • Accelerating Quantum Reinforcement Learning with a Quantum Natural Policy Gradient Based Approach.
    Yang Xu, Vaneet Aggarwal. ICML 2025. PDF

  • Quantum Speedups in Regret Analysis of Infinite-Horizon Average-Reward Markov Decision Processes.
    Bhargav Ganguly*, Yang Xu*, Vaneet Aggarwal. ICML 2025. PDF

  • RLLTE: Long-Term Evolution Project of Reinforcement Learning.
    Mingqi Yuan, Zequn Zhang, Yang Xu, Shihao Luo, Bo Li, Xin Jin, Wenjun Zeng. AAAI 2025. PDF

  • Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions.
    Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal. TMLR 2024. PDF

  • Transformer empowered CSI feedback for massive MIMO systems.
    Yang Xu, Mingqi Yuan, Man-On Pun. IEEE WOCC 2021. PDF