Publications

Books

L.A.Prashanth and S.Bhatnagar, Gradient-based Algorithms for Zeroth Order Optimization, Frontiers and Trends in Optimization, NOW Publishers, 2025 prepublication draft

Journal Papers

L.Mandal and S.Bhatnagar, Optimizing Successive Over-relaxation Q-learning with Deterministic Perturbation Gradient Search, IEEE Transactions on Artificial Intelligence, Aug 2025 (accepted)
S.Bhatnagar and Deepak H.R., Variance Reduced Smoothed Functional REINFORCE Policy Gradient Algorithms, Transactions on Machine Learning Research (TMLR), July 2025 (accepted)
S.Guin, V.S.Borkar, and S.Bhatnagar, An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes, IEEE Transactions on Automatic Control (accepted) July 2025 online PDF, arXiv
L.Mandal and S.Bhatnagar, n-Step Temporal Difference Learning with Optimal n, Vol. 179, Article 112449 (9 pages), Automatica, 2025 online PDF, arXiv
S.Pachal, S.Bhatnagar, and L.A.Prashanth, Generalized Simultaneous Perturbation-based Gradient Search with Reduced Estimator Bias, IEEE Transactions on Automatic Control, Vol.70, No.7, pp.4687-4702, 2025 online PDF, arXiv

Preprints Submitted to journals

Our recent papers on arXiv can be found here

Proceedings of International Conferences

P.Dutta, M.Ayyoob, S.Bhatnagar, and A.Dukkipati, One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators, International Conference on Computer Vision (ICCV), Honolulu, Hawaii, Oct.19-23, 2025
P.Panda and S.Bhatnagar, Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation, AAAI, Philadelphia, USA, Feb 27-March 4, 2025 (accepted) arXiv