Publications


For the complete list, click here


Recent Publications (from 2025)


Books


  1. L.A.Prashanth and S.Bhatnagar, Gradient-based Algorithms for Zeroth Order Optimization, Foundations and Trends in Optimization, NOW Publishers, 2025; prepublication draft


Journal Papers


  1. L.Mandal and S.Bhatnagar, Optimizing Successive Over-relaxation Q-learning with Deterministic Perturbation Gradient Search, IEEE Transactions on Artificial Intelligence, Aug 2025 (accepted)

  2. S.Bhatnagar and Deepak H.R., Variance Reduced Smoothed Functional REINFORCE Policy Gradient Algorithms, Transactions on Machine Learning Research (TMLR), July 2025 (accepted)

  3. S.Guin, V.S.Borkar, and S.Bhatnagar, An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes, IEEE Transactions on Automatic Control, July 2025 (accepted); online PDF, arXiv

  4. L.Mandal and S.Bhatnagar, n-Step Temporal Difference Learning with Optimal n, Automatica, Vol. 179, Article 112449 (9 pages), 2025; online PDF, arXiv

  5. S.Pachal, S.Bhatnagar, and L.A.Prashanth, Generalized Simultaneous Perturbation-based Gradient Search with Reduced Estimator Bias, IEEE Transactions on Automatic Control, Vol. 70, No. 7, pp. 4687-4702, 2025; online PDF, arXiv


Preprints Submitted to Journals


Our recent papers on arXiv can be found here


Proceedings of International Conferences


  1. P.Dutta, M.Ayyoob, S.Bhatnagar, and A.Dukkipati, One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators, International Conference on Computer Vision (ICCV), Honolulu, Hawaii, Oct. 19-23, 2025

  2. P.Panda and S.Bhatnagar, Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation, AAAI, Philadelphia, USA, Feb. 27-March 4, 2025 (accepted); arXiv