Publications


For the complete list, click here


Recent Publications (from 2024)


Books


  1. L.A.Prashanth and S.Bhatnagar, Gradient-based Algorithms for Zeroth Order Optimization, Foundations and Trends in Optimization, NOW Publishers, 2025 prepublication draft


Journal Papers


  1. S.Guin, V.S.Borkar, and S.Bhatnagar, An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes, IEEE Transactions on Automatic Control (accepted) July 2025 arXiv

  2. L.Mandal and S.Bhatnagar, n-Step Temporal Difference Learning with Optimal n, Automatica, Vol. 179, Article 112449 (9 pages), 2025 online PDF, arXiv

  3. S.Pachal, S.Bhatnagar, and L.A.Prashanth, Generalized Simultaneous Perturbation-based Gradient Search with Reduced Estimator Bias, IEEE Transactions on Automatic Control, Vol. 70, No. 7, pp. 4687-4702, 2025 online PDF, arXiv

  4. L.Mandal, C.Lakshminarayanan and S.Bhatnagar, Approximate Linear Programming for Decentralized Policy Iteration in Cooperative Multi-agent Markov Decision Processes, Systems and Control Letters, December 2024 (accepted) online PDF

  5. L.Mandal, D.R.Bharadwaj and S.Bhatnagar, Variance-Reduced Deep Actor-Critic with an Optimally Sub-Sampled Actor Recursion, IEEE Transactions on Artificial Intelligence, Vol. 5, No. 7, pp. 3607-3623, 2024 online PDF

  6. Vivek V.P and S.Bhatnagar, Efficient Energy Management in Smart Grids with Finite Horizon Q-Learning, Sustainable Energy, Grids and Networks, 38:101277, 2024 online PDF

  7. A.Mondal, L.A.Prashanth, and S.Bhatnagar, Truncated Cauchy Random Perturbations for Smoothed Functional-based Stochastic Optimization, Automatica, 162:111528, 2024 arXiv, online PDF

  8. A.Barat, K.J.Prabuchandran, and S.Bhatnagar, Energy Management in a Cooperative Energy Harvesting Wireless Sensor Network, IEEE Communications Letters, Vol. 28, No. 1, pp. 243-247, 2024 arXiv, online PDF


Preprints Submitted to Journals


Our recent papers on arXiv can be found here


Proceedings of International Conferences


  1. P.Dutta, M.Ayyoob, S.Bhatnagar, and A.Dukkipati, One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators, International Conference on Computer Vision (ICCV), Honolulu, Hawaii, Oct.19-23, 2025

  2. P.Panda and S.Bhatnagar, Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation, AAAI, Philadelphia, USA, Feb 27-March 4, 2025 (accepted) arXiv

  3. S.Salmalge and S.Bhatnagar, Reinforcement Learning Algorithms with Graph Convolution Networks for Traffic Signal Control, EAI Intelligent Systems Transport Conference, University of Pisa, Pisa, Italy, December 4-6, 2024.

  4. A.Srivastava, S.Bhatnagar, M.N.Murty and A.Raman J., Learning dynamic representations in large language models for evolving data streams, International Conference on Pattern Recognition (ICPR), Kolkata, December 2024.

  5. P.Panda and S.Bhatnagar, Finite Time Analysis of Constrained Actor Critic and Constrained Natural Actor Critic Algorithms, Uncertainty in Artificial Intelligence (UAI), Barcelona, Spain, July 17-19, 2024 (accepted) arXiv

  6. M.Maniyar, Prashanth L.A., A.Mondal and S.Bhatnagar, A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning, 27th International Conference on Artificial Intelligence and Statistics (AISTATS), Valencia, Spain, May 2-4, 2024 (accepted) arXiv

  7. Vivek V.P, D.R.Bharadwaj and S.Bhatnagar, Dynamic Energy Management in Competing Microgrids using Reinforcement Learning, Conference on Innovative Smart Grid Technologies, North America (ISGT NA 2024), Washington D.C., Feb 19-24, 2024 online PDF