Master of Technology (M.Tech) Project Students


Current Students


  1. Megha Patidar, M.Tech (CSE), Safe Reinforcement Learning, joined 2025

  2. Amandeep Nokhwal, M.Tech (CSE), RL Algorithms, joined 2025

  3. Parui Lipika Netai Laxmi, M.Tech (AI), RL Algorithms, joined 2025

  4. Hebbalkar Tejashri Bhavakana, M.Tech (AI), RL Algorithms, joined 2025


Former Students


  1. Soumitra Sihahajari, M.Tech (CSE), Multi-agent Reinforcement Learning for Competitive Environments using Self-Play, 2025

  2. Vishal Prajapati, M.Tech (CSE), RL Algorithms for Financial Markets, 2025

  3. Badiga Rajesh, M.Tech (CSE), PPO vs. TRPO for Semantic Vision-Based Autonomous Driving in CARLA, 2025

  4. Kiran Bade, M.Tech (CSE), Reinforced RAG: Optimizing Contextual Responses with RL, 2025

  5. Chaitanya Velpula, M.Tech (AI), Reinforcement Learning Algorithms for the Risk Sensitive Criterion, 2025

  6. Aditya Vikram Choudhury, M.Tech (CSE), Reinforcement Learning Algorithms for Multi-Agent Games in Competitive Settings with an Emphasis on Efficient Pre-Training, 2024

  7. Rahul Ranjan, M.Tech (CSE), Sharing Experiences Improves Multi-Agent Reinforcement Learning, 2024

  8. Srinjoy Mukherjee, M.Tech (CSE), Instruction Following using Natural Language and Reinforcement Learning, 2024

  9. Rahul Dev Boipai, M.Tech (CSE), Advantage-Weighted Regression with Decision Transformer, 2024

  10. Mohammad Ayyob, M.Tech (CSE), Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators, 2024

  11. Kirtee Mishra, M.Tech (CSE), Enhancing Traffic Flow with SPSA: A Simulation Study in Urban Networks, 2024

  12. Mayank Sati, M.Tech (CSE), Instruction Following using Vision and Natural Language, 2023

  13. Rankit Kachroo, M.Tech (CSE), Safe Lane Interchange in Autonomous Driving using Reinforcement Learning, 2023

  14. Samarth Singh, M.Tech (CSE), Energy Sharing for Multiple Sensor Nodes with Random Priority Data Bursts, 2023

  15. Shrabana Biswas, M.Tech (CSE), Risk-Sensitive Actor-Critic Methods, 2023

  16. Shreya Salmalge Sharad, M.Tech (CSE), Reinforcement Learning Algorithms for Vehicular Traffic Control, 2023

  17. Saurabh Jaiswal, M.Tech (AI), Energy Trading and Scheduling in Smart Grids using Soft Actor-Critic Methods, 2023

  18. Rituraj Joshi, M.Tech (CSE): Instruction Following using Vision and Natural Language, 2022

  19. Sambit Ghosh, M.Tech (CSE): Explaining the Role of Reward Functions in Deep Reinforcement Learning, 2022

  20. Prasanna Srikar Regati, M.Tech (CSE): Reinforcement Learning Algorithms, 2021

  21. Rohan Deb, M.Tech (CSE): Gradient Temporal Difference with Momentum and Lambda-Schedule, 2021

  22. Rokkam Sandeep Reddy, M.Tech (AI): Deep Reinforcement Learning for Quadrupedal Locomotion, 2021

  23. Sai Sravan Reddy T.D, M.Tech (CSE): Distributed Reinforcement Learning Algorithms for Dynamic Energy Pricing in Microgrids, 2021

  24. Soumya Rani Samineni, M.Tech (CSE) (joint guidance with Prof. Shishir N.Y.Kolathaya): Learning Techniques for Continuous Control and Safety of Robots, 2021

  25. Srishty Suman, M.Tech (CSE) (joint guidance with Prof. Bharadwaj Amrutur): Learning to Generate Action Sequence using Natural Language Command, 2021

  26. Vamsi Krishna Satya Chilamkurthi, M.Tech (AI) (joint guidance with Prof. Bharadwaj Amrutur): Learning to Control Robot for Manipulation Tasks through Natural Language, 2021

  27. Vinayak Jha, M.Tech (CSE): Reinforcement Learning in Trade, 2021

  28. Vinod Kumar Reddy, M.Tech (AI): Deep Reinforcement Learning for Financial Markets, 2021

  29. Ashish Raghuvanshi, M.Tech (CSE): Learning Control Policies for Quadruped Robots, 2020

  30. Mohd. Haroon Ansari, M.Tech (CSE): Deep Reinforcement Learning for E-grocery Supply Chain Optimization, 2020

  31. Shivam Chauhan, M.Tech (CSE): Reinforcement Learning for Machine Reading Comprehension, 2020

  32. Waquar Azam, M.Tech (CSE): Dynamic Route Adaptation of Vehicles, 2020

  33. Amishi Singh, M.Tech (CSE): Single Intrusion Detection in Multiple Wireless Sensor Networks, 2019

  34. Priya Bundela, M.Tech (CSE): Traffic Control and Optimization Using Reinforcement Learning, 2019

  35. Sandeep Nishad, M.Tech (CSE): Imitation and Reinforcement Learning for Robotic Arm, 2019

  36. Samadhan Sharma, M.Tech (CSE): A Modular Deep Network Architecture For Improving The Generalization of Reinforcement Learning, 2019

  37. Sayambhu Sen, M.Tech (SE): Multiagent Learning systems for Traffic control and Off-Policy Imitation Learning, 2019

  38. Sonu Dixit, M.Tech (SE): Adaptive Traffic Signal Control Using Multi Agent Reinforcement Learning, 2019

  39. Kodate Shreedhar Shreeshail, M.Tech (CSE): Multi-agent Adaptive Traffic Signal Control Using Deep Reinforcement Learning, 2018

  40. Tushar Shinde, M.Tech (CSE): Control Policies for Textual Games using Deep Reinforcement Learning, 2018

  41. Anurag Yadav, M.Tech (CSE): Adaptive Traffic Signal Control using Reinforcement Learning Algorithms, 2018

  42. Parankusham Keshav, M.Tech (CSE): Traffic Signal Optimization using Multi-Agent Reinforcement Learning, 2018

  43. Yashwant Krishnadas, M.Tech (CSE): Real-Time Intelligent Traffic Management using Multi-Agent Deep Reinforcement Learning, 2018

  44. Debangshu Banerjee, M.Tech (SE): Hex and Neuro-dynamic Programming, 2018

  45. Arjun Dhakad, M.E (CSE): Using Reinforcement Learning for Dynamic Allocation of Resources in Cloud Computing, 2017

  46. Monica, M.E (SSA): Intrusion Detection in Decentralized Energy Efficient Wireless Sensor Networks using Reinforcement Learning Algorithms, 2017

  47. Musunuri Prathima, M.E (CSE): Intelligent Traffic Signal Control by Modelling junctions for Noisy Events, 2017

  48. Praloy Karmakar, M.E (SSA): Fast Gradient-based Stochastic Algorithms with Simultaneous Perturbations, 2017

  49. Rohith R.R, M.E (SSA): Deep Reinforcement Learning Algorithms and their Applications, 2017

  50. Sushma Tingare, M.E (CSE): Junction Clustering Algorithms for Vehicular Traffic Control, 2017

  51. Swapnil Kakade, M.E (CSE), Reinforcement Learning Algorithms for Vehicular Traffic Control and Control of Pedestrian Movements, 2017

  52. D.Raghuram Bharadwaj, M.E.(CSE): Novel Sensor Scheduling Scheme for Intruder Tracking in Energy Efficient Sensor Networks, 2016

  53. Aiswarya S., M.E.(SSA): Dynamic Resource Allocation in Cloud using Q-learning, 2016

  54. Prakash Chandra, M.E.(SSA): Actor-Critic Algorithm for Solving MDPs and a Q-routing Algorithm for Adaptive Routing, 2015

  55. Raj Kumar Maity, M.E.(SSA): Deterministic and Random Perturbation Algorithms for Simulation Optimization, 2015

  56. Rishabh Singla, M.E.(CSE): Finite Horizon Markov Decision Process for Optimal Resource Allocation on Crowdsourcing Platforms, 2015

  57. Anurag Tomar, M.E.(CSE): Parameter Tuning and Feature Adaptation for Traffic Signal Control, 2015

  58. Indrajeet Kumar, M.E.(CSE): Actor-Critic Reinforcement Learning Based Energy Management Policies for a Single Sensor Node with Finite Buffer, 2015

  59. Lawqueen Kanesh, M.E.(CSE): Optimal Sleeping Policies for Intrusion Detection in Wireless Sensor Networks, 2015

  60. Ayush Dubey, M.E.(CSE): A Markov Decision Process Framework for Predictable Job Completion Times on Crowdsourcing Platforms, 2014.

  61. Hemanth Kumar, M.E.(CSE): Multi-agent Reinforcement Learning for Traffic Signal Control, 2014.

  62. Arun Kumar, M.E.(CSE): Parametric optimization in CSMA multiaccess communication protocols, 2014.

  63. Vinayaka G. Yaji, M.E.(SSA): Algorithms for Constrained Stochastic Games, 2013.

  64. Legena P.K., M.E.(CSE): Optimal Traffic Signal Timing Using Reinforcement Learning, 2013.

  65. Srujana Sadula, M.E.(CSE), Optimal Pricing of Tasks for Predictable Job Completion Times in Crowdsourcing Platforms, 2013.

  66. Akash Gidda, M.E.(CSE), Reinforcement Learning based Transmit Power Control with ARQ in Energy Harvesting Sensors, 2013.

  67. Indu John, M.E.(CSE), Gibbs Sampling Methods for Efficient Inference in the Hierarchical Dirichlet Process Mixture Model, 2013.

  68. Naveen Kumar (Jointly with Dr.Ambedkar Dukkipati), M.E.(CSE): Stochastic Optimization Algorithms for Average Cost Markov Decision Processes, 2013.

  69. Saswata Chakravarty, M.E.(SSA): Stochastic optimization and applications in reinforcement learning and optimal pricing, 2012.

  70. Abhranil Chatterjee, M.E.(SSA): Reinforcement learning based sleep-wake scheduling for object tracking in wireless sensor networks, 2012.

  71. Debarghya Ghosh Dastidar (Jointly with Dr.Ambedkar Dukkipati), M.E.(SSA): Properties of multivariate q-Gaussian distributions and its application to smoothed functional algorithms for stochastic optimization, 2012.

  72. Ravindra V, M.E.(CSE): Algorithms for optimal vehicular traffic control, 2012.

  73. Sunil Kumar Meena, M.E.(CSE): Reinforcement learning based optimal energy management policy for a single sensor node, 2012.

  74. Nikhil K.Malukani, M.E.(CSE): Pricing for enhanced QoS using SPSA and smoothed functional algorithms, 2012.

  75. Rajendu Mitra, M.E.(CSE): Intrusion detection using sensor networks, 2011.

  76. Shravan Kumar B.M, M.E.(CSE): Internet pricing, 2011.

  77. Chandrashekar Lakshmi Narayanan, M.E.(SSA) : An actor-critic algorithm based on linear programming and function approximation, 2010.

  78. Sudha Rani K, M.E.(ISE) : Ant colony optimization with applications in networks, 2008.

  79. A.Radhika, M.E.(CSE) : Performance optimization in ad hoc wireless networks, 2008.

  80. Chintapally Anil Kumar, M.E.(CSE) : Dynamic pricing in networks, 2008.

  81. G.Ramana Reddy, M.E.(CSE) : Performance optimization in bluetooth networks, 2008.

  82. Venkatesh C, M.E. (CSE) : A Unified Framework for Admission Control, Routing and Resource Allocation in Communication Networks, 2007.

  83. Mohan Gedela, M.E. (CSE) : Congestion-based Pricing for QoS, 2007.

  84. Koteswararao Vemu, M.E.(CSE): Link-Route Congestion Based Pricing for Enhanced QoS, 2007.

  85. Vijay P. Chaturvedi, M.E.(ISE) : An Efficient and Optimized Bluetooth Scheduling Algorithm for Piconets, 2007.

  86. H.L.Prasad, M.E.(SSA) : Terrain Exploration by Multi-Agents, 2007.

  87. Muralidhar, M.E.(CSE) : Performance Analysis of UMTS Networks, 2006.

  88. U.V.Vishwanath, M.E.(SSA) : An Optimal Scheduling of Agents in Call Centres, 2006.

  89. V.Rakesh, M.E.(SSA) : Optimal Parameterized Scheduling Policies in Bluetooth Networks, 2006.

  90. B.S.Channabasavanna, M.E.(CSE) : A New Real Time Vehicle Navigation System, 2005.

  91. Jai Kumar Wadhwani, M.E.(CSE) : Call Admission Control in Communication Networks, 2005.

  92. K.Mohan Babu, M.E.(SSA) : Two-Timescale Q-Learning with Applications to Routing in Communication Networks, 2005.

  93. Archana Singh, M.E.(ISE) : Resource Allocation via Stochastic Approximation, 2004.

  94. Jnana Ranjan Panigrahi, M.E.(SSA) : Hierarchical Decision Making in Semiconductor Fabs using Multi-time Scale Markov Decision Processes, 2004.

  95. A Madhukar, M.E.(SSA) : Ergodic Control of Markov Chains Conditioned on Rare Events, 2004.

  96. Raghavendra Kumar Pandey, M.E.(SSA) : Higher Order Algorithms for Simulation Optimization with Applications to Communication Networks, 2004.

  97. Abhishek Verma, M.E.(ISE) : TCP Flow Control with RED Gateways, 2003.

  98. I.Bala Bhaskar Reddy, M.E.(SSA) : Admission Control in Communication Networks, 2003.

  99. Shishir Kumar, M.E.(CSE) : ABR Flow Control in ATM Networks, 2003.