Kundan Kumar
  • About
  • CV
  • Research
  • Projects
    • โœจ All Projects

    • Large Language Models
    • Deep Reinforcement Learning
    • Deep/Machine Learning
    • Statistics
    • Computer Vision
    • Robotics
  • Teaching
  • Blogs

On this page

  • ๐Ÿš€ Research Vision
  • ๐Ÿง  Research Themes
    • Safe & Trustworthy Reinforcement Learning
    • Transfer Learning & Meta-Adaptation
    • Vision-Simulation Integration
    • LLM-Augmented Decision Systems
  • ๐Ÿ”ฌ Application Domains
  • ๐Ÿ“š Selected Publications
  • ๐Ÿงช Ongoing Projects
  • Edit this page
  • Report an issue

Research

๐Ÿš€ Research Vision

I aim to advance the frontier of safe, interpretable, and adaptive AI for cyber-physical systems operating under uncertainty and dynamic constraints. My research sits at the intersection of machine learning, optimization, and control theory, with a particular focus on:

  • Physics-informed Deep Reinforcement Learning (DRL)
  • Probabilistic & Bayesian Modeling
  • Large Language Models (LLMs) for autonomous reasoning
  • Vision-based simulation environments

By tightly integrating domain knowledge into learning frameworks, I design agents capable of robust decision-making in real-world, high-stakes environments such as smart grids, robotics, and intelligent infrastructure.


๐Ÿง  Research Themes

Safe & Trustworthy Reinforcement Learning

๐ŸŽฏ Objective

Develop control agents that guarantee system safety, stability, and robust learning in dynamic, uncertain, and partially observable environments.

๐Ÿ” Core Focus Areas

  • Constrained policy optimization and reward shaping
  • Physics-based priors in DRL
  • Adversarial resilience and anomaly detection
  • Epistemic and aleatoric uncertainty quantification

Safe RL Diagram

Transfer Learning & Meta-Adaptation

๐ŸŽฏ Objective

Enabling rapid generalization across distribution shifts in topology, weather, or load profiles.

๐Ÿ” Core Focus Areas

  • Transferable actor-critic architectures
  • Simulation-to-real (Sim2Real) adaptation
  • Meta-RL for sample efficiency

Safe RL Diagram

Vision-Simulation Integration

๐ŸŽฏ Objective

Bridge the gap between perception and control by combining synthetic sensors, simulated environments, and end-to-end learning pipelines.

๐Ÿ” Core Focus Areas

  • Perception-action loops with CARLA, AirSim
  • Multi-modal representation fusion (image + state)
  • Autonomous control with embedded perception modules
  • End-to-end autonomous control pipelines

Safe RL Diagram

LLM-Augmented Decision Systems

๐ŸŽฏ Objective

Develop control agents that guarantee system safety, stability, and robust learning in dynamic, uncertain, and partially observable environments.

๐Ÿ” Core Focus Areas

  • LLMs for summarizing environment states and guiding agents
  • Translating textual inputs into actionable policies
  • Facilitating human-AI collaboration in dynamic tasks

Safe RL Diagram

๐Ÿ”ฌ Application Domains

Domain Description
โšก Smart Energy Systems Volt-VAR control, DER coordination, and federated DRL for power grid stability
๐Ÿš˜ Autonomous Systems Safe navigation, adaptive planning, and control in simulation and real-world driving environments
๐Ÿ›ก Secure AI for Infrastructure Resilience against cyber-attacks and adversarial scenarios in safety-critical systems

๐Ÿ“š Selected Publications

  • ๐Ÿ“ Journal Papers
  • ๐ŸŽค Conference Papers
  1. Kundan Kumar, Gelli Ravikumar
    Physics-based Deep Reinforcement Learning for Grid-Resilient Volt-VAR Control (Under Review)
    IEEE Transactions on Smart Grid, 2025
    Paper Code Poster

  1. Kundan Kumar, Gelli Ravikumar
    Transfer Learning Enhanced Deep Reinforcement Learning for Volt-Var Control in Smart Grids
    IEEE PES Grid Edge Technologies Conference & Exposition, 2025
    Paper Code Poster

  2. Kundan Kumar, Aditya Akilesh Mantha, Gelli Ravikumar
    Bayesian Optimization for Deep Reinforcement Learning in Robust Volt-Var Control
    IEEE PES General Meeting, 2024
    Paper Code Poster

  3. Kundan Kumar, Gelli Ravikumar
    Deep RL-based Volt-VAR Control and Attack Resiliency for DER-integrated Distribution Grids
    IEEE ISGT, 2024
    Paper Code Poster

  4. JK Francis, C Kumar, J Herrera-Gerena, Kundan Kumar, MJ Darr
    Deep Learning and Pattern-based Methodology for Multivariable Sensor Data Regression
    IEEE ICMLA, 2022
    Paper Code Poster

  5. Kin Gwn Lore, Nicholas Sweet, Kundan Kumar, N Ahmed, S Sarkar
    Deep Value of Information Estimators for Collaborative Human-Machine Information Gathering
    ACM/IEEE ICCPS, 2016
    Paper Code Poster


๐Ÿงช Ongoing Projects

  • Federated DRL for Cyber-Resilient Volt-VAR Optimization
    Exploring decentralized, communication-efficient coordination among DERs using LSTM-enhanced PPO agents.

  • One-Shot Policy Transfer with Physics Priors
    Training grid agents in simpler environments and adapting to complex grids (IEEE 123-bus, 8500-node) in few iterations.

  • LLM-Guided Autonomous Planning for Smart Buildings
    Integrating OpenAI and Claude with CityLearn to convert user prompts into interpretable policy instructions.

ยฉ 2025 Kundan Kumar โˆ™ Made with Quarto

  • Edit this page
  • Report an issue
  • Contact