Kundan Kumar

Research

Research Vision

I aim to advance the frontier of safe, interpretable, and adaptive AI for cyber-physical systems operating under uncertainty and dynamic constraints. My research sits at the intersection of machine learning, optimization, and control theory, with a particular focus on:

  • Physics-informed deep reinforcement learning (DRL)
  • Probabilistic and Bayesian modeling
  • Large language models (LLMs) for autonomous reasoning
  • Vision-based simulation environments

By tightly integrating domain knowledge into learning frameworks, I design agents capable of robust decision-making in real-world, high-stakes environments such as smart grids, robotics, and intelligent infrastructure.


Research Highlights

  • Proposed the first physics-informed LSTM-PPO agent for Volt-VAR control on 8500-node networks.
  • Achieved a 98% reduction in voltage violations and 3× faster convergence in federated DRL.
  • Developed one-shot transfer learning for control agents in complex topologies.
  • Integrated LLM-guided planning into multi-building simulations via CityLearn.
  • Built resilient DRL systems that withstand adversarial and distributional attacks.

Research Focus Areas

  • Safe & Trustworthy Reinforcement Learning
  • Transfer Learning & Meta-Adaptation
  • Vision-Simulation Integration
  • LLM-Augmented Decision Systems

Safe & Trustworthy Reinforcement Learning

Objective

Develop control agents that guarantee system safety, stability, and robust learning in dynamic, uncertain, and partially observable environments.

Core Focus Areas

  • Constrained policy optimization and reward shaping
  • Physics-based priors in DRL
  • Adversarial resilience and anomaly detection
  • Epistemic and aleatoric uncertainty quantification

[Figure: Safe RL diagram]
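To make the reward-shaping idea above concrete, here is a minimal sketch of a Lagrangian-style penalty on voltage-limit violations. The `shaped_reward` helper, the per-unit bounds, and the multiplier `lam` are illustrative assumptions, not the agents described in this section.

```python
import numpy as np

# Hypothetical per-unit voltage bounds; real limits depend on the feeder.
V_MIN, V_MAX = 0.95, 1.05

def shaped_reward(base_reward, bus_voltages, lam=10.0):
    """Lagrangian-style reward shaping: subtract a penalty proportional
    to the total constraint violation summed over all buses."""
    v = np.asarray(bus_voltages, dtype=float)
    violation = np.maximum(0.0, v - V_MAX) + np.maximum(0.0, V_MIN - v)
    return base_reward - lam * violation.sum()

# A voltage profile with one bus 0.02 p.u. above the upper limit.
r = shaped_reward(1.0, [1.00, 1.07, 0.98], lam=10.0)
```

Tuning `lam` (or adapting it online, as constrained policy optimization does) trades off task reward against constraint satisfaction.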


Transfer Learning & Meta-Adaptation

Objective

Enable rapid generalization across distribution shifts in topology, weather, or load profiles.

Core Focus Areas

  • Transferable actor-critic architectures
  • Simulation-to-real (Sim2Real) adaptation
  • Meta-RL for sample efficiency

[Figure: Transfer learning diagram]
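The transferable-architecture idea can be pictured as warm-starting a target policy from source weights, copying only the parameters whose shapes match and leaving the topology-specific output head to be retrained. The two-layer dict-of-arrays policy below is a hypothetical simplification of an actor network.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical policy weights trained on a small source topology.
source = {
    "hidden": rng.normal(size=(16, 32)),   # shared feature layer
    "out": rng.normal(size=(32, 4)),       # 4 source-grid actions
}

# Target topology: same feature layer, different action dimension.
target = {
    "hidden": np.zeros((16, 32)),
    "out": np.zeros((32, 6)),              # 6 target-grid actions
}

def warm_start(target, source):
    """Copy every parameter whose shape matches; leave the rest
    (here, the output head) to be trained on the new topology."""
    copied = []
    for name, w in source.items():
        if name in target and target[name].shape == w.shape:
            target[name] = w.copy()
            copied.append(name)
    return copied

copied = warm_start(target, source)  # only the shared layer transfers
```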


Vision-Simulation Integration

Objective

Bridge the gap between perception and control by combining synthetic sensors, simulated environments, and end-to-end learning pipelines.

Core Focus Areas

  • Perception-action loops with CARLA and AirSim
  • Multi-modal representation fusion (image + state)
  • Autonomous control with embedded perception
  • End-to-end control pipelines

[Figure: Vision-simulation diagram]
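A minimal sketch of image + state fusion, with a toy mean-pooling encoder standing in for the learned CNN an end-to-end pipeline would use; the array shapes and state values are illustrative only.

```python
import numpy as np

def fuse(image, state):
    """Toy multi-modal fusion: pool a camera frame into per-channel
    features, then concatenate them with the low-dimensional state
    vector. A learned encoder would replace the mean-pooling here."""
    img_feat = image.mean(axis=(0, 1))     # (C,) per-channel average
    return np.concatenate([img_feat, state])

frame = np.zeros((64, 64, 3))              # dummy RGB observation
state = np.array([0.98, 1.02, 0.5])        # dummy sensor readings
z = fuse(frame, state)                     # fused feature vector
```

A policy network would then map the fused vector `z` to actions, so gradients flow through both modalities during training.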


LLM-Augmented Decision Systems

Objective

Empower agents to interpret language-based inputs and coordinate intelligently in multi-agent and human-AI settings.

Core Focus Areas

  • LLMs for summarizing states and guiding actions
  • Translating natural language into policy primitives
  • Facilitating human-AI collaboration

[Figure: LLM-guided control diagram]
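As a toy illustration of mapping language to policy primitives, the sketch below uses a keyword matcher where a real system would query an LLM; the primitive names and trigger phrases are hypothetical, and the safe `NO_OP` fallback reflects the safety-first framing above.

```python
# Hypothetical mapping from instruction phrases to discrete primitives.
PRIMITIVES = {
    "reduce voltage": "LOWER_TAP",
    "raise voltage": "RAISE_TAP",
    "shed load": "CURTAIL_LOAD",
}

def to_primitive(instruction: str) -> str:
    """Translate a free-form instruction into a policy primitive,
    falling back to a safe no-op when nothing matches."""
    text = instruction.lower()
    for phrase, action in PRIMITIVES.items():
        if phrase in text:
            return action
    return "NO_OP"

a = to_primitive("Please reduce voltage at feeder 7")  # -> "LOWER_TAP"
```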






Application Domains

  • Smart Energy Systems: Volt-VAR control, DER coordination, and federated DRL for power grid stability
  • Autonomous Systems: safe navigation, adaptive planning, and control in simulation and real-world environments
  • Secure AI for Infrastructure: resilience against cyber-attacks and adversarial scenarios in safety-critical systems


Publications

Journal Papers

  1. Kundan Kumar, Gelli Ravikumar
     Physics-based Deep Reinforcement Learning for Grid-Resilient Volt-VAR Control (Under Review)
     IEEE Transactions on Smart Grid, 2025

Conference Papers

  1. Kundan Kumar, Gelli Ravikumar
     Transfer Learning Enhanced Deep Reinforcement Learning for Volt-VAR Control in Smart Grids
     IEEE PES Grid Edge Technologies, 2025

  2. Kundan Kumar, Aditya Akilesh Mantha, Gelli Ravikumar
     Bayesian Optimization for DRL in Robust Volt-VAR Control
     IEEE PES General Meeting, 2024

  3. Kundan Kumar, Gelli Ravikumar
     Volt-VAR Control and Attack Resiliency using Deep RL
     IEEE ISGT, 2024

  4. JK Francis, C Kumar, J Herrera-Gerena, Kundan Kumar, MJ Darr
     Sensor Data Regression using Deep Learning & Patterns
     IEEE ICMLA, 2022

  5. Kin Gwn Lore, Nicholas Sweet, Kundan Kumar, et al.
     Deep Value of Information Estimators for Human-Machine Collaboration
     ACM/IEEE ICCPS, 2016


Ongoing Projects

  • Federated DRL for Cyber-Resilient Volt-VAR Optimization
    Decentralized, communication-efficient control using LSTM-enhanced PPO agents across distributed DERs.

  • One-Shot Policy Transfer with Physics Priors
    Train agents on small topologies and adapt them to IEEE 123-bus and 8500-node networks in a few iterations.

  • LLM-Guided Autonomous Planning for Smart Buildings
    Convert user prompts into interpretable control policies using LLMs (OpenAI, Claude) in CityLearn environments.

© 2025 Kundan Kumar ∙ Made with Quarto
