Kundan Kumar
  • About
  • CV
  • Research
  • Projects & Notes
    • ✨ All Projects

    • πŸ“š Notes
    • Large Language Models
    • Deep & Machine Learning
    • Statistical Analysis
    • AI Alignment and Safety

    • 🧰 Resources
    • AI for Smart Energy Systems
    • ResearchScientist-Handbook
  • Teaching
  • Blogs

On this page

  • Research
    • Mission
  • Research Focus
  • Publications
  • Ongoing Projects
  • Edit this page
  • Report an issue

Research

I develop safe, interpretable, and adaptive AI systems for real-world cyber-physical environments that must operate under uncertainty, strict physical constraints, and adversarial conditions. My work sits at the intersection of safe reinforcement learning, physics-informed AI, probabilistic modeling, and AI safety evaluation with a core focus on building systems that are robust, verifiable, and deployable at scale in safety-critical domains.

Vision

Full Publications β†’

My research centers around three core thrusts:

  • Safe & Trustworthy Reinforcement Learning: Designing agents that remain reliable under sensor noise, hardware faults, non-stationarity, and adversarial perturbations β€” including constraint-aware learning, certified robustness, and safe exploration in safety-critical settings.
  • Physics-Informed AI & Probabilistic Modeling: Embedding physical laws, invariants, and feasibility constraints directly into model architectures; quantifying epistemic and aleatoric uncertainty for risk-aware planning in partially observable environments.
  • AI Safety, Alignment & LLM Evaluation: Designing behavioral and mechanistic evaluations to detect deceptive alignment, reward hacking, and eval-awareness in LLM agents; building scalable oversight frameworks for agentic systems in safety-critical deployments.

Mission

By integrating physics-guided structure, probabilistic reasoning, and safe reinforcement learning, my goal is to build the next generation of AI systems that are:

  • Reliable β€” under uncertainty, noise, and adversarial conditions
  • Generalizable β€” across tasks, scales, and distribution shifts
  • Interpretable β€” to operators, engineers, and decision-makers
  • Deployable β€” in large-scale, real-world cyber-physical environments

Research Focus

These focus areas organize my ongoing and recent projects that bridge fundamental methods and deployable systems.

Safe & Intelligent Systems for Smart Energy

Physics-informed deep RL, safe and uncertainty-aware control, LLM-guided planning, and sim-to-real transfer for smart grid and DER-integrated energy systems.

Volt-VAR Control Physics-Informed RL Uncertainty Quantification LLM Control Domain Adaptation
AI Safety & Alignment

Behavioral and mechanistic evaluations for deceptive alignment, adversarial robustness, reward hacking detection, and scalable oversight for agentic LLM systems.

Evaluation Awareness Red-Teaming Scalable Oversight Monitor Transferability
Vision & Simulation for Autonomous Systems

End-to-end perception-control pipelines, multimodal sensor fusion, and sim-to-real transfer using CARLA, AirSim, and OpenDSS for autonomous and cyber-physical systems.

CARLA Perception-Control Multimodal Fusion Sim-to-Real

Publications

  • Journal Papers
  • Conference Papers

Journal Papers Total: 2

  1. Arif Hussian, Kundan Kumar, Gelli Ravikumar
    Bayesian-optimized bidirectional long-short-term memory network for wind power forecasting with uncertainty quantification , Electric Power Systems Research, 2026
    Paper Code ️Poster
  2. Kundan Kumar, Gelli Ravikumar
    Physics-based Deep Reinforcement Learning for Grid-Resilient Volt-VAR Control (Under Review), IEEE Transactions on Smart Grid, 2025
    Paper Code Poster

Conference Papers Total: 7

  1. Kundan Kumar, Gelli Ravikumar
    A Multi-Objective Optimization Framework for Carbon-Aware Smart Energy Management , IEEE North American Power Symposium (NAPS), 2025
    Paper Presentation
  2. Kundan Kumar, Kumar Utkarsh, Wang Jiyu, Padullaparti Harsha
    Advanced Semi-Supervised Learning With Uncertainty Estimation for Phase Identification in Distribution Systems , IEEE PES General Meeting, 2025
    Paper Presentation Poster
  3. Kundan Kumar, Gelli Ravikumar
    Transfer Learning Enhanced Deep Reinforcement Learning for Volt-Var Control in Smart Grids , IEEE PES Grid Edge Technologies Conference & Exposition, 2025
    Paper Poster
  4. Kundan Kumar, Aditya Akilesh Mantha, Gelli Ravikumar
    Bayesian Optimization for Deep Reinforcement Learning in Robust Volt-Var Control , IEEE PES General Meeting, 2024
    Paper Poster
  5. Kundan Kumar, Gelli Ravikumar
    Deep RL-based Volt-VAR Control and Attack Resiliency for DER-Integrated Distribution Grids , IEEE Innovative Smart Grid Technologies (ISGT), 2024
    Paper Poster
  6. JK Francis, C Kumar, J Herrera-Gerena, Kundan Kumar, MJ Darr
    Deep Learning and Pattern-based Methodology for Multivariable Sensor Data Regression , IEEE ICMLA, 2022
    Paper
  7. Kin Gwn Lore, Nicholas Sweet, Kundan Kumar, N Ahmed, S Sarkar
    Deep Value of Information Estimators for Collaborative Human-Machine Information Gathering , ACM/IEEE ICCPS, 2016
    Paper

Ongoing Projects

Smart Energy
Federated DRL for Cyber-Resilient Volt-VAR Optimization
Decentralized, communication-efficient control using LSTM-enhanced PPO agents across distributed DERs.
AI Safety
Stress-Testing Eval-Aware Safety Monitors
Probing LLM agents for eval-awareness via CAA on Nemotron-49B and Mistral; extending replication to silent model organisms.
LLM Control
LLM-Guided Autonomous Planning for Smart Buildings
Converting user prompts to interpretable control policies using LLMs (OpenAI, Claude) in CityLearn environments.

Β© 2026 Kundan Kumar βˆ™ Made with Quarto

  • Edit this page
  • Report an issue
  • Contact