Kundan Kumar
  • About
  • CV
  • Research
  • Projects & Notes
    • ✨ All Projects

    • Deep Reinforcement Learning
    • Computer Vision & LLMs
    • Robotics

    • 📚 Notes
    • Large Language Models
    • Deep & Machine Learning
    • Statistics
    • AI for Smart Grids
    • AI Security
    • ResearchScientist-Handbook
  • Teaching
  • Blogs

Kundan Kumar

Research Scientist | Safe Reinforcement Learning, Multi-Agent Systems & LLM Agents

Email LinkedIn GitHub YouTube Substack Scholar

Hi! I’m Kundan Kumar, a Ph.D. candidate in Computer Science at Iowa State University. My research focuses on creating intelligent and adaptable AI systems for next-generation cyber-physical infrastructure, integrating deep reinforcement learning (DRL), multi-agent systems, large language models (LLMs), and computer vision.

I develop safety-critical DRL frameworks that incorporate domain knowledge and uncertainty, enabling reliable decision-making in complex environments. Recent projects include exploring transfer learning and enhancing adversarial resilience across systems. I also create LLM-integrated simulation frameworks for autonomous systems, combining perception, trajectory planning, and natural language reasoning.

Outside of research, I share insights on Substack and YouTube. I enjoy cooking and ice skating 🛼 in my free time.


Projects, Notes, and Interests

LLM Reasoning & Agents
Agentic workflows (LangChain/LangGraph), tool-use, memory, retrieval, planning & reflection loops for robust multi-step reasoning.
Statistical ML
Uncertainty quantification, probabilistic modeling, and data-driven inference in dynamic environments.
Autonomous Perception & Control
Vision-based perception (detection, segmentation, sensor fusion) integrated with learning-based control and trajectory planning for autonomous systems.

Explore My Work

Blog

 
How to handle class-unbalanced data?
No matching items

See all →

Talks

Transfer Learning in Deep Reinforcement Learning for Scalable VVC in Smart Grids
Transferring knowledge from one grid to another grid
No matching items

See all →

Publications

Bayesian-Optimized Bidirectional Long-Short-Term Memory network for Wind Power Forecasting with Uncertainty Quantification
year 2025
No matching items

See all →

Projects

Collaboration and Competition
No matching items

See all →


News Highlights

[Sep 2025] Our paper on Bayesian-Optimized Bidirectional Long-Short-Term Memory network for Wind Power Forecasting with Uncertainty Quantification has been accepted to Journal on Electric Power Systems Research 2025.
[Jul 2025] Selected for the Cohere Machine Learning Summer School, hosted by Cohere Labs.
[Mar 2025] Our paper on Advanced Semi-Supervised Learning with Uncertainty Estimation for Phase Identification in Distribution Systems has been accepted to IEEE PES General Meeting 2025.

© 2025 Kundan Kumar ∙ Made with Quarto

  • Edit this page
  • Report an issue
  • Contact