Preface

This book is a comprehensive guide on Safe AI, bringing together the foundations of machine learning security, adversarial robustness, AI alignment, and trustworthy deployment practices for modern cyber-physical systems (CPS) and LLM-based autonomous agents.
It is written as a hybrid between a research handbook, a course textbook, and a practical engineering guide for building and securing intelligent systems.

The material integrates insights from academic literature, real-world deployment case studies, and hands-on adversarial evaluations—spanning from classical adversarial machine learning to emerging red/blue-team techniques for large-scale language models.
Special emphasis is placed on CPS, where failures in AI behavior directly influence physical infrastructure such as power grids, transportation systems, and industrial control environments.

Each chapter begins with explicit learning objectives, conceptual explanations, and diagrams, followed by practical examples, research notes, and additional curated references.
The aim is to equip the reader with the full intellectual toolkit required to design, analyze, attack, defend, and align AI systems operating in safety-critical settings.

The intended audience should be comfortable with programming and have working familiarity with the following concepts and methods:

This book may serve students, researchers, engineers, and practitioners seeking a deep and structured understanding of modern AI security—from the mathematical fundamentals to frontier challenges in alignment, LLM safety, and real-world CPS integration.


Author

Kundan Kumar
https://kundan-kumarr.github.io/


Citation

Kumar, K. (2025). Safe AI for Cyber-Physical & Intelligent Systems:
Model Security, Adversarial Robustness, Agent Safety, and Trustworthy Methods
.
Edition 2025-11.


License

This work is licensed under the MIT License.