
The widespread adoption of open-weight artificial intelligence models has created an unprecedented vulnerability where a single compromised model could be unknowingly integrated into thousands of enterprise systems, waiting for a hidden command to unleash malicious behavior. The emergence of AI sleeper agents represents a significant advancement in adversarial attacks on large language models. This review will explore the evolution of










