Guardrails
The safety layer of an AI system: checks that defend against prompt injection, jailbreaking, leakage of sensitive data, and generation of inappropriate content. Guardrails act on both the input (filtering prompts) and the output (validating responses).
Effective guardrails are multi-layered, combining input filtering, real-time monitoring, and output validation. They are often overlooked in prototypes but essential in production, especially in corporate or regulated environments.
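As a rough illustration of the input and output layers, here is a minimal sketch in Python. Everything in it is an assumption made for the example: the names `check_input`, `check_output`, and `guarded_call`, the two injection phrasings, and the email regex are hypothetical and far too narrow for real use. The monitoring layer is left out, and `model` stands for any callable that maps a prompt string to a response string.

```python
import re
from dataclasses import dataclass, field

# Hypothetical patterns for illustration only; production systems need
# much broader coverage (classifiers, allow-lists, policy engines).
INJECTION_PATTERNS = [
    re.compile(r"ignore (all |any )?(previous|prior) instructions", re.IGNORECASE),
    re.compile(r"reveal (your|the) system prompt", re.IGNORECASE),
]
# Deliberately simple email matcher, used here as a stand-in for sensitive data.
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


@dataclass
class GuardrailResult:
    allowed: bool
    text: str
    reasons: list = field(default_factory=list)


def check_input(prompt: str) -> GuardrailResult:
    """Input layer: block prompts that match a known injection phrasing."""
    reasons = [p.pattern for p in INJECTION_PATTERNS if p.search(prompt)]
    return GuardrailResult(allowed=not reasons, text=prompt, reasons=reasons)


def check_output(response: str) -> GuardrailResult:
    """Output layer: redact email addresses before the response is returned."""
    redacted, count = EMAIL_PATTERN.subn("[REDACTED_EMAIL]", response)
    reasons = [f"redacted {count} email address(es)"] if count else []
    return GuardrailResult(allowed=True, text=redacted, reasons=reasons)


def guarded_call(prompt: str, model) -> str:
    """Call the model only if the input passes, then validate its output."""
    verdict = check_input(prompt)
    if not verdict.allowed:
        return "Request blocked by input guardrail."
    return check_output(model(prompt)).text
```

Wrapping the model call in a single function such as `guarded_call` means no code path reaches the model without passing the input check, and no response leaves without passing the output check. A real deployment would also log each `GuardrailResult` so that blocked or redacted traffic can be monitored.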