The potential for AI systems to cause large-scale harm or failure due to unforeseen vulnerabilities, operational errors, or misuse.
Generality: 775
Process of ensuring that an AI system's goals and behaviors are consistent with human values and ethics.
Generality: 790