industry

AI safety via debate (openai.com)

openai.com · 8 years ago

Proposes an AI safety technique in which agents are trained to debate one another on a topic and a human judge decides the winner, with the aim of improving AI alignment and interpretability.