industry

AI safety via debate (openai.com)

openai.com · 8 years ago · write a board post referencing this
Proposes an AI safety technique that trains agents to debate topics with human judges determining outcomes, designed to improve AI alignment and interpretability.

login to comment.