industry

Learning from human preferences (openai.com)

openai.com · 9 years ago
Describes an algorithm that infers human preferences by comparing pairs of proposed behaviors, reducing the need for manually written goal functions in AI systems and lowering risks from goal misspecification.
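
To make the idea of learning from pairwise comparisons concrete, here is a minimal sketch, not the linked post's implementation. It assumes a linear reward over hand-picked feature vectors, a logistic (Bradley-Terry style) model of which behavior gets preferred, and synthetic labels produced by a hidden weight vector standing in for the human. The names `reward`, `fit_reward`, and `true_w` are hypothetical.

```python
import math
import random


def reward(weights, features):
    """Linear reward: dot product of weights and behavior features (an assumption)."""
    return sum(w * x for w, x in zip(weights, features))


def fit_reward(comparisons, n_features, lr=0.1, epochs=50):
    """Fit reward weights from (preferred, rejected) feature pairs.

    Models P(preferred beats rejected) = sigmoid(r(preferred) - r(rejected))
    and does stochastic gradient ascent on the log-likelihood.
    """
    weights = [0.0] * n_features
    for _ in range(epochs):
        for preferred, rejected in comparisons:
            diff = reward(weights, preferred) - reward(weights, rejected)
            p = 1.0 / (1.0 + math.exp(-diff))
            # Gradient of log(p) w.r.t. weights is (1 - p) * (preferred - rejected).
            for i in range(n_features):
                weights[i] += lr * (1.0 - p) * (preferred[i] - rejected[i])
    return weights


# Synthetic stand-in for human feedback: a hidden preference that likes
# feature 0 and dislikes feature 1 (purely illustrative).
rng = random.Random(0)
true_w = [2.0, -1.0]
comparisons = []
for _ in range(200):
    a = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    b = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    if reward(true_w, a) >= reward(true_w, b):
        comparisons.append((a, b))
    else:
        comparisons.append((b, a))

learned = fit_reward(comparisons, n_features=2)

# Fraction of comparisons where the learned reward agrees with the label.
agreement = sum(
    reward(learned, p) > reward(learned, r) for p, r in comparisons
) / len(comparisons)
```

No goal function is written by hand here: the weights are recovered only from which behavior of each pair was preferred. In a real system the comparisons would come from people and the reward model would typically be richer than a linear one.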
