The Advent of Superhuman AI: OpenAI’s Mission to Develop Control Tools
OpenAI formed the Superalignment team in July to develop ways to steer, regulate and govern “superintelligent” AI systems — that is, theoretical systems with intelligence far exceeding that of humans.
Superalignment is a bit of touchy subject within the AI research community.
“I think we’re going to reach human-level systems pretty soon, but it won’t stop there — we’re going to go right through to superhuman systems … So how do we align superhuman AI systems and make them safe?
But the approach the team’s settled on for now involves using a weaker, less-sophisticated AI model (e.g.
Well, it’s an analogy: the weak model is meant to be a stand-in for human supervisors while the strong model represents superintelligent AI.