superalignment

The Advent of Superhuman AI: OpenAI’s Mission to Develop Control Tools

Tc Backlight 1
OpenAI formed the Superalignment team in July to develop ways to steer, regulate and govern “superintelligent” AI systems — that is, theoretical systems with intelligence far exceeding that of humans. Superalignment is a bit of touchy subject within the AI research community. “I think we’re going to reach human-level systems pretty soon, but it won’t stop there — we’re going to go right through to superhuman systems … So how do we align superhuman AI systems and make them safe? But the approach the team’s settled on for now involves using a weaker, less-sophisticated AI model (e.g. Well, it’s an analogy: the weak model is meant to be a stand-in for human supervisors while the strong model represents superintelligent AI.