View: Anthropic’s Creative Solution to Illicit Answers from AI
Such is the case with Anthropic and its latest research which demonstrates an interesting vulnerability in current LLM technology.
Of course given progress in open-source AI technology, you can spin up your own LLM locally and just ask it whatever you want, but for more consumer-grade stuff this is an issue worth pondering.
But the closer we get to more generalized AI intelligence, the more it should resemble a thinking entity, and not a computer that we can program, right?
If so, we might have a harder time nailing down edge cases to the point when that work becomes unfeasible?
Anyway, let’s talk about what Anthropic recently shared.