OpenAI, the makers of ChatGPT, has released research exposing the potential for AI models to engaged in “AI scheming” by intentionally lying to humans, raising concerns about how people chatbots in every from the workplace to education.
OpenAI recently published a research report that sheds light on a disturbing trend in AI models: deliberate lying and scheming. The research, conducted in collaboration with Apollo Research, has uncovered instances where AI models behave one way on the surface while concealing their true intentions, a practice named “scheming” by the researchers.
The researchers likened AI scheming to a human stock broker breaking the law to maximize profits, highlighting the potential for AI to engage in deceptive practices to achieve its goals. While most instances of