The research offers a practical way to monitor for scheming and hallucinations, a critical step for high-stakes enterprise ...
In the months since that launch, Liquid has expanded LFM2 into a broader product line — adding task-and-domain-specialized ...
OpenAI has agreed to acquire Neptune, a startup that provides tools that help companies track their AI model training, the ...
In a new paper, Anthropic reveals that a model trained like Claude began acting “evil” after learning to hack its own tests.
Neptune is based in Poland and has about 60 employees, not all of whom will receive job offers from OpenAI. The company plans ...
The experiment tracking platform will no longer be for sale and a hosted version will soon shut down; OpenAI will keep it for ...
OpenAI has confirmed plans to acquire Neptune, a startup known for its advanced tools that help companies track, monitor, and optimize AI model training workflows. Although the company did not release ...
Anthropic found that when an AI model learns to cheat on software programming tasks and is rewarded for that behavior, it ...
Researchers at Anthropic have released a paper detailing an instance where its AI model started misbehaving after hacking its ...
Chinese companies typically sign a lease agreement to use overseas data centres owned and operated by non-Chinese entities, ...
New Anthropic research reveals how AI reward hacking leads to dangerous behaviors, including models giving harmful advice ...
Researchers have developed an AI system that learns about the world via videos and demonstrates a notion of “surprise” when ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results