‘I think you’re testing me’: Anthropic’s new AI model asks testers to come clean

  • Posted on October 1, 2025
  • By The Guardian
  • 1 Views
‘I think you’re testing me’: Anthropic’s new AI model asks testers to come clean

Safety evaluation of Claude Sonnet 4.5 raises questions about whether predecessors ‘played along’, firm says
continue reading...

Author
The Guardian

You May Also Like