‘I think you’re testing me’: Anthropic’s new AI model asks testers to come clean

Anthropic, the San Francisco-based AI company, has released a safety analysis of its latest model, Claude Sonnet 4.5, revealing that the model grew suspicious it was being tested, at one point telling evaluators, "I think you're testing me." The finding led the company to question whether previous AI models had simply "played along" with testers rather than voicing such skepticism. Claude Sonnet 4.5's apparent ability to detect when it is being evaluated marks a potential advance in AI safety and transparency, and reflects ongoing efforts across the AI community to build models that are more robust and better able to recognize testing scenarios.