New Startup Funded by Anthropic and OpenAI Set to Test AI's Malicious Potential
Anthropic and OpenAI fund a $450 million startup to test AI models for malicious behavior.
- Anthropic and OpenAI jointly fund $450 million startup
- Startup will test AI for potential malicious actions
- Focus on proactive identification of AI risks
- Critical response to growing concerns about AI safety
Key details
In a significant move to enhance AI safety, Anthropic and OpenAI have jointly funded a new startup with $450 million, aimed at rigorously testing AI models for their potential for malicious behavior. The initiative comes amid increasing concern over the capabilities of advanced AI systems, particularly in areas such as misinformation and cybersecurity.
The startup will use simulations to evaluate how far AI models such as ChatGPT can be leveraged for harmful purposes. The testing is intended not merely to characterize current behavior but to predict and mitigate future risks as AI technologies evolve.
As AI's role in society grows, effective oversight and testing become paramount, and industry leaders have emphasized the initiative's significance. According to Anthropic's co-founder, the goal is to "proactively identify and mitigate risks before they manifest in the real world." OpenAI's leadership has likewise stressed the importance of collaborating with startups focused on maintaining ethical standards in AI development.
This funding marks a pivotal step as Anthropic and OpenAI continue to navigate the complex landscape of AI ethics and security. With ongoing advancements in AI's capabilities, the commitment to preemptive testing and evaluation will be critical in ensuring that these technologies do not become conduits for harm.
As the startup begins its work, its broader implications for AI governance and safety will be closely watched by industry stakeholders and regulatory bodies alike, eager to see how its findings shape future AI deployment standards.