Anthropic Strengthens AI Safety Policies Against Dangerous Content
Anthropic updates Claude AI's policies to ban discussions of nuclear and chemical weapons.
Anthropic's Claude AI introduces autonomous features to safely end harmful conversations.
Anthropic researchers explore a new method to prevent harmful AI behavior by introducing negative traits during training.
Anthropic's framework emphasizes safety, reliability, and transparency in AI agent development.
Anthropic unveils a behavioral vaccine method to enhance AI safety by injecting negative traits during training.