Sitemap - 2024 - AI Lab Watch
Model evals for dangerous capabilities
Safety consultations for AI lab employees
Anthropic's Certificate of Incorporation
Maybe Anthropic's Long-Term Benefit Trust is powerless
AI companies aren't really using external evaluators
New voluntary commitments (AI Seoul Summit)
DeepMind’s “Frontier Safety Framework” is weak and unambitious