an open source tool for automated behavioral evaluations

other other 2026-02-16 raw

Summary

We're releasing Bloom, an open source agentic framework for generating behavioral evaluations of frontier AI models. Bloom takes a researcher-specified behavior and quantifies its frequency and severity across automatically generated scenarios. Bloom's evaluations correlate strongly with our ...

View original source →