More than Marketing? On the Information Value of AI Benchmarks for Practitioners
Amelia Hardy*, Anka Reuel*, Kiana Jafari Meimandi, Lisa Soder, Allie Griffith, Dylan M Asmar, Sanmi Koyejo, Michael S Bernstein, Mykel J Kochenderfer
Paper
We present a qualitative, interview-based study on how AI benchmarks are used in practice for decision-makers in research, product, and policy roles.