MASK evals with small models

Jul 9

Written By magfrump .

This is a link post for my recent work on the MASK honesty benchmark, posted on github.

aitechnicalactual code

magfrump .

Belegarth Video Analysis

Sandbagging thought experiment