MASK evals with small models Jul 9 Written By magfrump . This is a link post for my recent work on the MASK honesty benchmark, posted on github. aitechnicalactual code magfrump .
MASK evals with small models Jul 9 Written By magfrump . This is a link post for my recent work on the MASK honesty benchmark, posted on github. aitechnicalactual code magfrump .