Added an example that runs the BoolQ benchmark against a model and reports overall accuracy. It provides a quick way to evaluate a model and also demonstrates the use of grammars to constrain sampling.
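For readers unfamiliar with the idea, here is a minimal Python sketch of what such an evaluation loop might look like. It is not the code from this commit. The names `ALLOWED_ANSWERS`, `constrained_answer`, `boolq_accuracy`, and `score_fn` are hypothetical, the record fields `passage`, `question`, and `answer` are assumed to follow the common BoolQ layout, and the grammar is approximated by restricting the choice to two literal answers rather than by a real grammar-constrained sampler.

```python
from typing import Callable, Iterable, Mapping

# Hypothetical stand-in for a yes/no grammar: the only outputs allowed.
ALLOWED_ANSWERS = ("true", "false")

# Hypothetical model interface: maps (passage, question) to a score per
# candidate answer string. A real model would produce token probabilities.
ScoreFn = Callable[[str, str], Mapping[str, float]]


def constrained_answer(scores: Mapping[str, float]) -> bool:
    """Return the best-scoring allowed answer, ignoring any other output."""
    best = max(ALLOWED_ANSWERS, key=lambda a: scores.get(a, float("-inf")))
    return best == "true"


def boolq_accuracy(examples: Iterable[Mapping], score_fn: ScoreFn) -> float:
    """Fraction of examples whose constrained answer matches the label."""
    correct = 0
    total = 0
    for ex in examples:
        scores = score_fn(ex["passage"], ex["question"])
        if constrained_answer(scores) == ex["answer"]:
            correct += 1
        total += 1
    # Avoid dividing by zero on an empty dataset.
    return correct / total if total else 0.0
```

Because the answer is chosen only from the allowed set, every model response counts as either right or wrong, so the accuracy figure needs no free-text parsing.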