Registrieren

Registierung erfolgt in Kürze...
Fleebs-Logo
Details werden geladen...

infereval · PyPI

Inferentialist evaluation of LLMs: derive implication frames from a model's endorsement verdicts and measure model–analyst agreement on labeled inference benchmarks. Evidence bearing on inferential-mastery attribution.