Abstract: In this article, we present BenchING, a new benchmark for evaluating large language models (LLMs) on their ability to follow structured output format instructions in text-based procedural ...
Abstract: Myers Briggs Type Indicator (MBTI) is well-known instrument for personality evaluation and is frequently being employed in the areas of personal development, career counselling or team ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results