Beyond Instruction Following: Evaluating Inferential Rule Following of Large Language Models

Abstract

Although Large Language Models (LLMs) have demonstrated strong ability, theyare further supposed to be controlled and guided by in real-world scenarios tobe safe, accurate, and intelligent. This demands the possession of capabilityof LLMs. However, no prior work has made a clear evaluation of the inferentialrule-following capability of LLMs. Previous studies that try to evaluate theinferential rule-following capability of LLMs fail to distinguish theinferential rule-following scenarios from the instruction-following scenarios.Therefore, this paper first clarifies the concept of inferential rule-followingand proposes a comprehensive benchmark, RuleBench, to evaluate a diversifiedrange of inferential rule-following abilities. Our experimental results on avariety of LLMs show that they are still limited in following rules. Ouranalysis based on the evaluation results provides insights into theimprovements for LLMs toward a better inferential rule-following intelligentagent. We further propose Inferential Rule-Following Tuning (IRFT). Theexperimental results show that through IRFT, LLMs can learn abstractrule-following abilities from purely synthetic data and then generalize toRuleBench. The data and code can be found at:https://anonymous.4open.science/r/llm-rule-following-B3E3/

Quick Read (beta)

loading the full paper ...