Self-correction for LLMs
Review of Self-Correction for LLMs [1] distinguishes two types of self-correction based on the source of feedback: (1) Intrinsic and (2) External. [2] and [3] belong to intrinsic self-correction, where the domain provides ground-truth supervision (i.e., oracle labels) that can be leveraged during training. According to [1], when these oracle labels are not available, the performance improvements of intrinsic self-correction disappear, which contrasts with [2] and [3], where such labels are present and yield gains....