The Value Axis: Language Models Encode Whether They're on the Right Track
Problem This work addresses the gap in understanding how language models internally assess the value of their ongoing strategies, particularly in the context of reinforcement learning. The authors investigate whether...