Does RL Actually Make LLMs Smarter? A Critical Look at Reinforcement Learning for Reasoning
Recent research suggests RL training optimizes search efficiency over existing capabilities rather than expanding reasoning capacity. Here's what the pass@k evidence actually shows.