RLVR

Paper Reading: Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Examining how much Reinforcement Learning with Verifiable Rewards (RLVR) actually improves the reasoning ability of LLMs

Nov 22, 2025