Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Reinforcement Learning for Reasoning in LLMs with One Training Example

arxiv.org

1 points by babelfish 7 hours ago