Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows

arxiv.org

19 points by PaulHoule a day ago