Lynchmark is just another LLM benchmark. In it, I prioritize making changes to large codebases without breaking them. Also, it's not an automated benchmark.

  1. Gemini 2.5 Pro
  2. GPT-5