Discussion about this post

Leonidas Tam, PhD

re: "the “you’re absolutely right” trap when fixing bugs."

We absolutely need a no-glazing benchmark, maybe as a separate category on lmarena.ai.
