Discussion about this post

Leonidas Tam

We need a benchmark for which LLMs glaze you the least. Gemini is still highly agreeable in my testing on AI Studio.

arnestrickmann

Thanks for mentioning emdash, Ben!
