Discussion about this post

User's avatar
Leonidas Tam, PhD's avatar

We need a benchmark for LLMs that glaze you the least. Gemini still has high agreeableness in my testing on aistudio

Expand full comment
havenpointconsulting's avatar

I like this. Even with more powerful models, you still need good instructions and clear boundaries. In operations, structure is everything, and AI is no different. The people who know how to guide the model will always get the best results.

Expand full comment
3 more comments...

No posts

Ready for more?