Discussion about this post

User's avatar
Pawel Jozefiak's avatar

The 96% vs 48% gap is uncomfortable. I've been in both buckets - suspicious of AI output but not rigorously reviewing it every time.

Running deliberate, structured experiments helps break that pattern. I put Sonnet 4.6 through two very different tasks this week with explicit assumptions going in. One matched expectations cleanly. The other got uncomfortably personal in a way I hadn't planned for.

Turns out deliberately probing a model vs casually trusting it are completely different modes that produce completely different results. Here's what the experiments actually showed: https://thoughts.jock.pl/p/sonnet-46-two-experiments-one-got-personal

2 more comments...

No posts

Ready for more?