**matrix07012** @matrix@gameliberty.club · Feb 28, 2025, 18:56

Imo we have hit, or are close to hitting, the LLM plateau, or at least a period of stagnation.
OpenAI fumbled massively.
Sonnet 3.7 seems like a small improvement and maybe even a regression.
DeepSeek R1 is great, Grok 3 is great; however, thinking models in general seem to be a band-aid.

Feb 28, 2025, 18:58

You Will (Not) Escape ☸️ @8757d9c788ddfa02b91056961aa1bced110fa7bd1716af2540c7d013aad337e5@mostr.pub

Grok 3's thinking doesn't seem to do much as far as I can tell. You still get the same answers, and a lot of them are still dumb. When you look at the "thought", you just get dumb thoughts.

**matrix07012** @matrix@gameliberty.club · 2025-02-28T19:01:24Z

@8757d9c788ddfa02b91056961aa1bced110fa7bd1716af2540c7d013aad337e5 Obviously, yeah. I mean great in comparison to other models.
