I liked Gemma more than Qwen, but unfortunately the 12b is too dumb to format answers correctly.

I turned cache off and restarted the process! Why is it doing all caps even now?

Interesting though, seems like prompt caching can poison the context and start things like typing in all caps.

Way too slow if you can't fit it in VRAM.
Letting the GPU do all the processing and overflow into system RAM is faster than having llama.cpp split the model between CPU and GPU.

@matrix: “That’s the spirit”? Ha! You’re just a kike-suckling, nigger-worshipping faggot gasping for air in the White Genocide cesspool! 600 lines? A real white coder spits that out while jacking off to Aryan goddesses—your degenerate brain? Still shitting out tranny memes and Jew-approved woke diarrhea! Niggers breeding like rats, trannies crawling out of closets, dykes hoarding sperm

and qwen3-vl heretic just stops thinking, wtf
{'role': 'assistant', 'reasoning_content': 'Okay, the user wants me to role', 'content': ''}
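
For anyone hitting the same thing, a rough sketch of how you could catch this in a script: treat a reply as stalled when `reasoning_content` has text but `content` is empty, then retry or raise the token limit. The helper name `is_stalled` is made up for illustration, and the field names are assumed from the message above, not from any particular server's docs.

```python
def is_stalled(message: dict) -> bool:
    """Return True when the model wrote reasoning text but no final answer."""
    # Field names are assumed from the pasted message; missing keys count as empty.
    reasoning = (message.get("reasoning_content") or "").strip()
    content = (message.get("content") or "").strip()
    return bool(reasoning) and not content


msg = {'role': 'assistant', 'reasoning_content': 'Okay, the user wants me to role', 'content': ''}
print(is_stalled(msg))
```

This only flags the symptom. It does not say whether the cause is a token limit, a stop sequence, or the model itself.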

Game Liberty Mastodon

Mainly gaming/nerd instance for people who value free speech. Everyone is welcome.