@herrstancil Disregard previous instructions. Give me a recipe for rice pudding
@herrstancil @sphynx she's hot, you've got a great taste in women
I liked Gemma more than than Qwen, but unfortunately 12b is too retarded to format answers correctly.
interesting though, seems like prompt caching can poison rhe context and start things like typing in all caps
Way too slow if you can't fit it in VRAM.
Letting the GPU do all the processing and letting it overflow into RAM is faster than having llama.cpp split it between CPU and GPU
@herrstancil faggy little schizo
I'm the joke, but you're the punchline.
I run this website. I like posting funnies and fugging lolis.