So there's this model of StableDiffusion that is trained on spectogram images, then you send the prompt and the model generates a spectogram that can be turned into audio.
The audio files are lame but the concept is interesting.
Mainly gaming/nerd instance for people who value free speech. Everyone is welcome.