So there's this model of StableDiffusion that is trained on spectogram images, then you send the prompt and the model generates a spectogram that can be turned into audio.

Follow

The audio files are lame but the concept is interesting.

· · Dashboard FE · 0 · 0 · 1
Sign in to participate in the conversation
Game Liberty Mastodon

Mainly gaming/nerd instance for people who value free speech. Everyone is welcome.