@WandererUber @bronze It's tied to my IRL identity, but I can tell you the software I use:
- whisperx for subtitle generation
- gemma3n (when local) to translate the subtitles
- extra software to handle cutting up the mp3 file to have the voice segments
Then I pass the audio segments to koboldcpp running with qwen3-tts for it to clone. Gets the original audio, the text, spits out the result. It's not perfect, but it lets me watch a german TV show with mum in spanish.
The support for qwen3-tts isn't 100% wired though, I can't specify the language of the output audio, so sometimes I get french, german or english voices speaking spanish.
NGL being able to do this on a little iGPU makes me happy but also annoyed at how classmates at uni barely use their beefy gaming laptops for gayming and even then...