::
Spent a chunk of my Sunday trying out video generation with audio. Qwen is too restricted. Grok is a little better but hard to control. Pollo has its issues but it’s easiest to control and has few restrictions. I’ve put together a 43 second video (from 4-5 second clips) with speech all the way through. It’s clunky (editing-wise)but I think it’s the best demonstration I’ve seen yet of what will be possible for us very soon.
The big question is, doesn’t anyone here want to see (and hear!) it?