Talkies
- This topic has 21 replies, 5 voices, and was last updated 1 day, 16 hours ago by Ed.
21st December 2025 at 7:34 pm #2020::
Spent a chunk of my Sunday trying out video generation with audio. Qwen is too restricted. Grok is a little better but hard to control. Pollo has its issues but it’s the easiest to control and has few restrictions. I’ve put together a 43-second video (from 4-5 second clips) with speech all the way through. It’s clunky (editing-wise) but I think it’s the best demonstration I’ve seen yet of what will be possible for us very soon.
The big question is, does anyone here want to see (and hear!) it?
-
22nd December 2025 at 7:27 am #2021
-
24th December 2025 at 12:37 am #2022
-
6th January 2026 at 11:37 pm #2059
-
7th January 2026 at 3:13 pm #2064
-
7th January 2026 at 3:44 pm #2065::
All completely AI creations. Very detailed prompts, cherry-picking the results (I picked the best 20 videos out of 56 created), and an element of luck combined to generate these.
Unlike the previous couple of videos I worked on, these clips were created by Grok and Qwen, mostly via my mobile in spare moments. As I’ve spent a lot of time in hospital waiting rooms lately, it seemed like a good use of time when I don’t have my laptop available.
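For anyone wondering what that cherry-picking costs in practice, here is a rough back-of-envelope sketch in Python. It uses only the two numbers from this post (20 clips kept out of 56 generated). The function names are made up for illustration, and the idea that the same keep rate would hold for a future batch is an assumption, not something measured.

```python
def keep_rate(kept: int, generated: int) -> float:
    """Fraction of generated clips that were good enough to keep."""
    if generated <= 0:
        raise ValueError("generated must be positive")
    return kept / generated


def generations_needed(target: int, kept: int, generated: int) -> int:
    """Generations needed to end up with `target` keepers.

    Assumes (hypothetically) that the keep rate seen so far stays the same.
    Uses integer ceiling division to avoid floating-point rounding.
    """
    if kept <= 0:
        raise ValueError("kept must be positive")
    return -(-target * generated // kept)


# Numbers from the post: 20 clips kept out of 56 created.
rate = keep_rate(20, 56)
print(f"keep rate: {rate:.0%}")

# How many generations a 10-clip video might take at the same rate.
print(generations_needed(10, 20, 56))
```

So a little over a third of the generations were usable, which is worth bearing in mind when budgeting credits for a longer video.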
-
8th January 2026 at 10:55 am #2069::
Tell me about it! If I added up all the endless hours I’ve spent waiting in hospital corridors and hospital waiting rooms since 2009, it’d run into weeks.
-
9th January 2026 at 5:17 pm #2070
-
-
7th January 2026 at 12:10 am #2060
-
7th January 2026 at 10:58 am #2061::
The image quality is also excellent and the lip sync, though not 100% perfect, is as good as most videos on the web.
-
7th January 2026 at 3:55 pm #2066::
Speech synchronisation is still a little wonky, and it varies from video to video. Some of it might be manually correctable with intensive video editing but I’m not going there.
Another advantage of these short clips is that the lack of consistency with voices isn’t a problem. If I create two video clips with the same characters in them, the AIs cannot be relied upon to be consistent with the voices produced.
Grok seems much more reliable on lip-sync than Qwen, but models are constantly being upgraded so that can change quickly.
-
-
7th January 2026 at 4:01 pm #2067::
I’m trying to get a straight answer on whether Grok Spicy does or doesn’t allow NSFW content. Even their website says both.
-
11th January 2026 at 11:32 pm #2076
-
13th January 2026 at 5:34 pm #2077
-
13th January 2026 at 10:14 pm #2078
-
13th January 2026 at 10:20 pm #2079::
I’ll need to pay £100-200 per month to do these videos with proper uncensored nudity. We need a lot more people to become NE patrons to make that viable.
I’ve got an NSFW one on the way but any nudity is fleeting or minor and snuck through the moderation. If I directly prompt nudity, it gets reliably rejected.
-
-
14th January 2026 at 11:28 am #2080::
Sometimes it feels like a video generator is trying to be infuriating. I tried a clip of a London cabbie telling a nudity-related joke:
- The first go was fine except the joke was too long and the last words were cut off. I reworded it and tried again.
- Go 2 was fine except that, for some weird reason, the London cab was bouncing all the way through as if someone heavy was repeatedly getting into the back. Added an instruction to keep the cab still.
- Go 3 – saw the back door slide open and then partially shut. That made no sense and was distracting, and the cabbie’s performance wasn’t great, so I added an instruction to keep the doors shut.
- Go 4 – the driver’s door opened at the start and the cabbie was (unlike the first 3) facing the wrong way. Stipulated which way the cabbie should look and tried again.
- Go 5 – the cab moved perfectly, no doors opened, the cabbie looked at the camera and spoke total gibberish. No idea what he said.
- Go 6 – the cab rocked a little too much but not as badly as in Go 2. Also, the driver’s door opened slightly, but the cabbie’s vocal performance was perfect. I try really hard not to spend too many credits on a single clip so I’m calling it a day there.
I feel that video could easily have been perfect on attempt 2. I’m sure a lot of people think you just tell it what you want and you get a perfect result first time. As I’m trying for more precise output, it’s taking a lot more work (and credits) to get there.
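The fix-one-thing-per-go routine above can be sketched as a tiny prompt builder. This is a hypothetical helper, not any generator's API: nothing here calls Grok, Qwen or Pollo, and the base prompt and constraint wording are paraphrased from the list above rather than the actual prompts used.

```python
from dataclasses import dataclass, field


@dataclass
class ClipPrompt:
    """Base prompt plus the corrections added after each failed go."""
    base: str
    constraints: list[str] = field(default_factory=list)
    attempts: int = 0

    def add_constraint(self, text: str) -> None:
        # Skip exact duplicates so the prompt does not bloat.
        if text not in self.constraints:
            self.constraints.append(text)

    def next_attempt(self) -> str:
        # Count the go (each one costs credits) and build the full prompt text.
        self.attempts += 1
        return " ".join([self.base, *self.constraints])


# Illustrative wording only.
prompt = ClipPrompt("A London cabbie tells a short joke to the camera.")
prompt.add_constraint("The cab stays completely still.")
prompt.add_constraint("All doors stay shut.")
prompt.add_constraint("The cabbie faces the camera.")
text = prompt.next_attempt()
print(prompt.attempts, text)
```

Keeping the corrections in one place makes it easier to reuse the same prompt a day or two later, or on a different generator, without retyping every instruction.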
-
14th January 2026 at 3:08 pm #2081::
I’ve tried using Pollo to make videos and, to be honest, it’s pretty rubbish. I only have a free account and log in frequently to pick up free credits, but I’m getting to the stage of giving up now.
-
14th January 2026 at 4:02 pm #2082::
It all depends on which AI model you are using on Pollo. They have a huge range of them. Most are content-restricted and expensive but some of the cheaper ones are pretty good. The complication is that things keep changing, so what works one month may not give the same results or offer the same options a month later. Wan 2.5 on Pollo used to work very well, but I haven’t used it in a few weeks. Pollo 2.5 was total cr*p last time I used it – bizarre distortion, random nonsense and completely inconsistent results. I’m hoping they’ve improved that.
I like the models which allow you to set a start frame and an end frame – that usually gives a LOT more control. You often need to turn the inference up too, and most of the models don’t give you control over that. I’ve found wording is SO important and you often need to specify when something is not doing anything or a person is not speaking just as much as the things or people who are active. Qwen isn’t bad and Grok is very good, but the nudity moderation is pretty strict on both of them (and about to get stricter, I imagine).
Of course, the addition of audio has complicated the entire field, but I’m not giving up yet. I swear all of them have good days and bad days too. Sometimes I can’t get a result on Qwen, but do on Grok or vice versa. Sometimes I end up saving a prompt and using it a day or two later when the AI is in a better mood.
I did once have such a bad day getting videos out of Qwen that I used the prompt “A woman smiling at the camera” with no start frame image, and the result still got moderated.
-
17th January 2026 at 11:44 am #2083