How to "pose" characters with the AI?
The environments the AI spits out look pretty good, but I can't get characters doing stuff. I tried to create a girl on a skateboard with super simple prompts like "Girl on a skateboard" and more descriptive prompts like "A girl with long, pink hair does a handplant with her skateboard in a sunlit skatepark. Her muscular arms are fully tense and you can see in her face that the trick takes her full concentration". But no matter how I try to describe it, she's weirdly sitting on a completly deformed skateboard with the size of an elephant, lol.
I also tried stuff like "Dog bites man", but the results show a man walking his dog. So isn't it possible to put figures into action or am I doing something wrong? Is AI more or less limited to environments and portraits? Sorry, I never used AI before and I simply don't get it.
Comments
Right now the Daz AI Studio lacks the necessary tools that are available in other Stable Diffusion implementations to do what you are looking for, namely ControlNet and Inpainting. If you use a local installation of Stable Diffusion that has ControlNet like automatic1111, you can pose a figure or figures in Daz Studio, make a render, and use the render as an input into ControlNet you can get complex, dynamic poses generated. Some very complex poses, like ones where there are multiple figures or the figure is interacting with an object will also require inpainting to iteratively fix and make several edits to fix the poses. This is an example from my Stable Diffusion tutorial where I used a Daz Studio render with ControlNet to make an image that would be impossible to create with just text prompting.
But it's not really possible with fully AI generated figures, right? I've tried some more prompts in the meantime, but it looks like there's almost no control about expressions or interaction with things. I first thought it's maybe the learning curve of the AI, but I guess if basic prompts like "Girl on a skateboard", "Dog bites man" or "Man picks his nose" already fail, it's simply not possible with AI generated figures, right?
The jumping pose you did there is also a good example. Something so basic is not possible with just prompts, not even for more trained AIs llike Midjourney? That's surprisingly disappointing to be honest, I guessed AIs would be much better already. 0o
Yes, it is a tricky to get poses right.
Below is what I got with your longer prompt.
I think, when Daz AI Studio will evolve, we will get even more features.
For the comparison, below is an example image from the other AI system.
Do you have a link to your SD tutorial? I'm just starting to learn all about this generative AI stuff so I'm looking for good sources of information. Thanks!
The more specific you have a complex image in your mind that you want to generate, the harder it might be to describe using text prompts. That is where various image to image approaches can be helpful.
I have a walk through of using Stable Diffusion and Daz3D stuff to generate a multiple figure scene, some humans, some animals, some male, some female, some in crisp uniform, some in ragged clothes. Some standing, some sitting. I used Carrara, but the same principles apply to Daz Studio, Blender, Poser, etc. There is a lot wrong with this image, but there is a lot right. You can find the first post of the walk through in the forums here. https://www.daz3d.com/forums/discussion/comment/8645911/#Comment_8645911 And this is the example.
For comparison, here is what the new Daz-AI gave me when I typed in 'a horse grazing on rocky hillside beside a crouching male Canadian Mounted Police Officer above a sitting female fugitive wearing ragged clothes.
There are some good things in the Daz-AI image. The landscape is nice. But there are some not so good things, like the horse has two heads.
It will all get better with time and practice.
Animals are a mixed bag, sometimes good - sometimes bad. Hopefully when it is out of Beta there will have been bug fixes ....
Dromaeosaurus-like body with a bird-like appearance
Mike 9 - Prompt "Wearing a sci-fi suit running down a space station corridor"
A girl with long, pink hair does a handplant with her skateboard in a sunlit skatepark
Did 8, this was the best, skateboard wheels are out ....
Tried it too. Atleast the wheel are moving forward.
Thanks for the link, I've bookmarked it for later when I have more time.
I think the Mountie appreciates the new headgear -- while out on the trail it won't be as hot as their usual hat. ;-)
Thanks for the comments and trying the skater girl, that's exactly how my results looked alike. I didnt get a single picture without a deformed board, so I thought I do something wrong. I'm curious how it would look if I could tell the AI to use my skateboard products from the store, but I doubt it would look any better. Or will it use the poses of the products as well? I never tried any AI before and it really surprised me that its sooo limited when it comes to posing and interacting with objects. I often read that AI will destroy 3D art, but with results like these, I don't think there's any reason to worry. The AI feels almost useless as soon as you wanna go beyond environment backdrops and portraits without much emotion. At least with fully AI-generated characters.
Artinis picture of the handplant with the other program looks better, as the AI knows what a handplant is. And the board has reasonable proportions and isn't bent all over. The position looks weird because she not on the coping, but that could probably be improved with more precise prompts.
Btw, this here is a render I did some years ago and the inspiration why I tried the handplant prompts. So thats the pose I expected. I thought it's such an iconic trick that the AI would easily get it, I would never have expected that the AI wouldn't even be able to display a skateboard without errors. Pretty disappointing, but I guess I just completly overestimated AIs at all.
She is upside down and that is very likely the problem with the handplant pose. Apparently all AI apps hate anything upside down and insist on them being right-side up. Yes, even cats, and even when you specifically describe the pose. The Daz AI does not have an option to do negative prompts -- that is, a list of things you do NOT want in your images -- but even if it did, I am very certain it would not make any difference. I have tried, many many times and with different AI programs, to do upside-down/headstand/handstand poses, all to no avail. It might be possible with Control Net, but even using image2image I have not achieved a proper upside-down kitty ... or human.
Interesting discovery. I have tried prompt:
a cat landed down on the floor after a jump from bookshelf in living room
and got this from Daz AI Studio
I have tested Stable Diffusion XL and got similar results,
but from DALL-E 3, I got something else.
Anyhow good prompt to test different AI engines.
I have not tested Control Net yet in Stable Diffusion,
because I use ComfyUI interface and do not know how to approach it, yet.
Maybe somebody else could test it,
Much better result from Daz AI Studio using prompt:
a headstand man close to a mirror
At least it is upside down pose.
I think you have to be constantly finessing the prompt to help coax the image from the clutter.
Yes, finessing the prompt is something to consider.
For the comparison, image generated from DALL-E 3:
... and one in handstand pose.
Dall-E 3 has done upside down kitty? Hooray, gotta check that out! Meanwhile I shall continue working away at the prompts in Daz AI Studio, too. It can sometimes take a lot of tinkering with them to get what you're going for, or as close to it as possible. And I have yet to figure out Control Net in the app I do use. Hopefully my poses will improve as the program does.
I tried some more in the meantime and I think it's fine for environment backdrops, portraits without expressions and animals, but any interaction with anything just completly fails. I did some really basic stuff like "Angry woman strangles a man with her hands on his throat", see the attachments for the results...
It feels impossible to put the figures into any reasonable action. I haven't tried any other AI yet(your examples look better), but i really expected AI's to be MUCH better. The graphics may look good from the technical side, but where's the use if you can't setup the scenes you want? Or is this all just a learning process of the AI? And if so, how long will the AI have to learn to be able to understand basic interactions? Are we talking about months or years?
Very dramatic poses. Just experiment more.
It requires a lot of trials and patience to get desired results.
I have spent with Daz 3D products 20 years and I like the direction in which they evolve.
Depending on the AI's safety parameters, you may not be able to generate such scenes. If I entered that prompt in Playground AI, for example, odds are extremely high it would simply refuse and warn me off any future attempts as being violations that cause safety issues. Because it has no idea if the image of one angry person strangling, stabbing, shooting or whatever another is "innocent" -- say, a scene from a fictional detective story -- or an actual threat.
tl;dr: The AIs can't tell a mystery writer from a would-be homicidal maniac so they either slam on the brakes or sit spinning their wheels. Either way you get no useable results, unless you're running your own local install and a model without "guard rails".
Adding: I just tried the prompt "Angry woman strangles a man with her hands on his throat" and while Playground AI let me get away with it this time, the results were no more accurate than yours. And in one case, it only generated an angry woman with hands on her throat -- whose hands I have no idea -- and a lot of unasked for and totally unexpected red "paint" all over her face. Not gonna post them here because this is a Daz AI Studio thread, but it's probably gonna take a ton of prompt rewrites to come up with anything close to the desired results.
Well, that was just an example and I don't think it fails because of possible violence. I also tried multiple other, non-violent interactions between humans and/or props, like sport scenes. You can create a portrait of a soccer player, but you can't make him shoot a goal. Just try it, it will only give you completly weird images of players inside a deformed goal net(some examplesin the attachments). Same thing with any other sport.
Yikes! What in Murphy's name is going on with that net?! My prompt: man kicking soccer ball into goal net scoring perfect goal ...
Bad Daz AI, no biscuit!
Another challenging topic for Daz AI Studio. Below is the image from my trials...