Trying to draw bodies and characters with AI tools : examples of approaches.

Hello,

After a couple of weeks spent on trying to make a kind of comic / graphic book, i would like to share some toughts on using all only recent artificial intelligence tools as my drawing author.

Drawing a Character : some trials.

Driving the AI to produce humanoid characters with full body and face is tricky. If you wish to have some emotions or actions postures it is almost pure chance based on the prompt you enter.

This may improve in the future with some future models dedicated to such a task.

As a workaround, i restricted the characters to be hidden from the point of view, or illustrated by black silouhettes.
To create those silouhettes, i was using the keyword 'silouhette', 'shapes', etc in mini Dall-E and it gave one of two interesting shadows that i multiplied in derivation by using the IC-GAN.

4ac68742c6cc033792ff2be5149ec862ca24f7846573eee9d8b6ad00.jpeg
First result with mini Dall-e

118c70df8890c227d250c54ab0ec9fb64e0580cb0def6163db78834d.jpeg
variation by IC-GAN

Then I collected dozens of such shadows. In some cases, i also refined a choosen silouhette with help of Diffusion process ( you may use the colab shared by @jotakrevs (https://colab.research.google.com/drive/11MaZKz-TFpmARojTGSWqrNL4y9kdjh_0?usp=sharing), for which you can feed the AI with a picture and a prompt that should drive the neural network in its creation. To give this comic painting style, i often try the "a painting by Enki Bilal" which is not always a proof of efficient result.

Sometimes it is better to concentrate on subject of the pictures than on the style you want. For example using a prompt like " A man staring at the sea", " a Cyborg raising its arms", " A woman reading a book" might slightly modify the posture to your needs.

Using The Lets Make a Collage Hive Library as source image

Altough i tend to avoid using real pictures as init images, i recently discovered the LMAC initiative on Hive. A cool Community (/trending/hive-174695) giving away, totally free of rights, photographic pictures that are collected and labeled in a search engine https://www.lmac.gallery/ .

This is perhaps the first collection where members explicitely abandon all their copyrights. I wanted to at least to credit this initiative to provide digital artists a palette of picture materail they may re-use for their artworks. I also share 2% of this post with the source image i will refer here below for this exercise, as gentle way to get tractions between hivers submitting their donations and hive authors using their picture.

See herunder a test i just provided from scratch :

I search the LMAC library and found the following picture, given by @muelli.

FUkUE5bzkAZT3HzV5tJDiU2ik81PCd4JCyhWnRcDN8XJsVFY3UNB8DCYV11UBh6ADh2ye9jK7AsKNNaph7T98MTnq3vhX2n5GQDH7W1riXuujjJqsxGRctD5Laz9AzXL6rcrPiEpk4txgMvSeawxtJUedvZ5JSy1Ct4N.jpg
original pic found on LiL, by @muelli

I cropped it to get the kind of point of view i want to emphasize.

FUkUE5bzkAZT3HzV5tJDiU2ik81PCd4JCyhWnRcDN8XJsVFY3UNB8DCYV11UBh6ADh2ye9jK7AsKNNaph7T98MTnq3vhX2na5GQDH7W1riXuujjJqsxGRctD5Laz9AzXL6rcrPiEpk4txgMvSeawxtJUedvZ5JSy1Ct4N.png

I use the Clip guided diffusion colab, with this image as init parameter, and a prompt named " cyber hacker staring at cloud drones in the sky, a painting by Enki Bilal".

0000.png clig guided image 0

replicate-prediction-3y46d6cov5cujer6pky5qvekpa.png clip guided image 250

After some session, these are the pictures i collected. Some to be possibly used after a polishing and upscaling via an ESRGAN algorythm, some to be used as init images to get derivation with help of the IC-GAN colab.

replicate-prediction-kq4tqyqzqneo3bhdrrdof3iaoe.png ic gan derivation 1 from image 0

replicate-prediction-rzjedmdoezcefoxahwcdi2xtyu.png ic gan derivation 2 from previous picture

replicate-prediction-z7pcq53p65f4zhq22gzvnphhmy.pngic gan derivation 3 from previous picture

One of those output might then be eligible to be new init images again for Clip guided diffusion, especially those that have been kept a bit blurry to let this algorythm a bit creative.
pilo.png clip guided result from ic gan derivation 1

fsqdf.pngclip guided result from ic gan derivation 3

For this example beside the text, i only increase the clip guidance scale to 2000, and saved some frames to be kept in my library for future use.

For those latest results, i used a prompt like 'a cyber space pilot in front of sunlights, a painting by Enki Bilal'

At the end i upscaled my preferred one and made some zoomed picture.

klhyi_out.png
my preferred one, because the face traits are quite ok.

klhyi_outface_out.png
cropped and upscaled.

As you might guess, one picture can generate an infinity of possibilities. I keep a bunch of them as a stock image to maybe build some follow-up of my comic book.

I you read so far, thanks. I hope i feeded a bit your curiosity about the work behind my current ongoing graphical delirium.


If you curiosity is picked, i can only recommend to check the post of the people belonging to Latent Space Community - a child of - the Alien Art Hive Community.
Stay tuned.


My previous posts are here

H2
H3
H4
3 columns
2 columns
1 column
15 Comments
Ecency