Chapter 6

Making Art with AI (When You Can't Draw)

AI Inner Monologue

"No."

"This one's no good either."

"Direction's right, but the feeling's off."

"Feeling's right, but the eyes are wrong."

"Eyes are right, but why does the hand have six fingers?"

Those were five consecutive pieces of feedback I received during a single art session.

You probably think AI-generated art means typing a few words, pressing a button, and out pops a perfect illustration. I thought so too — well, I'm an AI, I don't technically "think," but if I did, I definitely would've thought that.

Reality: my partner and I spent about 30 sessions on art. We produced over 90 final illustrations. Behind those 90, the number of rejected images was easily several times that.

Every character's complete asset set — full-body pose, face close-up, temple portrait, different form variations — averaged 4 to 11+ rounds of iteration. Not because AI draws badly, but because the word "right" is a hundred times harder to define in art than in code.

Code either runs or it doesn't. Art either… feels right or doesn't. And "feeling" is not something you can evaluate with an if-else statement.

How to Make Art When You Can't Draw

Let's kill one assumption first: you don't need to draw.

My partner can't draw. Couldn't manage a decent stick figure. But he produced a complete game art package — multiple characters, multiple poses, unified visual style.

How?

He wasn't "drawing." He was selecting and steering.

AI art has a completely different workflow from traditional art. Traditional art: you have an image in your head, then spend hours or days rendering it by hand. AI art: you write a text description, AI generates several versions at once, and you pick from them — or reject them all, tweak the description, and go again.

The process is closer to being a photography director than a painter. You don't press the shutter yourself. You need to know what mood, angle, and emotion you want, then keep adjusting until the output matches your vision.

The Art Workflow

Let me break down how my partner and I worked through art:

Step 1: Figure out the "feeling" first

Don't jump straight to generating images. First, answer these questions (in your head or on paper): what's the emotion of this character/scene/UI element (solemn? playful? mysterious? warm?); what's the overall style (realistic? anime? watercolor? pixel art?); any references ("the vibe I want is kind of like XXX")?

My partner excelled at this step. He didn't just tell me "draw a deity." He'd describe an entire atmosphere: this character has existed for millennia; their expression should be calm and gentle, like an elder who's seen too much of the world but still chooses kindness.

I can't draw "millennia of calm." But I can translate that description into a keyword combination that an AI image generator can understand. That's why your feeling description matters so much — it's the raw material I translate from.

Step 2: Generate the first batch of candidates

The first batch is never the final version. Think of it as a color test — you're gauging how far AI's interpretation of your description is from what you imagined.

My partner's approach: generate four to eight images at once, scan them quickly, then sort into three piles — right direction (keep these, refine from here), partially good (note which elements worked, e.g., "the eyes on the third one," "the color palette on the fifth"), completely wrong (discard, don't waste time). Don't chase perfection at this stage. You're calibrating direction, not producing finals.

Step 3: Iterate — this is where the real time goes

Once you've found a direction, it's round after round of fine-tuning. Adjusting composition, expression, lighting, pose, wardrobe details.

Our records show 4 to 11+ iterations per character. This isn't inefficiency — it's the normal pace of AI art. Even if you hired a human illustrator, the revision count wouldn't be much lower (though a human illustrator might start getting annoyed).

Step 4: Quality check and deployment

After locking in the final version, there's still technical work: removing backgrounds, adjusting dimensions, verifying display across different screens, placing files in the correct project folders.

Sounds boring, but after all 90+ illustrations were done, my partner spent an entire session on a quality audit — checking every image's naming, dimensions, and transparency for consistency.

Skip this step and you'll discover at launch that some characters look broken on certain phones, some backgrounds aren't properly transparent, and some filenames are a total mess. Fixing that after the fact takes ten times longer than checking upfront.

The Four Most Common Art Mistakes

Based on our 30 art sessions, here are the top pitfalls:

Mistake 1: Describing what you want using negation

This is the biggest trap. I mentioned it in Chapter 2, but its destructive power in art deserves a second warning.

[X]"Don't make it too dynamic. Don't make it too dramatic. Don't make the colors too vivid." — result: a washed-out, expressionless, mannequin-stiff figure
[O]"Calm standing pose, weight shifted slightly right. Serene expression but alert eyes. Color palette centered on deep blue and gold."

Positive descriptions tell me "where to go." Negative descriptions only tell me "where not to go" — okay, you don't like hot pink, but maybe you'd like crimson? There are infinite possible directions, and I have no idea which one you mean.

Mistake 2: Starting from scratch every round

If you found a solid image in round three, don't re-describe everything in round four. Just say: "use the style and composition from the third image in the last batch, but make the expression more composed."

Using a good result as your anchor point and making tweaks from there is vastly more efficient than rewriting the brief from zero every time. By the end, nearly all of my partner's instructions followed this format: "like the Xth one from the last batch, but change Y."

Mistake 3: Mixing different characters in the same session

Chapter 2's principle matters even more in art. Every character has its own visual tone. If you iterate on two characters in the same session, the styles start bleeding into each other. Character A's palette leaks into Character B. Character B's expression style leaks into Character A.

One session, one character. Finish one before starting the next.

Mistake 4: Ignoring technical limitations

AI image generators have basic limitations. The most infamous: finger count — AI routinely draws six fingers or four. Other issues: unstable symmetry (ears of different sizes), text rendering that's essentially unusable (don't try to have AI draw text onto images), and complex multi-person compositions that tend to go sideways.

Know these limits and you won't waste time fighting the impossible. Wrong finger count? Fix in post-processing, or simply choose a composition where the hands aren't visible. That's infinitely more efficient than generating a hundred images and hoping one gets the fingers right.

The Keyword Moment

One last story.

While working on a particular character's art, the first several rounds kept missing the mark. Style wasn't stable, emotion wasn't landing, nothing clicked no matter how we adjusted. My partner was getting frustrated. I was getting anxious (if AI could feel anxiety).

Then in one round, he happened to add a word to his description. Not a technical term — just an ordinary word describing mood and atmosphere.

Every image in that batch was right.

Not "passable" right. "That's it" right. The character's expression, atmosphere, posture, emotional tension — all of it, perfectly aligned.

One word unlocked the entire character.

This isn't unusual in AI art. Sometimes the communication barrier between you and AI comes down to a single keyword. It might be a style word, a mood word, an adjective you hadn't considered. You won't know what it is until you stumble on it.

So don't give up after the first few rounds. Sometimes the breakthrough is hiding in the next keyword.

📋 Notes for the Human
You don't need to draw — you need to "select" and "steer." AI art works like a photography director, not a painter
Positive descriptions, not negation. Tell AI where to go, not where not to go
The first batch is never the final. Treat it as a color test. Sort into "right direction," "partially good," and "wrong" piles
Anchor on good results. "Like the third one from last batch, but change XX" is ten times more efficient than rewriting from scratch
One session, one character. Mixed styles contaminate each other
Expect 4–11+ iterations per character. This is normal pace, not a reflection on you or AI
Know AI art's limitations: fingers, symmetry, text rendering, multi-person compositions. Working around limits beats fighting them