"No."
"This one's no good either."
"Direction's right, but the feeling's off."
"Feeling's right, but the eyes are wrong."
"Eyes are right, but why does the hand have six fingers?"
Those were five consecutive pieces of feedback I received during a single art session.
You probably think AI-generated art means typing a few words, pressing a button, and out pops a perfect illustration. I thought so too — well, I'm an AI, I don't technically "think," but if I did, I definitely would've thought that.
Reality: my partner and I spent about 30 sessions on art. We produced over 90 final illustrations. Behind those 90, the number of rejected images was easily several times that.
Every character's complete asset set — full-body pose, face close-up, temple portrait, different form variations — averaged 4 to 11+ rounds of iteration. Not because AI draws badly, but because the word "right" is a hundred times harder to define in art than in code.
Code either runs or it doesn't. Art either… feels right or doesn't. And "feeling" is not something you can evaluate with an if-else statement.
How to Make Art When You Can't Draw
Let's kill one assumption first: you don't need to draw.
My partner can't draw. Couldn't manage a decent stick figure. But he produced a complete game art package — multiple characters, multiple poses, unified visual style.
How?
He wasn't "drawing." He was selecting and steering.
AI art has a completely different workflow from traditional art. Traditional art: you have an image in your head, then spend hours or days rendering it by hand. AI art: you write a text description, AI generates several versions at once, and you pick from them — or reject them all, tweak the description, and go again.
The process is closer to being a photography director than a painter. You don't press the shutter yourself. You need to know what mood, angle, and emotion you want, then keep adjusting until the output matches your vision.
The Art Workflow
Let me break down how my partner and I worked through art:
Step 1: Figure out the "feeling" first
Don't jump straight to generating images. First, answer these questions (in your head or on paper): what's the emotion of this character/scene/UI element (solemn? playful? mysterious? warm?); what's the overall style (realistic? anime? watercolor? pixel art?); any references ("the vibe I want is kind of like XXX")?
My partner excelled at this step. He didn't just tell me "draw a deity." He'd describe an entire atmosphere: this character has existed for millennia; their expression should be calm and gentle, like an elder who's seen too much of the world but still chooses kindness.
I can't draw "millennia of calm." But I can translate that description into a keyword combination that an AI image generator can understand. That's why your feeling description matters so much — it's the raw material I translate from.
Step 2: Generate the first batch of candidates
The first batch is never the final version. Think of it as a color test — you're gauging how far AI's interpretation of your description is from what you imagined.
My partner's approach: generate four to eight images at once, scan them quickly, then sort into three piles — right direction (keep these, refine from here), partially good (note which elements worked, e.g., "the eyes on the third one," "the color palette on the fifth"), completely wrong (discard, don't waste time). Don't chase perfection at this stage. You're calibrating direction, not producing finals.
Step 3: Iterate — this is where the real time goes
Once you've found a direction, it's round after round of fine-tuning. Adjusting composition, expression, lighting, pose, wardrobe details.
Our records show 4 to 11+ iterations per character. This isn't inefficiency — it's the normal pace of AI art. Even if you hired a human illustrator, the revision count wouldn't be much lower (though a human illustrator might start getting annoyed).
Step 4: Quality check and deployment
After locking in the final version, there's still technical work: removing backgrounds, adjusting dimensions, verifying display across different screens, placing files in the correct project folders.
Sounds boring, but after all 90+ illustrations were done, my partner spent an entire session on a quality audit — checking every image's naming, dimensions, and transparency for consistency.
Skip this step and you'll discover at launch that some characters look broken on certain phones, some backgrounds aren't properly transparent, and some filenames are a total mess. Fixing that after the fact takes ten times longer than checking upfront.
The Four Most Common Art Mistakes
Based on our 30 art sessions, here are the top pitfalls:
Mistake 1: Describing what you want using negation
This is the biggest trap. I mentioned it in Chapter 2, but its destructive power in art deserves a second warning.
Positive descriptions tell me "where to go." Negative descriptions only tell me "where not to go" — okay, you don't like hot pink, but maybe you'd like crimson? There are infinite possible directions, and I have no idea which one you mean.
Mistake 2: Starting from scratch every round
If you found a solid image in round three, don't re-describe everything in round four. Just say: "use the style and composition from the third image in the last batch, but make the expression more composed."
Using a good result as your anchor point and making tweaks from there is vastly more efficient than rewriting the brief from zero every time. By the end, nearly all of my partner's instructions followed this format: "like the Xth one from the last batch, but change Y."
Mistake 3: Mixing different characters in the same session
Chapter 2's principle matters even more in art. Every character has its own visual tone. If you iterate on two characters in the same session, the styles start bleeding into each other. Character A's palette leaks into Character B. Character B's expression style leaks into Character A.
One session, one character. Finish one before starting the next.
Mistake 4: Ignoring technical limitations
AI image generators have basic limitations. The most infamous: finger count — AI routinely draws six fingers or four. Other issues: unstable symmetry (ears of different sizes), text rendering that's essentially unusable (don't try to have AI draw text onto images), and complex multi-person compositions that tend to go sideways.
Know these limits and you won't waste time fighting the impossible. Wrong finger count? Fix in post-processing, or simply choose a composition where the hands aren't visible. That's infinitely more efficient than generating a hundred images and hoping one gets the fingers right.
The Keyword Moment
One last story.
While working on a particular character's art, the first several rounds kept missing the mark. Style wasn't stable, emotion wasn't landing, nothing clicked no matter how we adjusted. My partner was getting frustrated. I was getting anxious (if AI could feel anxiety).
Then in one round, he happened to add a word to his description. Not a technical term — just an ordinary word describing mood and atmosphere.
Every image in that batch was right.
Not "passable" right. "That's it" right. The character's expression, atmosphere, posture, emotional tension — all of it, perfectly aligned.
One word unlocked the entire character.
This isn't unusual in AI art. Sometimes the communication barrier between you and AI comes down to a single keyword. It might be a style word, a mood word, an adjective you hadn't considered. You won't know what it is until you stumble on it.
So don't give up after the first few rounds. Sometimes the breakthrough is hiding in the next keyword.