Character Reference

Use a face or body reference to lock identity across generations and keep consistent subjects in multi-panel work.

When to use character reference

  • Storytelling projects where the same person must appear across panels.
  • Avatar or portrait series with consistent age, ethnicity, wardrobe, and mood.
  • Product testimonials where a single model appears in multiple scenes.

What the model derives from the reference

Primarily: face proportions, skin tone range, hair texture, and posture hints. Secondary: wardrobe color family.

Practical tips

  1. Crop tightly to the head and shoulders to reduce background noise.
  2. Use consistent lighting between refs (mixed key light directions confuse the model).
  3. Add hair texture language in text if you want it to survive across generations (“thick, wavy dark hair with natural part”).
  4. Keep wardrobe neutral in the reference so it does not bleed into scenes where it is wrong.

Multi-image consistency

For a character that must appear in very different contexts, use two separate refs:

  • Ref A: face close-up
  • Ref B: full-body pose with environment

This lets the model lock identity while adapting to scene.

360° and pose iterations (official pattern)

Google’s documentation shows character consistency by reusing the last render and asking for a new angle—for example, “A studio portrait of this man against white, in profile looking right.” Treat each step as image + text: attach the previous output when you need the same person from another viewpoint. Details and examples: Gemini image generation doc.

Character consistency: original

Same character looking right

Same character looking forward

Common failure modes

  • Very low resolution refs (< 512 px) lose detail and produce inconsistent faces.
  • Mixing refs with very different ages or dramatically different makeup splits the face.