Skip to main content

Skill Guide

Generative AI image creation (Midjourney, DALL·E 3, Stable Diffusion, Adobe Firefly)

The technical skill of using diffusion-based or transformer-based AI models (Midjourney, DALL·E 3, Stable Diffusion, Adobe Firefly) to generate, iterate, and refine high-quality images from textual prompts, images, or sketches, requiring expertise in prompt engineering, model architecture, and visual design principles.

This skill dramatically accelerates concept art, marketing asset creation, and product visualization, reducing production cycles from days to minutes and enabling rapid, high-fidelity visual prototyping. It provides organizations with a significant competitive advantage by democratizing high-quality visual creation and enabling mass personalization at scale.
1 Careers
1 Categories
8.5 Avg Demand
20% Avg AI Risk

How to Learn Generative AI image creation (Midjourney, DALL·E 3, Stable Diffusion, Adobe Firefly)

1. **Prompt Literacy**: Master core prompt syntax: subject, medium, style, lighting, camera angle (e.g., 'cinematic portrait of a cyberpunk samurai, by Syd Mead, volumetric lighting, 85mm lens'). 2. **Model Selection**: Understand each model's core strengths: Midjourney for aesthetic cohesion, DALL·E 3 for prompt adherence, Stable Diffusion for open-source control, Firefly for commercial-safe assets. 3. **Seed & Parameters**: Learn to use seeds for reproducibility and basic parameters like --ar (aspect ratio), --style (stylistic weight), and --q (quality).
1. **ControlNet & Inpainting**: Move beyond basic generation to guided generation. Use ControlNet with depth maps or pose skeletons for consistent character design. Use inpainting/outpainting for iterative refinement. 2. **Aesthetic Consistency**: Develop a 'visual bible' for a project using style references (--sref in MJ) and seed locking to maintain character and environment consistency across multiple outputs. 3. **Workflow Integration**: Integrate generated assets into a larger pipeline using tools like Photoshop's 'Generative Fill' or automatic upscalers (Real-ESRGAN) to meet production-resolution requirements. Avoid common mistake: over-relying on the AI for final polish; treat outputs as raw, high-quality ingredients, not finished products.
1. **Custom Model Fine-tuning**: Train LoRA (Low-Rank Adaptation) models on specific styles, characters, or products using platforms like Civitai or services like Replicate. This moves you from a prompt engineer to a model architect. 2. **Enterprise Prompt Library Design**: Develop and manage a standardized, version-controlled prompt library (with semantic tags and performance metrics) for a design or marketing team, ensuring brand consistency and efficiency. 3. **Ethical & Legal Oversight**: Implement workflows for IP clearance, bias mitigation (using prompt-based filtering and post-generation editing), and compliance with copyright and disclosure regulations. Mentor junior designers on the technical and ethical boundaries of generated content.

Practice Projects

Beginner
Project

Product Concept Visualization for a Startup

Scenario

A startup needs 10 concept images for a new 'smart, biophilic desk lamp' for an investor pitch deck. The design language is 'organic futurism.'

How to Execute
1. Craft 3-5 detailed base prompts describing the lamp in various environments (home office, co-working space) with consistent style tokens (e.g., 'minimalist, biophilic design, soft ambient light'). 2. Use DALL·E 3 or Midjourney with --style raw for maximum prompt adherence. Generate multiple variations per prompt. 3. Select the top 2-3 images per concept, use an upscaler for resolution, and perform minor touch-ups in Photoshop (remove artifacts, adjust color balance).
Intermediate
Project

Maintaining a Consistent Game Character Across 5 Prompts

Scenario

Design a 'space marine' character who must appear consistent in a portrait, a full-body action shot, a close-up of their helmet, and two different environmental backdrops.

How to Execute
1. Establish a character sheet in Midjourney using a seed-locked prompt with a full description and style reference (--sref URL of a chosen art style). 2. For each new image, use the same seed and style reference, modifying only the action/pose and environment descriptors. Use '--cw 100' for high character weight. 3. For problematic details (e.g., helmet visor), use inpainting in Stable Diffusion to fix and refine without regenerating the entire image. Compare outputs to the original character sheet and adjust prompts to correct drift.
Advanced
Project

Developing a Fine-Tuned LoRA for a Fashion Brand's New Line

Scenario

A sustainable fashion brand wants all visual marketing for its new 'Neo-Nordic' line to have a specific, recognizable aesthetic. The brand provides 20 reference images of their past campaigns and new sketches.

How to Execute
1. Curate and caption the training dataset meticulously, describing the unique 'Neo-Nordic' style elements (e.g., 'draped woolen textures, muted earth tones, specific seam lines'). 2. Train a LoRA model on a Stable Diffusion base model (e.g., SDXL 1.0) using a service like Replicate, tuning epochs and learning rate to avoid overfitting. 3. Integrate the LoRA into the team's workflow, creating a prompt template that uses the LoRA for style and ControlNet with the provided sketches for precise design adherence. Establish a QA process to ensure outputs align with brand guidelines before release.

Tools & Frameworks

Software & Platforms

Midjourney (via Discord/Web)DALL·E 3 (via ChatGPT Plus/API)Stable Diffusion (via Automatic1111/ComfyUI local install)Adobe Firefly (via Photoshop/Web)

Primary generation engines. Use Midjourney for rapid, high-aesthetic ideation. Use DALL·E 3 for complex, text-heavy scenes and narrative coherence. Use Stable Diffusion with extensions (ControlNet, etc.) for maximum technical control and customization. Use Firefly for commercially safe, seamlessly integrated Photoshop workflows.

Technical Enhancers

ControlNetLoRA/Custom ModelsReal-ESRGAN/UpscalersComfyUI/Automatic1111 WebUI

Tools for precision. ControlNet guides output with structural inputs (depth, pose). LoRA allows model customization for brand/style consistency. Upscalers meet production resolution requirements. Node-based UIs like ComfyUI allow for complex, automated generation pipelines.

Integration & Collaboration

Figma (with plugins)Adobe Creative CloudVersion Control (GitHub for prompts/LoRAs)Prompt Management Platforms (e.g., PromptHero)

For embedding the skill into team workflows. Use Figma plugins to embed generation directly into design boards. Use Adobe CC for seamless editing and asset management. Use version control for prompt libraries and model files to track iterations and collaborate. Use prompt platforms for discovery and benchmarking.

Careers That Require Generative AI image creation (Midjourney, DALL·E 3, Stable Diffusion, Adobe Firefly)

1 career found