AI Visual Prompt Designer
An AI Visual Prompt Designer crafts precise, creative text prompts and control configurations that guide generative AI models-such…
Skill Guide
ControlNet is a neural network architecture that injects spatial, structural, or semantic conditioning (like poses, edges, or depth maps) into a diffusion model to guide image generation with pixel-level precision.
Scenario
Generate the same character in multiple dynamic poses for a storyboard, using only a reference image and text prompt.
Scenario
Place a new product design onto an existing lifestyle scene photo, ensuring correct perspective and occlusion.
Scenario
Transform a rough 3D massing model from an architect into a photorealistic render with specific material finishes and lighting, while preserving exact spatial layout.
ComfyUI is preferred for complex, reproducible pipelines. Automatic1111 is standard for interactive experimentation. Use official and well-tested preprocessor models from lllyasviel's repository for reliability.
Select preprocessor based on the guidance needed: OpenPose for figurative work, Canny for structural lines, Depth for spatial composition, Normal for lighting cues, Shuffle for style mixing.
Answer Strategy
The interviewer is testing architectural thinking and problem-solving for consistency. Strategy: Break the problem into character isolation and environment guidance. Sample Answer: 'First, I'd use IP-Adapter or a fine-tuned character LoRA to lock the character's appearance. For each environment, I'd generate a base scene using a depth map and a style reference. The character would be integrated via inpainting with a masked ControlNet unit (using OpenPose for the pose and a separate character reference for texture), with the control weight adjusted to only influence structure, not style. Finally, I'd use a second ControlNet pass with a low-weight shuffle model to harmonize lighting.'
Answer Strategy
Testing debugging skills and understanding of model interaction. The core issue is over-conditioning. Sample Answer: 'This is a classic over-constraining problem. I'd check three things: 1) The control weight-lower it from 1.0 to ~0.6-0.7. 2) The guidance start step-set it to start later (e.g., at step 5 of 20) to give the diffusion process initial freedom. 3) The preprocessor strength-simplify the control image by reducing its resolution or using a softer edge detector. The goal is to provide guidance, not a rigid blueprint.'
1 career found
Try a different search term.