This module introduces two primary frameworks for constructing professional AI prompts:
- The 3-Pillar System: A methodology for transforming random outputs into intentional visual content.
  - Pillar 1: Structure (The Technical Foundation): This pillar defines the technical engineering of the image. It includes specifying camera angle, lens choice (e.g., 35mm, 85mm), lighting setup (e.g., soft window light, studio strobes), material properties (e.g., brushed steel, visible grain), and compositional elements (e.g., rule of thirds, negative space).
  - Pillar 2: Reference (The Style Anchor): This pillar anchors the image in a visual tradition by drawing from existing visual culture. It involves referencing specific photographers (e.g., Annie Leibovitz), artistic movements (e.g., Film Noir), historical eras (e.g., 1990s grunge), or even advertising campaigns (e.g., vintage Marlboro ads) to guide the AI's stylistic interpretation.
  - Pillar 3: Vision (The Emotional Intent): This is the 'soul' of the image, defining the emotional response it should evoke. It answers the question, 'What should people feel?' Examples include 'quiet confidence' or 'timeless majesty.'
- Layered Prompt Anatomy: A blueprint for building a detailed prompt.
  - Subject & Action: The core 'what' of the image (e.g., 'Woman laughing while holding coffee').
  - Technical Foundation (The 'Camera'): Lens, aperture, angle, and shot type.
  - Lighting Setup (The 'Mood Maker'): Source, direction, quality, and color temperature.
  - Material Reality (The 'Feel'): Surface properties, textures, and environmental interactions.
  - Compositional Rules: Framing, leading lines, and depth.
  - Style References: Artistic movements, historical periods, and technical aesthetics.
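The Layered Prompt Anatomy above can be sketched as a small data structure that joins the layers in the order listed. This is an illustrative sketch only, not the module's own tooling: the `LayeredPrompt` class, its field names, and the sample values (such as the shot type and light direction) are assumptions made for the example.

```python
from dataclasses import dataclass, fields


@dataclass
class LayeredPrompt:
    """Hypothetical container with one field per layer of the anatomy."""

    subject_action: str          # Subject & Action: the core 'what'
    camera: str = ""             # Technical Foundation: lens, aperture, angle, shot type
    lighting: str = ""           # Lighting Setup: source, direction, quality, color temperature
    materials: str = ""          # Material Reality: surfaces, textures
    composition: str = ""        # Compositional Rules: framing, leading lines, depth
    style_references: str = ""   # Style References: movements, periods, aesthetics

    def render(self) -> str:
        """Join the non-empty layers, in declaration order, into one prompt string."""
        parts = [getattr(self, f.name).strip() for f in fields(self)]
        return ", ".join(part for part in parts if part)


# Sample values are illustrative; several reuse examples from the list above.
example = LayeredPrompt(
    subject_action="Woman laughing while holding coffee",
    camera="85mm lens, eye-level medium shot",
    lighting="soft window light from the left",
    materials="visible film grain",
    composition="rule of thirds, negative space",
    style_references="1990s grunge",
)
print(example.render())
```

Keeping each layer in its own field makes it easy to vary one layer (for example, the lighting) while holding the others fixed. Empty layers are simply skipped when the prompt is rendered.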
The module also provides an example of a JSON-like structured format to organize these elements with keys such as style, lighting, motion, mood, and format.
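As a rough illustration of such a structured format, the sketch below builds a JSON object with the five keys named above. The module's actual example is not reproduced here, so the helper name `build_structured_prompt`, the key-checking behavior, and all sample values are assumptions.

```python
import json

# The five keys named in the text; their order here is an assumption.
EXPECTED_KEYS = ("style", "lighting", "motion", "mood", "format")


def build_structured_prompt(**elements: str) -> str:
    """Return a JSON string containing exactly the expected keys (hypothetical helper)."""
    missing = [key for key in EXPECTED_KEYS if key not in elements]
    unknown = [key for key in elements if key not in EXPECTED_KEYS]
    if missing or unknown:
        raise ValueError(f"missing keys: {missing}, unknown keys: {unknown}")
    # Emit keys in a fixed order so prompts are easy to compare side by side.
    ordered = {key: elements[key] for key in EXPECTED_KEYS}
    return json.dumps(ordered, indent=2)


# Sample values are illustrative only.
structured = build_structured_prompt(
    style="Film Noir",
    lighting="soft window light",
    motion="static, locked-off frame",
    mood="quiet confidence",
    format="16:9 still image",
)
print(structured)
```

Rejecting missing or unexpected keys is a design choice for this sketch: it keeps every prompt in the same shape, which helps when comparing outputs across variations.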