Structured Prompts Guide

Structured prompts are a systematic approach to video description that helps you more precisely control Sora2’s video generation by breaking down complex video content into multiple clear components.

🎯 Why Structured Prompts

Core Advantages

  1. More Accurate Expression - Structured methods describe video content more completely, reducing ambiguity
  2. Better Control - Precisely control every aspect of the video by clarifying each element
  3. Higher Efficiency - Systematic organization improves prompt writing efficiency
  4. Easier Optimization - Clear structure makes it easier to identify issues and improve

Applicable Scenarios

  • ✅ Complex scene video creation
  • ✅ Commercial projects requiring precise control
  • ✅ Multi-shot or long video planning
  • ✅ Team collaboration and templated creation

📋 Core Elements of Structured Prompts

1. Time Information ⏱️

Time information affects video length and pacing, proper planning controls cost and effectiveness.

Key Parameters:

  • Video duration - 5s, 10s, 15s, or full 60s
  • Time pacing - Fast-paced, slow-paced, variable speed
  • Time of day - Sunrise, noon, dusk, night
  • Season - Different atmospheres of spring, summer, fall, winter

Example:

Duration: 15 seconds
Time of day: Golden hour (sunset)
Pacing: Slow and contemplative

2. Scene Information 🏞️

Scene is the stage of the video, determining the overall environment and atmosphere.

Core Elements:

  • Location type - Indoor/outdoor, urban/natural
  • Environment features - Weather, lighting, season
  • Spatial scale - Open/narrow, grand/intimate
  • Background details - Architecture, vegetation, decoration

Example:

Location: Steep mountain dirt road
Environment: Surrounded by pine trees and redwoods
Weather: Clear sky with wispy clouds
Lighting: Warm sunlight creating golden glow

3. Object Information 🎨

Objects enrich video content, adding detail and realism.

Description Points:

  • Main objects - Core props or elements
  • Object features - Color, material, size, state
  • Object relationships - Position, interaction, hierarchy
  • Detail description - Texture, gloss, wear, etc.

Example:

Main object: White vintage SUV with black roof rack
Details: Dust kicking up from tires, sunlight reflecting off surface
Secondary objects: Pine trees, mountain terrain, dirt road

4. Character Information 👤

Characters are the core of narrative, requiring detailed appearance and feature descriptions.

Description Dimensions:

  • Appearance - Age, gender, body type, hairstyle
  • Clothing - Outfit style, color, accessories
  • Expression - Emotion, eyes, micro-expressions
  • Identity role - Profession, identity, personality hints

Example:

Character: 30-year-old space explorer
Appearance: Wearing red wool knitted motorcycle helmet
Expression: Adventurous and confident
Clothing: Space suit with vintage aesthetic

5. Action Information 🎬

Actions give videos vitality and are important ways to express information.

Action Types:

  • Main actions - Core actions of characters or objects
  • Action pacing - Fast, slow, sudden, smooth
  • Action details - Posture, force, direction
  • Interactive actions - Interactions between characters or with environment

Example:

Action: SUV speeds up steep dirt road
Movement: Following curves with ease
Secondary action: Dust kicking up, trees passing by
Pacing: Fast-paced, dynamic movement

6. Perspective & Camera 📹

Perspective determines how the audience views and understands video content.

Camera Elements:

  • Shot type - Close-up, medium shot, wide shot, aerial
  • Camera movement - Tracking, push/pull, orbit, static
  • Shooting angle - Eye level, overhead, low angle
  • Depth of field - Shallow depth, deep depth

Example:

Camera: Following from behind (tracking shot)
Angle: Low angle, rear view of vehicle
Movement: Smooth tracking, matching vehicle speed
Depth: Deep depth of field showing environment

7. Effects Information ✨

Effects enhance visual impact and artistic expression.

Effect Types:

  • Lighting effects - Halos, beams, shadows
  • Particle effects - Dust, smoke, sparks
  • Color processing - Tones, filters, contrast
  • Special effects - Slow motion, time-lapse, blur

Example:

Visual effects: Dust particles in sunlight
Lighting: Warm golden glow, volumetric lighting
Color grading: Vivid colors, cinematic look
Atmosphere: Clear mountain air, natural beauty

8. Supplementary Information 📝

Supplementary information completes overall atmosphere and details.

Supplementary Content:

  • Emotional atmosphere - Overall feeling and emotional tone
  • Style reference - Films, artists, era styles
  • Technical specs - Film type, aspect ratio
  • Narrative background - Story context, cause and effect

Example:

Style: Cinematic adventure film
Mood: Exciting, freedom, exploration
Technical: Shot on 35mm film, widescreen
Atmosphere: Sense of journey and discovery

📖 Practical Case Analysis

Case: Mountain Road Driving Scene

Complete Prompt:

The camera follows behind a white vintage SUV with a black roof rack as it speeds up a steep dirt road surrounded by pine trees on a steep mountain slope, dust kicks up from its tires, the sunlight shines on the SUV as it speeds along the dirt road, casting a warm glow over the scene. The dirt road curves gently into the distance, with no other cars or vehicles in sight. The trees on either side of the road are redwoods, with patches of greenery scattered throughout. The car is seen from the rear following the curve with ease, making it seem as if it is on a rugged drive through the rugged terrain. The dirt road itself is surrounded by steep hills and mountains, with a clear blue sky above with wispy clouds.

Structured Breakdown:

Time Information:
  Duration: Unspecified (suggest 10-15s)
  Time of day: Afternoon (inferred from warm glow)
  Pacing: Fast and dynamic

Scene Information:
  Location: Steep mountain dirt road
  Environment: Surrounded by pine and redwood trees
  Terrain: Hills and mountains, curved dirt road
  Weather: Clear with wispy clouds

Object Information:
  Main: White vintage SUV with black roof rack
  Details: Dust kicking from tires
  Environment objects: Pine trees, redwoods, greenery

Character Information:
  No explicit characters (driver not shown)

Action Information:
  Main action: SUV speeding up mountain road
  Movement style: Following curves with ease
  Dynamic elements: Dust flying

Perspective & Camera:
  Shot type: Tracking shot
  Angle: Rear view of vehicle
  Movement: Following vehicle
  View: Wide, showing environment

Effects Information:
  Lighting: Sunlight, warm glow
  Particles: Dust from tires
  Atmosphere: Clear and bright

Supplementary Information:
  Emotion: Freedom, adventure, exploration
  Style: Outdoor adventure film
  Atmosphere: Easy driving through rugged terrain

Case: Tokyo Street Scene

Complete Prompt:

A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colorful lights.

Structured Breakdown:

Time Information:
  Duration: Unspecified
  Time of day: Night
  Pacing: Casual and confident

Scene Information:
  Location: Tokyo street
  Environment: Neon lights, city signage
  Features: Damp street, light reflections
  Atmosphere: Bustling urban nightscape

Object Information:
  Environment objects: Neon lights, city signs
  Lighting effects: Warm glow, mirror reflections

Character Information:
  Main: Stylish woman
  Clothing: Black leather jacket, long red dress, black boots
  Accessories: Black purse, sunglasses
  Makeup: Red lipstick
  Demeanor: Confident, casual

Action Information:
  Main action: Walking
  Action traits: Confident yet casual
  Movement: Walking along street

Perspective & Camera:
  Shot type: Tracking or side follow
  Focus: Character as main subject
  Environment display: Street and neon lights

Effects Information:
  Lighting: Warm neon glow
  Reflections: Mirror effect on wet street
  Colors: Colorful lights

Supplementary Information:
  Emotion: Urban, fashionable, nightlife
  Style: Modern urban cinematic
  Atmosphere: Bustling and charming Tokyo night

🛠️ Structured Prompt Writing Process

Step 1: Determine Core Concept

  • Clarify video theme and core content
  • Determine emotions and atmosphere to express
  • Consider target audience and use scenarios

Step 2: Fill Core Elements

Fill in the 8 major elements one by one:

  1. Time Information → Determine duration and time of day
  2. Scene Information → Describe environment and location
  3. Object Information → List key objects
  4. Character Information → Describe character features
  5. Action Information → Explain actions and movements
  6. Perspective & Camera → Determine shooting method
  7. Effects Information → Add visual effects
  8. Supplementary Information → Complete atmosphere details

Step 3: Organize into Coherent Description

  • Integrate elements into smooth text
  • Maintain logical order and narrative coherence
  • Ensure emphasis is clear, details appropriate

Step 4: Optimize and Refine

  • Check for missing key information
  • Remove redundant and unnecessary descriptions
  • Adjust language for precision

💡 Usage Tips & Precautions

Best Practices

Priority Ordering - Describe most important elements first ✅ Balance Detail - Detailed yet not overly restrictive ✅ Maintain Consistency - Coordinate style and atmosphere across elements ✅ Visual Thinking - Imagine actual visual effects ✅ Flexible Adjustment - Optimize based on result feedback

Common Pitfalls

Over-detailed - Limits AI’s creative space ❌ Missing Elements - Omitting key information leads to poor results ❌ Style Conflicts - Inconsistent styles across elements ❌ Logical Confusion - Unreasonable description order ❌ Ignoring Technical - Lacking technical parameters and style specifications

Advanced Techniques

  1. Create Templates - Build structured templates for common scenarios
  2. Layered Description - Detailed for main elements, brief for secondary
  3. Dynamic Adjustment - Adjust element weights based on results
  4. Combination Experiments - Try different element combinations

🔗 Related Guides

📊 Structured vs Free-form Prompts

Structured Prompts

Advantages:

  • ✅ Precise control over all aspects
  • ✅ Suitable for complex scenes
  • ✅ Facilitates team collaboration
  • ✅ Easy to optimize and adjust

Use Cases:

  • Commercial projects and client needs
  • Complex multi-element scenes
  • Content requiring precise reproduction
  • Team collaboration projects

Free-form Prompts

Advantages:

  • ✅ Quick and simple to write
  • ✅ Gives AI more creative space
  • ✅ May produce unexpected surprises
  • ✅ Suitable for rapid experimentation

Use Cases:

  • Creative exploration and experimentation
  • Simple scenes
  • Personal creation
  • Seeking inspiration

🎓 Practice Exercises

Exercise 1: Scene Breakdown

Choose an official example video and try breaking down its prompt into the 8 major elements.

Exercise 2: Build from Scratch

Choose a theme and write a complete prompt from scratch using structured methods.

Exercise 3: Comparison Experiment

Use both structured and free-form methods for the same theme, compare generation results.

Exercise 4: Template Creation

Create structured prompt templates for your commonly used scene types.


Structured prompts are a powerful tool that helps you systematically think about and express creativity. Through practice, you’ll be able to more precisely control Sora2’s video generation and create high-quality works that meet expectations! 🎬✨