GYMO Studio Guide: Chat-mode, Director-mode & Render-mode
A complete walkthrough of GYMO Studio's three modes. Learn how to get the best results from scene creation to final video render.
The Three Modes of GYMO Studio
GYMO Studio is built around a simple workflow: create scenes, then turn them into video. But within that simplicity, there's a lot of creative power. Here's how each mode works and how to get the most out of them.
Chat-mode
Create scenes through conversation
Director-mode
AI suggests your storyline
Render-mode
Turn scenes into smooth video
Chat-mode: Scene Creation
Chat-mode is where your music video starts. Think of it as a conversation with an AI visual artist. You describe what you want to see, and the AI generates high-quality images that become the frames of your video.
You can start three ways: upload existing artwork as a starting point, reference a track to let AI interpret the mood, or describe scenes entirely from scratch. Each approach works differently but produces the same output: a sequence of visually consistent scenes.
Tips for Better Scenes
Start with Reference Art
Upload your album cover or any image that captures the mood. The AI uses it as a visual anchor for all generated scenes, ensuring consistency.
Build Scenes Sequentially
Each scene builds on the previous one. Describe how you want the story to evolve: "Now show the same character walking toward the city skyline at dusk."
Be Specific with Descriptions
Instead of "a cool background," try "neon-lit Tokyo alleyway at night, rain reflecting city lights, cyberpunk atmosphere." The more detail, the better the result.
Use Director-mode for Inspiration
Stuck on what comes next? Activate Director-mode and let the AI propose scene ideas based on your existing scenes and style. Approve, tweak, or regenerate.
Director-mode: AI Storylines
Director-mode is the creative brainstorming partner you didn't know you needed. Instead of describing every scene yourself, you activate Director-mode and let the AI suggest a visual narrative based on your existing scenes, artwork style, or genre.
You're always in control. The AI proposes, you approve or tweak. Think of it like having a creative director pitch ideas. You're the final decision maker.
What Director-mode Suggests by Genre
AI might suggest: urban nightscape → studio recording booth → crowd at a live show → aerial city shot at golden hour
AI might suggest: lonely road at dusk → neon motel sign → silhouette in window → field of wildflowers at sunrise
AI might suggest: abstract light patterns → futuristic cityscape → crowd with lasers → cosmic zoom-out to space
AI might suggest: cozy room with rain → steaming coffee cup → cat on windowsill → city lights through foggy glass
Render-mode: Scenes to Video
Once your scenes are ready in Chat-mode, switch to the Render tab. This is where static images become moving video. Each scene gets rendered individually with the motion and style you describe.
Review Your Scene Timeline
All your scenes from Chat-mode appear as a timeline strip. Reorder or remove scenes before rendering. Each scene becomes a clip in your final video.
Describe the Motion
For each scene, describe how it should move: "slow zoom into the subject's face" or "camera pans left across the landscape." This controls the video animation.
Choose Your AI Model
Different models produce different visual styles and quality levels. Pick the one that best matches your aesthetic. Higher-quality models take slightly longer.
Render and Download
Hit render and watch your scenes transform into smooth video. Download individual clips or the complete sequence. Use them on YouTube, Instagram, TikTok, or Spotify.
Pro Tips
Start with 4-6 scenes for your first video, then expand as you get comfortable
Use the same color palette across scenes for a cohesive look. Mention colors explicitly in your prompts
In Render-mode, subtle motion often looks better than dramatic camera moves
Try different AI models on the same scene to compare styles before committing
Reference real music videos you like: "in the style of a Tame Impala visual" gives the AI strong direction
For albums, create a shared visual language: same lighting, same color grading, different scenes per track
