FilmoTvor.AI
Back to tutorials

Audio

Music, voice and SFX: score your film with /audio-studio

6 min readBeginner/audio-studio
Film Movie StudioRender
AI voiceoverNew project
Search skills…
AllWorkflowsQuick
video
Short DramaViral vertical mini-series, end to end.
image
Image CreatorGenerate single images from a prompt.
audio
Audio StudioVoiceover, SFX and music tracks.
copy
App BuilderMulti-screen apps with live preview.
image
E-commerce DesignPro product shots for any platform.
video
Music VideoBeat-synced clips cut to your track.

Picture is only half of a film — the other half is sound. In FilmMovie Studio you have one tool for the whole soundtrack: the /audio-studio skill. It handles three kinds of audio: music and background score (via Suno), spoken voiceover and dialogue (via TTS models such as ElevenLabs or Kokoro), and short sound effects. You just describe what you want to hear — the app shapes that into a precise generation prompt and drops the result as a playable card straight onto the canvas. In this guide we run one loop from an empty chat to audio on the timeline.

What you'll need

  • An open project in FilmMovie Studio (an empty one is fine).
  • An audio provider configured — Suno for music and a TTS model (ElevenLabs or Kokoro) for voice.
  • An idea for one sound: a track style, one line of voiceover, or a kind of SFX.

Step by step

  1. 1

    Open the Workspace

    With a project open you're in the Workspace — the Canvas on the left, the chat panel on the right. Click into the chat box so you can start typing.

    Film Movie StudioRender
    AI voiceoverNew project
    Skills
    /audio-studio
    /short-drama
    /image-creator
    /app-builder
    /games

    /audio-studio AI voiceover narration generated in-app — natural delivery, multiple voices and languages.

    SkillsModels

    Type / to switch skills anytime

  2. 2

    Type a prompt with /audio-studio

    In chat, type "/audio-studio" then describe the sound. For music, write a style, e.g. "/audio-studio cinematic orchestral score, dark and tense, slowly building strings, no vocals, loopable". For voiceover, dictate a line, e.g. "/audio-studio Say: Welcome to the city that never sleeps."

    Film Movie StudioRender
    AI voiceoverNew project
    Skills
    /audio-studio
    /short-drama
    /image-creator
    /app-builder
    /games

    /audio-studio AI voiceover narration generated in-app — natural delivery, multiple voices and languages.

    SkillsModels

    Type / to switch skills anytime

    Tip: For music, cover genre, instrumentation, mood and tempo. For voice, write the exact line you want spoken — the model reads your text verbatim, with no quotes or stage directions.
  3. 3

    Choose the kind and the voice

    The app decides from your wording whether it's music, voice or SFX — words like "score", "soundtrack" or "instrumental" route to music (Suno), while "say", "narrate" or "dialogue" route to voice (TTS). To pin a specific voice, add it to the prompt (e.g. "calm female voice") and pick the ElevenLabs or Kokoro TTS model.

    Film Movie StudioRender
    AI voiceoverNew project
    Skills
    /audio-studio
    /short-drama
    /image-creator
    /app-builder
    /games

    /audio-studio AI voiceover narration generated in-app — natural delivery, multiple voices and languages.

    SkillsModels

    Type / to switch skills anytime

    Tip: If it's unclear whether you want music or speech, say so outright: "create music…" or "narrate…". It saves you a round-trip.
  4. 4

    Watch the audio card appear

    After you send, an audio card lands on the canvas. It shimmers first (generating), then fills with the finished audio — for music it also shows a waveform.

    Film Movie StudioRender
    AI voiceoverNew project
    60%
    frame_01.png1024²
    Generating… 42%
    RemixVariations
    frame_02.png1024²
    Generating… 68%
    RemixVariations
    frame_03.png1024²
    Generating… 55%
    RemixVariations
  5. 5

    Preview the result

    Click the play button on the card and listen to the sound right on the canvas. If it's not right, right-click and choose "remix" — tweak the prompt (different tempo, different voice) and generate a fresh version beside it.

    Film Movie StudioRender
    AI voiceoverNew project
    Skills
    /audio-studio
    /short-drama
    /image-creator
    /app-builder
    /games

    /audio-studio AI voiceover narration generated in-app — natural delivery, multiple voices and languages.

    SkillsModels

    Type / to switch skills anytime

    Tip: Feel free to generate several takes of one track or line. The cards sit side by side on the canvas, so you can compare and promote the best one.
  6. 6

    Add the audio to the timeline

    Open the timeline (it docks at the bottom) and right-click the audio card → "add to timeline" (or drag it in). The timeline has separate tracks: put music on the music track, an effect on the SFX track and voiceover on the dialogue track.

    Film Movie StudioRender
    AI voiceoverNew project

    Drag an image here, or click to upload

    reference.png
    Tip: Put the voiceover on the dialogue track and the music below it on the music track — that way you set each level separately and the music won't drown the words.
  7. 7

    Balance the mix and move clips

    On the timeline, drag and trim clips so the sound lands with the picture. Adjust each track's level, and add a gentle fade-in and fade-out at the start and end if needed. When you're happy, the mix carries through to the final "Render".

    Film Movie StudioRender
    AI voiceoverNew project

    Drag an image here, or click to upload

    reference.png

What you get

From a single /audio-studio command you have a finished audio card — music, voiceover or SFX — previewed on the canvas and placed on the right timeline track. Repeat the same flow for every layer of sound: score, dialogue and effects. At the final render, all tracks mix down into one MP4 alongside the picture.

Pro tips

  • Generate music "loopable" and without vocals — it slips under a scene more easily and won't clash with dialogue.
  • For voiceover, write short sentences (under 25 words). TTS models like ElevenLabs or Kokoro sound more natural on shorter spans.
  • Split a longer voiceover into several cards, one per sentence — you can then fit each part to the picture more precisely on the timeline.

Try it yourself

FilmMovie Studio is a desktop app that walks you from idea to finished export. Download it and start creating.

Get FilmMovie Studio