Keyboard Shortcuts
View and customize your hotkeys
Tip: Click any shortcut to change it. Press Esc to cancel.
Noise Suppression
Reduce steady background noise and static locally in your browser.
Audio clips to process: 0
Selected video clips use their linked audio when available.
Auto-set from the selected clip. Higher values allow stronger noise reduction.
Auto-set from the selected clip. Higher values can reduce leftover static.
The processed WAV is also added to the media library.
First run loads the local model, then processes audio on this device.
Remove Silence (Local)
Detects and removes silence using waveform analysis.
When unchecked, only the selected clip(s) are processed.
Uses subtitle density to detect talking-head clips and skip B-roll footage.
Auto
0.3
0.0
Negative = cut into audio, Positive = leave padding
0.0
Preview
Silence regions found: 0
Total time to remove: 0.0s
Clips affected: 0
Red highlights on the timeline show what will be removed.
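As an illustration of what waveform-based silence detection with padding can look like, here is a minimal sketch. It is not the editor's actual implementation: the function name `find_silence`, the RMS threshold, the window size, and the minimum silence length are all assumptions chosen for the example. Only the padding convention (negative cuts into audio, positive leaves padding) comes from the text above.

```python
import math
from typing import List, Sequence, Tuple


def find_silence(
    samples: Sequence[float],
    sample_rate: int,
    threshold: float = 0.02,    # assumed RMS level below which audio counts as silent
    min_silence: float = 0.3,   # assumed minimum silence length in seconds
    padding: float = 0.0,       # positive leaves padding, negative cuts into audio
    window: float = 0.01,       # assumed analysis window in seconds
) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds of regions that would be removed."""
    hop = max(1, int(sample_rate * window))
    total = len(samples) / sample_rate
    raw: List[Tuple[float, float]] = []
    start = None
    for i in range(0, len(samples), hop):
        chunk = samples[i:i + hop]
        rms = math.sqrt(sum(x * x for x in chunk) / len(chunk))
        t = i / sample_rate
        if rms < threshold:
            if start is None:
                start = t           # a silent run begins here
        elif start is not None:
            raw.append((start, t))  # the silent run ended at this window
            start = None
    if start is not None:
        raw.append((start, total))

    regions: List[Tuple[float, float]] = []
    for s, e in raw:
        if e - s < min_silence:
            continue                # too short to count as silence
        s2 = max(0.0, s + padding)  # shrink (or grow) the cut on both sides
        e2 = min(total, e - padding)
        if e2 > s2:
            regions.append((s2, e2))
    return regions
```

With one second of tone, one second of silence, and one more second of tone, the sketch reports a single region covering the middle second; a positive padding shrinks that region and a negative padding widens it.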
Background Removal Options
Lower sensitivity keeps more of the subject when the mask removes too much.
Improves edges around hair and fine details, but takes longer.
240
10
10
These settings apply only when the refined edges option is enabled.
50%
Lower keeps more of the subject. Higher removes more uncertain pixels.
0.0px
Softens the transparent edge after the mask is created.
Background Removal Preview
Before
After
Generate Subtitles (Priority)
Select one or more clips on the timeline to generate subtitles with word-by-word timestamps.
Text to Speech
Convert text to speech using different AI models. Maximum 5000 characters.
Describe the desired voice characteristics, emotion, or speaking style
Voice Characteristics
Pick the traits of the voice you want. Only Gender is required.
Upload a 3+ second audio sample of the voice to clone
Transcription of what is spoken in the reference audio
Expression
How emotional the voice sounds. 0.5 is neutral; higher feels more dramatic.
Lower values slow down the pacing; higher values stick closer to the reference style.
How varied the speech sounds across generations.
Use the same seed to repeat a result. 0 picks a new one each time.
Optional. Upload a clear 6-10 second sample of the voice you want to clone.
Generate Music
Generate original music from a text description.
Describe the mood, instruments, tempo, and genre. Maximum 500 characters.
Larger models take more time.
How long the generated music should be. Maximum 120 seconds.
Higher values produce more variety; lower values stay closer to the prompt.
How strongly the prompt steers the result. Higher = more faithful to the description.
Limits sampling to the K most likely choices. Set to 0 to disable.
Use nucleus sampling instead of Top-K. Set to 0 to disable.
Leave at -1 for random. Use a fixed number to reproduce a result.
Optional. The melody model will follow this melody. Required when the Melody model is selected.
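The variety, Top-K, nucleus, and seed settings above correspond to common sampling controls in generative models. The sketch below shows one way they can interact when picking a single token from a list of scores. It is an illustration under assumptions, not the generator's real code: the function `sample_token` and its parameter names are hypothetical, and only the conventions "0 disables", "nucleus is used instead of Top-K", and "-1 means random" are taken from the descriptions above.

```python
import math
import random
from typing import List


def sample_token(
    logits: List[float],
    temperature: float = 1.0,  # must be > 0; higher gives more variety
    top_k: int = 0,            # 0 disables Top-K
    top_p: float = 0.0,        # 0 disables nucleus sampling
    seed: int = -1,            # -1 picks a random seed
) -> int:
    """Pick an index from logits using temperature plus Top-K or nucleus filtering."""
    rng = random.Random(None if seed == -1 else seed)

    # Temperature-scaled softmax, shifted by the max for numerical stability.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Candidate indices from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    if top_p > 0:
        # Nucleus: keep the smallest prefix whose cumulative probability reaches top_p.
        kept, cumulative = [], 0.0
        for i in order:
            kept.append(i)
            cumulative += probs[i]
            if cumulative >= top_p:
                break
    elif top_k > 0:
        kept = order[:top_k]   # Top-K: keep only the K most likely choices
    else:
        kept = order           # no filtering

    # random.choices renormalizes the weights of the kept candidates.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]
```

Calling the function twice with the same non-negative seed returns the same index, which mirrors the "use a fixed number to reproduce a result" behavior.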
Generate Video (AI)
Create a brand-new video clip from a text description. This is the slowest, most demanding option, so try stock footage, your own uploads, and motion graphics first.
Describe the scene, subject, camera, and style. Maximum 2000 characters.
Things you do not want to appear in the video.
LTX 2.3 is currently available.
Leave at -1 for random. Use a fixed number to reproduce a result.
Total frames. Seconds = frames ÷ frame rate.
More steps can improve quality but take longer.
How strongly the description steers the result.
Keep at 1.0 for fully generated video.
Optional. The video will animate from this image (image-to-video).
Generating video uses a large amount of your usage allowance and can take several minutes. You are only charged if generation succeeds.
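The frame-count rule in the video settings (seconds = frames ÷ frame rate) can be worked through with a small helper. The function names and the example numbers below are illustrative assumptions; the model may impose its own limits on frame counts that are not covered here.

```python
def clip_seconds(total_frames: int, frame_rate: float) -> float:
    """Duration in seconds for a given frame count and frame rate."""
    if frame_rate <= 0:
        raise ValueError("frame_rate must be positive")
    return total_frames / frame_rate


def frames_for(seconds: float, frame_rate: float) -> int:
    """Nearest whole frame count for a target duration."""
    if frame_rate <= 0:
        raise ValueError("frame_rate must be positive")
    return round(seconds * frame_rate)


print(clip_seconds(120, 24))  # 120 frames at 24 fps is 5.0 seconds
print(frames_for(5, 24))      # a 5-second clip at 24 fps needs 120 frames
```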
Auto B-Roll (Pexels)
Uses your existing subtitle clips to find matching stock video and place it on the timeline.
FFmpeg
Run custom FFmpeg commands on your media files.
Note: Input files can be any size, but output files are capped at 2GB to prevent browser issues.
Click a file to insert its name into the command.
Filenames with spaces are quoted automatically.
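To show why quoting matters for filenames with spaces, here is a small sketch using Python's standard `shlex` module. The helper `quote_command` and the file names are hypothetical, and the editor may use a different quoting style (for example double quotes); this only demonstrates the general idea with POSIX-style quoting.

```python
import shlex
from typing import List


def quote_command(args: List[str]) -> str:
    """Join arguments into one command line, quoting any that need it."""
    return shlex.join(args)


command = quote_command(
    ["ffmpeg", "-i", "my clip.mp4", "-c:v", "libx264", "out.mp4"]
)
print(command)  # ffmpeg -i 'my clip.mp4' -c:v libx264 out.mp4
```

Without the quotes, a shell-style parser would read `my` and `clip.mp4` as two separate arguments and the input file would not be found.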
Higher quality = larger file size. For ProRes, this selects different quality levels.
Available for WebM exports. Empty canvas areas stay transparent.
Use Mono if your exported video plays audio through only one speaker.
Video Settings
Smaller files with a little less detail. Recommended for smooth, reliable WebM exports.
Audio Settings
Select a platform preset to configure the recommended settings automatically.
Export Notice
Keep this tab open and the browser window visible while exporting. The window can be in the background, but minimizing it may pause the export on some browsers.
Export Timeline (FCP 7 XML)
Compatible with Premiere Pro, Final Cut Pro 7, and DaVinci Resolve.
Clips with effects, masks, or text will be rendered as video when exporting with media.
About VidTL Editor
VidTL
A browser-based video editor.
Copyright 2026 VidTL