Turn any audio file or video into a captioned video with HeyGen's automatic subtitle generator. The subtitle tool generates subtitles automatically, lets you style each caption, translate, and export accurate subtitles in minutes.

Features of the auto subtitle generator
Accurate AI transcription on audio
Upload any audio file or video in MP4, MOV, WAV, or MP3 and the subtitle tool will automatically generate subtitles on every line. Speech recognition detects words, punctuation, and speakers, so you spend less time fixing transcripts and get accurate captions for your videos.

Translate captions into 175+ languages
Make subtitles for video in any market, then push your captioned video global without re-recording. Choose the language and use the AI video translator to translate captions across 100 languages and more, helping you get your message across YouTube, TikTok, and LinkedIn.

Online subtitle editor and styling
Match your caption to your brand without leaving the editor. The online subtitle editor lets you edit fonts, colors, sizes, position, caption templates, and animation in one place. Try kinetic captions for TikTok or clean two-line video subtitles for training inside the AI video editor.

Multi-speaker detection and clean edits
Create subtitles for interviews, podcasts, and panel videos. The tool generates automatic subtitles with speaker tags, lets you edit your subtitles, and fine-tune any phrase. Tap the timeline to nudge the start and end time, split a line, or merge lines for tighter pacing.

Export to SRT, VTT, TXT, or MP4
Download an SRT file, VTT, or TXT transcript for YouTube, your LMS, or your video editor. Or download your video as MP4 with burned-in captions for Reels, Shorts, and ads. The subtitle generator keeps closed captions in sync everywhere you publish.

Use cases
Most social video viewers watch without sound. Add word-synced captions to your videos so the hook lands and keep viewers engaged. Generate, style, and export ready-to-post vertical video in minutes, formatted for every feed and aspect ratio you need.
Upload your podcast or interview and download a subtitle file for YouTube. Captions to videos improve watch time, viewing experience, and search ranking. Pair with the YouTube video translator to translate captions and grow international subscribers from one edit.
Make your video content accessible without manual transcribing or video editing crews. Add subtitles to onboarding clips, SOP walkthroughs, and policy refreshers. Match the workflow with HeyGen's training video toolkit and reach a multilingual, global workforce.
Course completion climbs when lessons are easier to follow on every screen. Caption many videos at once by adding subtitles to your video catalog, then translate. Use the course builder to ship captioned, multilingual lessons without re-recording.
Captioned ads outperform muted-frame video ads on every social platform. Burn branded subtitles for video into your hero cut, then spin localized variants. Repurpose one master into dozens of cuts with the AI ad maker for paid and organic.
Turn one long episode into short clips with captions ready for every social channel. Generate accurate transcripts, pull quote highlights, and publish across feeds in hours. The audio to video workflow handles audio-only podcast files too.
How it works
Add subtitles to a video in four steps, from raw upload to a polished, captioned, share-ready export.
Drop in an MP4, MOV, WAV, or MP3 file. The generator detects audio and starts transcribing.
AI transcribes audio, time-codes every line, and generates subtitles automatically by speaker.
Pick fonts, animations, colors, and position. Click any word to fix wording or retime in seconds.
Download as SRT, VTT, TXT, or MP4 with burned-in captions. Translate into 175+ languages instantly.




An auto subtitle generator is an AI caption generator that transcribes speech in a video and outputs time-coded subtitles. You upload a video, the tool runs speech recognition automatically, and returns accurate captions you can edit, translate, or export as SRT or burn into MP4.
Accuracy lands above 95% on clear English audio and stays strong on accents, noisy backgrounds, and technical terms. You can fine-tune any word by clicking it, then align the text on the timeline. The system also flags low-confidence phrases for review before export.
Yes. The free video plan lets you auto generate subtitles and export accurate subtitles on shorter videos. Paid plans unlock longer videos, more languages, watermark-free exports, brand kits, and team collaboration on the HeyGen plan page.
Yes. After you generate subtitles for your video, you can translate captions into 175+ languages and dialects in one click. The output keeps line-level timing intact, so you can drop translated subtitles into the video and download an SRT or MP4 with burned-in captions.
You can download SRT, VTT, and TXT transcripts for YouTube, social platforms, video editors, or LMS systems. You can also export an MP4 with captions burned in directly. SRT covers most platforms, while VTT is the cleanest pick for HTML5 video players.
Yes. Every word is editable, every line is movable, and every timing point snaps to the waveform. Click a word to fix wording, drag a line to retime it, or merge two lines for tighter pacing. The most used subtitle templates save as presets.
Yes. The subtitle editor detects voice changes, separates speakers, and tags each line. You can rename speakers, color-code them, or merge lines from the same speaker. This is the cleanest fit for interviews, panels, podcasts, and webinar recordings.
A typical 5 to 10 minute video transcribes in under two minutes. Longer videos scale linearly, with most one-hour podcasts finishing in 5 to 10 minutes. You stay in the editor the whole time and can style captions directly as soon as the first lines appear.
Use the timeline to nudge any line by milliseconds. You can drag the line forward or back, split a line at a cursor point, or merge two lines that were chunked too tightly. The waveform under each block makes it easy to align the text to the spoken word.
Yes. You control font, size, color, weight, stroke, background, position, and animation. Save subtitle templates so every video matches your brand. Try kinetic captions for short-form social, or clean two-line blocks for training. Animations sync to the spoken word.
Yes. If you already have an SRT or VTT subtitle file, upload it and the editor maps every line to the video's audio. You can then restyle, retime, or translate your subtitles into almost any language. This is a fast path for re-using existing transcripts on a new edit.
Generate subtitles, pick a 9:16 format, style the captions for short-form, and export the video using MP4 with subtitles burned in. The output is ready to upload to TikTok, Reels, or Shorts. You can also download an SRT and use each platform's native caption upload.
HeyGen pairs auto subtitles with AI voice cloning, AI dubbing, and translation across 175+ languages. Other subtitle tools, including Canva, stop at captions. HeyGen ships the full localized video creation workflow from the same project.
Yes. Würth Group used HeyGen to translate a 65-minute presentation into 8 languages in 4 days and cut translation costs by 80% (case study). Captioning paired with translation scales reach without extra production headcount, even on long-form content.
Explore more AI powered tools
Bring any photo to life with hyper‑realistic voice and movement using Avatar IV.
Transform your ideas into professional videos with AI.
