Build scalable video infrastructure with the HeyGen API

Reduce production costs and time by 95%; create, translate, and scale videos in more than 175 languages and dialects.

110M+

Total videos created

15M+

Total videos translated

175+

Languages

99.8%

API uptime

The world's leading companies trust HeyGen
company logo 1
company logo 2
company logo 3
company logo 4
company logo 5
company logo 6
company logo 7
company logo 8
company logo 9
company logo 10
company logo 11
company logo 12
company logo 13
company logo 14
company logo 15
company logo 16
company logo 17
company logo 18
company logo 19
company logo 20
company logo 21
company logo 22
company logo 23
company logo 24
company logo 25
company logo 26
company logo 27
company logo 28
company logo 29
company logo 30
company logo 31
company logo 32
company logo 33
company logo 34
company logo 35
company logo 36
Featured solutions

Enterprise-grade video intelligence

Build robust video infrastructure with enterprise-grade API and AI capabilities designed for scale, automation, and global reach.

Proofreading API

Before translating your video, quickly review and edit the transcript so that your message remains accurate and clear.

Video Translation API

Localise training and product launches in 175+ languages and dialects with 99% lip-sync accuracy.

Video Agent API

Transform your internal wiki or knowledge base into engaging, expressive videos with AI-powered text-to-video.

Video Generation API

Automate the creation of avatar-led onboarding and L&D videos without needing cameras, studios, or production teams.

Text-to-Speech API

State-of-the-art text to speech API with excellent consistency, low latency and precise emotional control.

Template API

Generate personalised videos at scale using HeyGen’s flexible, editable templates.

Simple for developers. Fast for teams to deliver

Generate your video in minutes with our easy-to-use REST API.

curl--request POST \
--url https://api.heygen.com/v2/video_translate \
--header 'accept: application/json' \
--header 'content-type: application/json' \
--header 'x-api-key: <your-api-key>' \
--data '
{
"translate_audio_only":"false",
"keep_the_same_format":"false",
"mode":"precision"
}
'
Use cases

Programmatic video for every department, tailored to your needs

See how teams build scalable video workflows on HeyGen’s global video intelligence infrastructure.

Why HeyGen

Enterprise API security, reliability, and control

HeyGen provides enterprises with a secure, reliable video infrastructure, designed to scale AI video while meeting the security, uptime, and control needs of global teams.

Security

SOC 2 Type II and GDPR

Your data security is our highest priority. HeyGen is independently audited and certified for SOC 2 Type II and GDPR compliance, ensuring your information remains protected to the most stringent standards.

Reliability

99.8% uptime

HeyGen is designed for reliable performance with 99.8% API uptime, ensuring your video infrastructure remains available and your automated video workflows run smoothly without interruption.

Support

Dedicated assistance

Enterprise customers receive direct access to dedicated support engineers who help ensure smooth deployments, quick issue resolution, and reliable video operations at scale.

Control

Role-based access control

Manage permissions with role-based access control (RBAC), giving teams the ability to securely assign access, maintain governance, and control how video workflows are created and managed.

Get enterprise discounted rate

We want to be the best partner to grow with you. We offer discounted rate to our enterprise customers to support your scaling

HeyGen Skills: Train your AI agent on best practices for using the HeyGen API & MCP

CTA background
icon
icon
icon
icon
icon

Integrate AI video generation into creator workflows

Our enterprise API integration brings the benefits of AI video directly into your creator workflows, enabling you to easily generate high-quality videos quickly and efficiently, without the inconvenience of switching between different tools.

Explore all integrations
icon
icon
icon
icon
icon
GDPRGDPR
SOC 2 TYPE IISOC 2 TYPE II
CCPACCPA
AI ACTAI ACT
DPFDPF

Certified to meet global security and compliance standards

Have questions? We have the answers

The key difference lies in how you balance automation with detailed, granular control.

The Video Agent API takes a single text prompt and triggers the autonomous orchestration of avatar creation, script writing, and visual asset creation and layouts. It offers a full range of precise control while also allowing complete creative freedom. Ideal for large-scale content exploration, internal video creation and automation. It is a truly unique offering in the entire industry.

In comparison, the Standard Video Generation APIs have two main parts: 1) Avatar Video Generation and 2) Template Video Generation. Developers create Avatars and Video Templates using HeyGen’s web platform, which can then be used by the API. Even though it requires more setup, these APIs provide the precise control needed for brand-consistent, high-production-value assets. Enterprise customers have created millions of videos through them to automate their content pipelines.

Yes, here are the steps to use the Photo Avatar API

  1. Upload Existing Photos (via Upload Asset API)
  2. Create an Avatar Group: Create an Avatar Group by organising photos of the same subject together.
  3. (optional) Train the Avatar Group: Once your Avatar Group is created, you can train the model to recognise the subject's unique features, expressions, and other elements to ensure it generates realistic avatars.
  4. If you wish to use Template Video Generation to create “personalised” videos at scale, you can use or replace the avatar by following this guide.

If you want to use the Avatar Video Generation, you can use or plug in the avatar by following this guide.

Yes, the API supports pure text-to-avatar generation through a structured descriptive framework that removes the need for external image assets. By providing specific parameters across eight required fields—including age, gender, ethnicity, and style—the AI creates a unique, high-resolution persona. For example, selecting "East Asian" ethnicity with a "Professional" style and "Cinematic" lighting will prompt the engine to return a selection of unique Avatars and Looks, effectively enabling enterprises to scale diverse cast libraries that do not exist in the real world.

You can follow the guide here for prompt-to-avatar.

The template system is designed for high-efficiency "mail-merge" style video production, where a master layout acts as a container for dynamic data. Users first create or select a template via the Dashboard or API, then identify specific placeholders for text, images, or audio. By sending a single POST request to the template's generate endpoint with a JSON payload of variables, the system automatically renders unique video files for each recipient, making it a leading solution for personalised sales outreach and customised customer onboarding at scale.

To ensure the highest level of realism and lip-sync accuracy, the recommended “Golden Path” is to programmatically retrieve and use the default_voice_id associated with a specific avatar. This approach guarantees that the vocal characteristics—such as gender, tone, and regional accent—are already optimised for that avatar’s visual persona, significantly reducing the risk of “uncanny valley” effects. If a custom voice is required, developers should always filter the v2/voices list to match the avatar’s metadata, so that audiovisual consistency is maintained.

Because high-fidelity AI video rendering is a resource-intensive process that can take several minutes, the API is designed to be used with an asynchronous, event-driven architecture via Webhooks. Instead of holding an open connection (which can lead to timeouts), your application should register a webhook URL to receive an automated "push" notification once the avatar_video.success event is triggered. This allows your backend to remain performant while processing the video—via the provided video_url—only when it becomes available.

The API offers extensive global reach, supporting over 40 languages and a library of more than 300 diverse voices, enabling smooth cross-border communication. Beyond simple text-to-speech in different languages, the platform provides "Video Translation" features that can take an existing video and translate the audio while simultaneously re-syncing the avatar’s lip movements to the new language. This ensures that the visual performance looks just as authentic in Spanish or Japanese as it did in the original English recording.

HeyGen Video Translation can support 175 languages and dialects (as referenced here).

Start creating videos with AI

See how businesses like yours scale up content creation and drive growth with the most innovative AI video solution.

CTA background