API Reference

Generate LipDub

Generate a lip-synced video using the specified shot and audio.

This endpoint initiates the generation of a LipDub by combining a trained shot with the provided audio. The process is asynchronous and returns a generate_id that can be used to track progress and download the result.

You can use the same shot_id to generate multiple different LipDubs by providing different audio files - this allows you to create multiple versions of the same video with different voiceovers or languages.

NB: The generated video maintains the same video specs (resolution, frame rate, codec, colorspace, etc.) as the uploaded video. If you upload a 4K video, the output is always 4K. Outputs above HD resolution (1080p) may be more expensive than HD. If you want HD outputs via the API, you'll need to upload and train your AI model with HD footage.

Example Request

{
  "audio_id": "audio_123",
  "output_filename": "my_lipdub.mp4",
  "language": "en-US",
  "maintain_expression": false
}

Example Response

{
  "generate_id": 456
}
Language
Credentials
Header
Click Try It! to start a request and see the response here!