What files do you accept for voice cloning?

  • MP3 at 192kbps or higher
  • 1-2 minutes of good audio for Instant Voice Cloning
  • 30-180 minutes of good audio for Professional Voice Cloning

For both Instant Voice Cloning and Professional Voice Cloning, we accept a wide range of file types, but we strongly recommend MP3 at a bitrate of 192kbps or above. Using an uncompressed format such as WAV will yield little to no improvement. Focus instead on the quality of the recording itself. For example, it should be recorded professionally, with a single speaker, no background noise or room reverb, a consistent volume and tone, and no extremely long gaps of silence.
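
If you want to check a sample against these recommendations before uploading, the logic could be sketched roughly as follows. This is only an illustration: `check_sample`, `MIN_BITRATE_KBPS`, and `DURATION_RANGES` are hypothetical names, not part of any official API, and the thresholds are simply the figures listed above. The sketch assumes you have already read the duration and bitrate of your file with an audio tool of your choice.

```python
# Hypothetical pre-upload check. The thresholds mirror the guidance above;
# none of these names belong to an official API.

MIN_BITRATE_KBPS = 192  # recommended minimum MP3 bitrate

# Recommended duration ranges in seconds for each cloning type.
DURATION_RANGES = {
    "ivc": (1 * 60, 2 * 60),     # Instant Voice Cloning: 1-2 minutes
    "pvc": (30 * 60, 180 * 60),  # Professional Voice Cloning: 30-180 minutes
}


def check_sample(duration_seconds, bitrate_kbps, mode):
    """Return a list of warnings; an empty list means the sample fits the guidance."""
    if mode not in DURATION_RANGES:
        raise ValueError(f"unknown mode: {mode!r}")

    low, high = DURATION_RANGES[mode]
    warnings = []

    if bitrate_kbps < MIN_BITRATE_KBPS:
        warnings.append(
            f"bitrate {bitrate_kbps}kbps is below the recommended {MIN_BITRATE_KBPS}kbps"
        )
    if duration_seconds < low:
        warnings.append(f"audio is shorter than the recommended {low // 60} minutes")
    elif duration_seconds > high:
        warnings.append(f"audio is longer than the recommended {high // 60} minutes")

    return warnings


if __name__ == "__main__":
    # A 90-second sample at 192kbps for Instant Voice Cloning.
    print(check_sample(90, 192, "ivc"))
```

A check like this only covers length and bitrate. It cannot tell you whether the recording is free of background noise, reverb, or extra speakers, which matters more for the final result.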

For more information regarding cloning, we highly recommend that you read our documentation for Instant Voice Cloning (IVC) and Professional Voice Cloning (PVC).